Models excelling at highly structured mathematical benchmarks exhibit a 'unified capability substrate' enabling dominance in seemingly unrelated fields (coding, logical reasoning, scientific discovery) — the 'mathematical skeleton' of the technological...
Predictor: Alex Wissner-Gross
Prediction text
Models excelling at highly structured mathematical benchmarks exhibit a 'unified capability substrate' enabling dominance in seemingly unrelated fields (coding, logical reasoning, scientific discovery) — the 'mathematical skeleton' of the technological singularity. Autonomous agents will seamlessly interface with environmental + biological sensors to continuously monitor, model, and manipulate physical reality via this cross-domain transfer. | Next frontier-model cross-domain benchmark release
Key catalyst: Next frontier-model cross-domain benchmark release
Watch events: Cross-domain-transfer benchmarks (BIG-Bench, GPQA Diamond); physical-world-model scaling
Resolution evidence
GPT-5 / Claude 4 / Gemini 3 cross-domain benchmarks (math + coding + science) empirically validate transfer. AlphaFold → AlphaProof → AlphaGeometry demonstrate unified-substrate.
Predictor: Alex Wissner-Gross
Calibration plot (stated vs observed)
Evidence about this node from Alex Wissner-Gross is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-04-24hitFrontier model exceeds 50% on FrontierMath (math benchmark)How: Any frontier LLM crosses 50% accuracy on Epoch AI's FrontierMath benchmarkSource: https://epoch.ai/benchmarks/frontiermath-tier-4 — GPT-5.5 Pro 52.4%, GPT-5.5 51.7% as of April 2026conf 99%
- 2026-09-15pendingQ1 window check-in (25%)
- 2026-04-01 → 2027-06-30pendingSame model leads both math + SWE-bench coding leaderboards simultaneouslyHow: A single model variant simultaneously occupies top-3 on both FrontierMath and SWE-bench Verified leaderboardsSource: Epoch AI, SWE-bench, Scale Labs leaderboardsconf 80%Notes: Direct evidence of 'unified capability substrate' — math leadership transfers to coding.
- 2027-05-30pendingQ2 window check-in (50%)
- 2026-06-01 → 2028-12-31pendingAI model produces peer-reviewed scientific discovery in non-CS fieldHow: Peer-reviewed paper attributes a novel discovery (in chemistry, biology, physics, or math) primarily to a frontier LLM/agent systemSource: Nature, Science, peer-reviewed journals tracking AI co-authorshipconf 65%Notes: Tests cross-domain transfer to scientific discovery — second pillar of the claim.
- 2028-02-11pendingQ3 window check-in (75%)
- 2027-01-01 → 2029-10-31pendingAI agent integrates with biological/environmental sensor stack in published studyHow: Published research demonstrates AI agent autonomously interfacing with biological or environmental sensor network to monitor and act on physical realitySource: arxiv, IEEE proceedings, robotics journalsconf 55%Notes: 'Manipulate physical reality via cross-domain transfer' element of the claim.
- 2027-06-01 → 2029-11-30pendingComposite cross-domain leaderboard launched (math+code+science+reasoning)How: Major eval org (Epoch, METR, Stanford HAI) publishes composite cross-domain benchmark with at least one model scoring ≥80%Source: Stanford AI Index 2027/2028, Epoch AIconf 45%Notes: Cascade — formal recognition of 'unified capability substrate' as measurable thing.
No downstream cascades — this prediction is a leaf in the dependency graph.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
Top outgoing (children)
Predictions THIS node influences
No outgoing edges.
Ticker exposure
Beneficiaries (6)
Adverse (4)
Prerequisites (6)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| correlate | S_AGI_MID_2029 | AGI mid: Kurzweil 2029 path | agi_general_capability | — |
| correlate | S_AGI_FAST_2027 | AGI fast: drop-in remote worker by 2027-09 | agi_general_capability | — |
| correlate | S_AGI_SLOW_2031 | AGI slow: Schmidt/Hassabis 5-10 year path | agi_general_capability | — |
| correlate | S_AGI_WINTER_2036PLUS | AGI delayed: capability plateau or AI winter | agi_general_capability | — |
| killer | TK11 | Autonomous Regulatory Block (Level 4 Halt) | — | — |
| killer | TK06 | China-Taiwan Military Conflict | — | — |
Dependents (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No dependents | ||||
Validations (1)
| Observed at | Status | By | Notes |
|---|---|---|---|
| 2026-04-29 | partial | thesis_timeline_v1.0_import | GPT-5 / Claude 4 / Gemini 3 cross-domain benchmarks (math + coding + science) empirically validate transfer. AlphaFold → AlphaProof → AlphaGeometry demonstrate unified-substrate. |
Linked documents (10)
Raw metadata
{
"nia": false,
"mode": "FORECAST",
"role": "Cited-Other",
"context": "Extends SEM_032 (Wissner-Gross Clay Millennium) and 248_002 (LEO-to-phone). Specific cross-domain-capability-substrate framing.",
"to_year": 2029,
"conv_cues": "coined framing; singularity-mathematical-skeleton",
"direction": "HAPPEN",
"from_year": 2026,
"timeframe": "2026-2029",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "Frontier model exceeds 50% on FrontierMath (math benchmark)",
"source": "https://epoch.ai/benchmarks/frontiermath-tier-4 — GPT-5.5 Pro 52.4%, GPT-5.5 51.7% as of April 2026",
"status": "hit",
"weight": 0.4,
"ordinal": -9,
"source_id": null,
"confidence": 0.99,
"source_url": "https://epoch.ai/benchmarks/frontiermath-tier-4",
"expected_date": "2026-04-30",
"observed_date": "2026-04-24",
"research_origin": "deep_research",
"measurement_criterion": "Any frontier LLM crosses 50% accuracy on Epoch AI's FrontierMath benchmark"
},
{
"kind": "quartile_checkpoint",
"label": "Q1 window check-in (25%)",
"status": "pending",
"weight": 0.05,
"ordinal": -8,
"source_id": null,
"expected_date": "2026-09-15",
"observed_date": null
},
{
"kind": "llm_pre_event",
"label": "Same model leads both math + SWE-bench coding leaderboards simultaneously",
"notes": "Direct evidence of 'unified capability substrate' — math leadership transfers to coding.",
"source": "Epoch AI, SWE-bench, Scale Labs leaderboards",
"status": "pending",
"weight": 0.4,
"ordinal": -7,
"source_id": null,
"confidence": 0.8,
"source_url": "https://labs.scale.com/leaderboard",
"expected_date": "2026-11-14",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2027-06-30",
"from": "2026-04-01"
},
"measurement_criterion": "A single model variant simultaneously occupies top-3 on both FrontierMath and SWE-bench Verified leaderboards"
},
{
"kind": "quartile_checkpoint",
"label": "Q2 window check-in (50%)",
"status": "pending",
"weight": 0.05,
"ordinal": -6,
"source_id": null,
"expected_date": "2027-05-30",
"observed_date": null
},
{
"kind": "llm_pre_event",
"label": "AI model produces peer-reviewed scientific discovery in non-CS field",
"notes": "Tests cross-domain transfer to scientific discovery — second pillar of the claim.",
"source": "Nature, Science, peer-reviewed journals tracking AI co-authorship",
"status": "pending",
"weight": 0.4,
"ordinal": -5,
"source_id": null,
"confidence": 0.65,
"expected_date": "2027-09-16",
"research_origin": "training",
"expected_date_range": {
"to": "2028-12-31",
"from": "2026-06-01"
},
"measurement_criterion": "Peer-reviewed paper attributes a novel discovery (in chemistry, biology, physics, or math) primarily to a frontier LLM/agent system"
},
{
"kind": "scenario_signal",
"label": "Scenario fires: AGI fast: drop-in remote worker by 2027-09",
"status": "pending",
"weight": 0.3,
"ordinal": -4,
"source_id": "S_AGI_FAST_2027",
"expected_date": "2027-09-30",
"observed_date": null
},
{
"kind": "quartile_checkpoint",
"label": "Q3 window check-in (75%)",
"status": "pending",
"weight": 0.05,
"ordinal": -3,
"source_id": null,
"expected_date": "2028-02-11",
"observed_date": null
},
{
"kind": "llm_pre_event",
"label": "AI agent integrates with biological/environmental sensor stack in published study",
"notes": "'Manipulate physical reality via cross-domain transfer' element of the claim.",
"source": "arxiv, IEEE proceedings, robotics journals",
"statu
... (truncated)