Everyone is underestimating the next year of AI improvements (more than 2x gain)
Predictor: Dave Blundin · ep#240 "NVIDIA's $1 Trillion Prediction, Anthropic Beats OpenAI, Tesla vs. TSMC & The CS Job Collapse" · source
Prediction text
Everyone is underestimating the next year of AI improvements (more than 2x gain) | that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about?
Verbatim quote
that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about?
Predictor: Dave Blundin
Calibration plot (stated vs observed)
Evidence about this node from Dave Blundin is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-04-15hitARC-AGI-2 doubles year-over-yearHow: Best ARC-AGI-2 score reaches 77.1% (Gemini 3.1 Pro), double its predecessorSource: LM Council Benchmarks — 'Gemini 3.1 Pro 77.1% on ARC-AGI-2, double predecessor'conf 95%
- 2026-04-23hitGPT-5.5 ships with massive coding/reasoning benchmark gainsHow: OpenAI ships GPT-5.5 scoring 82.7% on Terminal-Bench 2.0 (up from 75.1% GPT-5.4) and 74.0% on 1M-token MRCR v2 (up from 36.6%)Source: Artificial Analysis — 'OpenAI's GPT-5.5 is the new leading AI model'conf 99%Notes: HIT — direct evidence of >2x improvement on long-context reasoning benchmark.
- 2026-04-30hitHumanity's Last Exam top score crosses 50% thresholdHow: Top frontier model achieves >=50% on Humanity's Last Exam vs 38.3% Stanford 2025 baseline (Claude Opus 4.6 / Gemini 3.1 Pro)Source: IEEE Spectrum — 'Stanford's AI Index 2026' + LM Council Benchmarksconf 95%
- 2026-06-01 → 2027-06-30pendingAI investment exceeds $700B annualized run-rateHow: Stanford AI Index or equivalent confirms 2026 AI investment >=$700B (vs $581B in 2025, $253B in 2024) — ~20% YoY growth above already-record baseSource: Build Fast — 'Best AI Models April 2026' + Stanford Index trajectoryconf 70%
- 2027-04-01 → 2027-12-31pendingTop-tier benchmark performance growth >=2x by mid-2027How: Top-of-leaderboard scores on SWE-bench Verified, ARC-AGI-2, and HLE all show >=2x improvement vs April 2026 baselinesSource: Trend extrapolation from 2025-2026 capability curveconf 55%Notes: Validates Blundin's '>2x next year' thesis if borne out across major benchmarks.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| killer | TK09 Energy Grid Cap (Data Center Power Wall) | 35.0% | 0.050 | 0.600 | -0.088 |
| prereq | SEM_015 Nvidia agreed to remit 15% of China chip-sale revenue direct — Jensen Huang | 66.3% | 0.600 | 0.050 | -0.076 |
| prereq | SEM_027 Nvidia Data Center revenue +66% YoY, contributing ~90% of $5 — Joseph Moore | 68.3% | 0.600 | 0.050 | -0.075 |
| killer | TK05 Rate Regime Persistence (10y > 5% through 2028) | 30.0% | 0.050 | 0.600 | -0.060 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.600 | +0.050 |
Top outgoing (children)
Predictions THIS node influences
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 248_033 Superhuman AI will make BCI-enhanced humans irrelevant compa — Dave Blundin | 36.7% | 0.600 | 0.050 | -0.049 |
| prereq | 242_031 Most large companies' business models will be disrupted in 2 — Peter Diamandis | 36.1% | 0.650 | 0.050 | -0.018 |
| prereq | 232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis | 35.5% | 0.700 | 0.050 | +0.013 |
| prereq | 244_019 Peter's son won't need a driver's license in 2 years — Peter Diamandis | 48.4% | 0.920 | 0.050 | -0.010 |
| prereq | 230_020 Peter's 14-year-old son Milan will never get a driver's lice — Peter Diamandis | 34.7% | 0.650 | 0.050 | -0.004 |
Ticker exposure
Beneficiaries (24)
Adverse (6)
Prerequisites (10)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | SEM_011 | Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips. | Capital Markets | — |
| prereq | SEM_027 | Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon. | Capital Markets | — |
| prereq | SEM_014 | Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025). | Manufacturing | — |
| prereq | SEM_012 | Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering. | AI/Manufacturing | — |
| prereq | SEM_015 | Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans. | Policy/Semis | — |
| killer | TK09 | Energy Grid Cap (Data Center Power Wall) | — | — |
| killer | TK05 | Rate Regime Persistence (10y > 5% through 2028) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK02 | AI Compute Supply Shock (TSMC/Taiwan Disruption) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (5)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 244_019 | Peter's son won't need a driver's license in 2 years | Auto/Transport | — |
| prereq | 232_055 | We're exiting the industrial age permanently as recursive self-improvement unfolds. | AI | — |
| prereq | 242_031 | Most large companies' business models will be disrupted in 2-5 years | Markets/Stocks | — |
| prereq | 230_020 | Peter's 14-year-old son Milan will never get a driver's license. | Auto/Transport | — |
| prereq | 248_033 | Superhuman AI will make BCI-enhanced humans irrelevant compared to AI 2 years from today. | AI | — |
Linked documents (6)
| Sim | Source | Title | Market prob | Polarity | Reviewed | Published |
|---|---|---|---|---|---|---|
| 0.620 | manifold | Will "Maybe I was too harsh on deep learning theory..." make the top fifty posts in LessWrong's 2026 Annual Review? | 11% | mentions | pending | 2026-05-11 |
| 0.618 | github_release | facebookresearch/neuroai v0.2.1 | — | mentions | pending | 2026-05-14 |
| 0.611 | arxiv | Negation Neglect: When models fail to learn negations in training | — | mentions | pending | 2026-05-13 |
| 0.605 | github_release | pytorch/pytorch v2.2.0 | — | mentions | pending | 2024-01-30 |
| 0.579 | manifold | Will "Forecasting is Way Overrated, and We Should S..." make the top fifty posts in LessWrong's 2026 Annual Review? | 20% | mentions | pending | 2026-04-26 |
| 0.566 | github_release | pytorch/pytorch v1.13.1 | — | mentions | pending | 2022-12-16 |
Raw metadata
{
"nia": false,
"qty": ">>2x",
"url": "https://www.youtube.com/watch?v=uOGHXAfvK8w",
"mode": "PREDICTION",
"role": "Host",
"context": "that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about? Yeah, it's not it's not going to happen that way.",
"to_year": 2027,
"verbatim": "that's also why everyone's really underestimating the next year because we did a 1000x like what are you expecting in the next year? Oh 2x like what are you talking about?",
"conv_cues": "not going to happen that way",
"direction": "DOWN",
"from_year": 2026,
"timeframe": "Next year (2026-2027)",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "ARC-AGI-2 doubles year-over-year",
"source": "LM Council Benchmarks — 'Gemini 3.1 Pro 77.1% on ARC-AGI-2, double predecessor'",
"status": "hit",
"weight": 0.4,
"ordinal": -9,
"source_id": null,
"confidence": 0.95,
"source_url": "https://lmcouncil.ai/benchmarks",
"expected_date": "2026-04-15",
"observed_date": "2026-04-15",
"research_origin": "deep_research",
"measurement_criterion": "Best ARC-AGI-2 score reaches 77.1% (Gemini 3.1 Pro), double its predecessor"
},
{
"kind": "llm_pre_event",
"label": "GPT-5.5 ships with massive coding/reasoning benchmark gains",
"notes": "HIT — direct evidence of >2x improvement on long-context reasoning benchmark.",
"source": "Artificial Analysis — 'OpenAI's GPT-5.5 is the new leading AI model'",
"status": "hit",
"weight": 0.4,
"ordinal": -8,
"source_id": null,
"confidence": 0.99,
"source_url": "https://artificialanalysis.ai/articles/openai-gpt5-5-is-the-new-leading-AI-model",
"expected_date": "2026-04-23",
"observed_date": "2026-04-23",
"research_origin": "deep_research",
"measurement_criterion": "OpenAI ships GPT-5.5 scoring 82.7% on Terminal-Bench 2.0 (up from 75.1% GPT-5.4) and 74.0% on 1M-token MRCR v2 (up from 36.6%)"
},
{
"kind": "prereq",
"label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
"status": "hit",
"weight": 0.5,
"ordinal": -7,
"source_id": "SEM_011",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
"status": "hit",
"weight": 0.5,
"ordinal": -6,
"source_id": "SEM_027",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
"status": "hit",
"weight": 0.5,
"ordinal": -5,
"source_id": "SEM_014",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
"status": "hit",
"weight": 0.5,
"ordinal": -4,
"source_id": "SEM_012",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "llm_pre_event",
"label": "Humanity's Last Exam top score crosses 50% threshold",
"source": "IEEE Spectrum — 'Stanford's AI Index 2026' + LM Council Benchmarks",
"status": "hit",
"weight": 0.4,
"ordinal": -3,
"source_id": null,
"confidence": 0.95,
"source_url": "https://spectrum.ieee.org/state-of-ai-index-2026",
"expected_date": "2026-04-30",
"observe
... (truncated)