← Cockpit
248_010predictionAIAI-scaling

AI capabilities will advance 100x in 2026 (by summer, potentially more).

Predictor: Dave Blundin · ep#248 "Sam Altman's Attack, Amazon vs. Starlink, and What Opus 4.7 Actually Means | #248" · source

Prior probability
35.0%
Current probability
31.0%
evolves via intake + LBP
Conviction
5/5
Signal quality
A
Resolution
pending
Window
2026-07-01 – 2026-07-31
Edges in / out
14 / 5
Tickers exposed
37

Prediction text

AI capabilities will advance 100x in 2026 (by summer, potentially more). | it locks in my prediction for the year was definitely going to be right. You know, I said this is going to be a 100x year. You know, we've been we've been doing 10x years for the last seven or eight years. is going to be 100x here. It's going to be 100x by summer. I'm going to I'm going to blow away that prediction. | Summer 2026 benchmark results

Key catalyst: Summer 2026 benchmark results

Verbatim quote

From episode "Sam Altman's Attack, Amazon vs. Starlink, and What Opus 4.7 Actually Means | #248"
it locks in my prediction for the year was definitely going to be right. You know, I said this is going to be a 100x year. You know, we've been we've been doing 10x years for the last seven or eight years. is going to be 100x here. It's going to be 100x by summer. I'm going to I'm going to blow away that prediction.

Resolution evidence

Status: pending

'100x by summer' (July 2026) is very aggressive from baseline April 2026.

Predictor: Dave Blundin

κ + Brier as of 2026-05-22
κ (discount)
0.821
Brier
0.0491
excellent
Hits / Misses
3 / 2
of 9 resolved
Hit rate
33.3%
Calibration plot (stated vs observed)

Evidence about this node from Dave Blundin is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

2 prob_history rows
0%25%50%75%100%prior 35%2026-04-302026-04-30
intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 31.0%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.
Leading chain: 6 fired ✓ · 3 pending
  1. 2026-04-15hitFrontier-model release sprint: 4 major launches in 6 weeks (Mar-Apr 2026)
    How: Within March-April 2026, GPT-5.4, Claude Mythos 5, Gemini 3.1, and Grok 4.20 all launch publicly
    Source: https://lmcouncil.ai/benchmarksconf 95%
  2. 2026-04-25hitGPT-5.5 launches with major reasoning/agent capability leap (Apr 2026)
    How: OpenAI publicly launches GPT-5.5 with disclosed major capability gains over GPT-5.4 on agent/reasoning benchmarks
    Source: https://openai.com/index/introducing-gpt-5-5/conf 95%
    Notes: HIT — on MRCR v2 at 512K-1M tokens, GPT-5.5 jumps to 74.0% from GPT-5.4's 36.6% (37-pt absolute gain).
  3. 2026-07-01 → 2026-07-31pendingSummer 2026 benchmark composite shows >=10x advance over Jan 2026 baseline
    How: Composite reasoning benchmark (e.g., METR-style time-horizon doubling, FrontierMath, GPQA) shows top model >=10x improvement vs Jan 2026 baseline (e.g., 4x time-horizon, 2x score = 8x effective)
    Source: Composite of Vals AI / LM Council / Epoch AI benchmark trackersconf 40%
    Notes: 10x is achievable in narrow benchmarks; 100x is the prediction's threshold.
  4. 2026-07-01 → 2026-07-31pendingFalsification: Summer 2026 benchmarks show <3x improvement (would be NEAR-MISS)
    How: Composite frontier benchmarks improve <3x over Jan 2026 baseline by July 2026
    Source: Benchmark trackersconf 40%
    Notes: Cascade — would falsify Blundin's 100x claim. Most likely outcome given measured 30-50pt benchmark gains.
  5. 2026-07-01 → 2026-08-31pending100x year claim publicly validated by >=1 independent analyst
    How: Independent analyst (e.g., Epoch AI, METR, Stanford HAI) publishes report concluding 2026 H1 advance is 50-100x prior years' rate
    Source: Industry reports + benchmark trackersconf 20%
    Notes: Hardest milestone — 100x is extreme claim and depends on metric chosen.

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.
(live posterior: 31%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first
LBP2026-04-30T16:39:51Z31.0%-1.6pp
Network propagation: 32.6% → 31.0%
5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3
LBP2026-04-30T02:18:57Z32.6%-2.4pp
Network propagation: 35.0% → 32.6%
5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact
All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

KindNodeTheir probP(c|s=T)P(c|s=F)Δ implied
killerTK09
Energy Grid Cap (Data Center Power Wall)
35.0%0.0500.350-0.065
prereqSEM_015
Nvidia agreed to remit 15% of China chip-sale revenue directJensen Huang
66.3%0.3500.050-0.059
prereqSEM_027
Nvidia Data Center revenue +66% YoY, contributing ~90% of $5Joseph Moore
68.3%0.3500.050-0.058
killerTK05
Rate Regime Persistence (10y > 5% through 2028)
30.0%0.0500.350-0.050
prereqSEM_012
Nvidia quadrupled chip production output while only doublingJensen Huang
75.0%0.3500.050-0.037

Top outgoing (children)

Predictions THIS node influences

KindNodeTheir probP(c|s=T)P(c|s=F)Δ implied
prereq244_019
Peter's son won't need a driver's license in 2 yearsPeter Diamandis
48.4%0.9200.050-0.169
prereq247_023
AI will be able to do everything a white collar worker does Dave Blundin
40.8%0.7200.050-0.154
prereq242_031
Most large companies' business models will be disrupted in 2Peter Diamandis
36.1%0.6500.050-0.128
prereq230_020
Peter's 14-year-old son Milan will never get a driver's licePeter Diamandis
34.7%0.6500.050-0.114
prereq232_055
We're exiting the industrial age permanently as recursive sePeter Diamandis
35.5%0.7000.050-0.107

Ticker exposure

37 ticker(s) linked

Beneficiaries (24)

MUWULFIRENEQIXALABAPLDASMIYASMLPLABNVDANBISCRWVAAPLAMTAMZNDELLGOOGLIRMLNVGYMETAMSFTORCLSFTBYSTX

Adverse (6)

ACNGENCHGGIBMWNSLRN

Prerequisites (14)

Predictions that must hit first
TypePredTitleDomainLag
prereqSEM_027Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.Capital Markets
prereqSEM_011Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.Capital Markets
prereqSEM_014Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).Manufacturing
prereqSEM_012Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering.AI/Manufacturing
prereqSEM_015Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans.Policy/Semis
correlateS_AGI_MID_2029AGI mid: Kurzweil 2029 pathagi_general_capability
correlateS_ASI_MID_2034ASI mid: Schmidt 'ASI in 6 years'asi_recursive_self_improvement
correlateS_AGI_WINTER_2036PLUSAGI delayed: capability plateau or AI winteragi_general_capability
correlateS_AI_PAUSE_2026Major-country AI pause beginning 2026ai_regulatory_pause
killerTK09Energy Grid Cap (Data Center Power Wall)
killerTK05Rate Regime Persistence (10y > 5% through 2028)
killerTK01AGI Capability Plateau (2026-27 Training Stall)
killerTK02AI Compute Supply Shock (TSMC/Taiwan Disruption)
killerTK03AI Regulatory Moratorium (EU/US Capability Freeze)

Dependents (5)

Predictions enabled by this
TypePredTitleDomainLag
prereq244_019Peter's son won't need a driver's license in 2 yearsAuto/Transport
prereq247_023AI will be able to do everything a white collar worker does imminentlyAI
prereq232_055We're exiting the industrial age permanently as recursive self-improvement unfolds.AI
prereq242_031Most large companies' business models will be disrupted in 2-5 yearsMarkets/Stocks
prereq230_020Peter's 14-year-old son Milan will never get a driver's license.Auto/Transport

Expected milestones (1)

From Sheet 17 Monitoring Triggers
Expected byDescriptionStatus
2026-07-31[Capability 2026-07] [248_010] Summer 2026 benchmark resultspending

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT
SimSourceTitleMarket probPolarityReviewedPublished
0.743manifold☀️What will happen in June 2026? [ADD RESPONSES]mentionspending2026-05-01
0.740manifold🌭What will happen in July 2026? [ADD RESPONSES]mentionspending2026-06-01
0.714manifoldJune 2026 AI model releasesmentionspending2026-05-28
0.705manifoldat the end of 2026, will an AI be able to generate a full high-quality tv ep to a prompt?35%mentionspending2026-05-31
0.694manifoldWill any China-domestic AI chip reach ≥80% of NVIDIA H100 perf on a public benchmark before 2026-12-31?14%mentionspending2026-05-04
0.692manifoldWill 1USD>97INR before 10th June 2026?6%mentionspending2026-06-04
0.690manifoldWho will have the highest sharpe ratio by June 20th 8 PM CEST [ADD ANSWERS] (2026)mentionspending2026-06-01
0.684polymarketWill Microsoft have the best AI model at the end of May 2026?0%mentionspending2026-04-13
0.682polymarketWill Meta have the best AI model at the end of May 2026?0%mentionspending2026-04-13
0.682manifoldWhat will be the best score (almost resolved) on ProgramBench at the end of 2026?mentionspending2026-05-06

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook
{
  "nia": false,
  "qty": "100x AI capability gain in one year",
  "url": "https://www.youtube.com/watch?v=LVvleNtllPk",
  "mode": "PREDICTION",
  "role": "Host",
  "context": "I said this is going to be a 100x year... It's going to be 100x by summer. I'm going to blow away that prediction.",
  "to_year": 2026,
  "verbatim": "it locks in my prediction for the year was definitely going to be right. You know, I said this is going to be a 100x year. You know, we've been we've been doing 10x years for the last seven or eight years. is going to be 100x here. It's going to be 100x by summer. I'm going to I'm going to blow away that prediction.",
  "conv_cues": "definitely going to be right; blow away that prediction",
  "direction": "UP",
  "from_year": 2026,
  "timeframe": "Summer 2026",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "llm_pre_event",
      "label": "Frontier-model release sprint: 4 major launches in 6 weeks (Mar-Apr 2026)",
      "source": "https://lmcouncil.ai/benchmarks",
      "status": "hit",
      "weight": 0.4,
      "ordinal": -9,
      "source_id": null,
      "confidence": 0.95,
      "source_url": "https://lmcouncil.ai/benchmarks",
      "expected_date": "2026-04-15",
      "observed_date": "2026-04-15",
      "research_origin": "deep_research",
      "measurement_criterion": "Within March-April 2026, GPT-5.4, Claude Mythos 5, Gemini 3.1, and Grok 4.20 all launch publicly"
    },
    {
      "kind": "llm_pre_event",
      "label": "GPT-5.5 launches with major reasoning/agent capability leap (Apr 2026)",
      "notes": "HIT — on MRCR v2 at 512K-1M tokens, GPT-5.5 jumps to 74.0% from GPT-5.4's 36.6% (37-pt absolute gain).",
      "source": "https://openai.com/index/introducing-gpt-5-5/",
      "status": "hit",
      "weight": 0.4,
      "ordinal": -8,
      "source_id": null,
      "confidence": 0.95,
      "source_url": "https://openai.com/index/introducing-gpt-5-5/",
      "expected_date": "2026-04-25",
      "observed_date": "2026-04-25",
      "research_origin": "deep_research",
      "measurement_criterion": "OpenAI publicly launches GPT-5.5 with disclosed major capability gains over GPT-5.4 on agent/reasoning benchmarks"
    },
    {
      "kind": "prereq",
      "label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -7,
      "source_id": "SEM_011",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -6,
      "source_id": "SEM_027",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -5,
      "source_id": "SEM_014",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -4,
      "source_id": "SEM_012",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans.",
      "status": "pending",
      "weight": 0.5,
      "ordinal": -3,
      "source_id": "SEM_015",
      "expected_date": "2026-06-25",
      "observed_date": null
    },
    {
      "k
... (truncated)