GPT-5.5 Spud will have equal or greater capability to Mythos.
Predictor: Peter Diamandis · ep#246 "SpaceX Goes Public, Claude's Mythos Release, and the US Data Center Delay | EP #246" · source
Prediction text
GPT-5.5 Spud will have equal or greater capability to Mythos. | it's also been cited that spud will be of equal capability to mythos or more.
Verbatim quote
it's also been cited that spud will be of equal capability to mythos or more.
Predictor: Peter Diamandis
Calibration plot (stated vs observed)
Evidence about this node from Peter Diamandis is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-04-23hitGPT-5.5 ('Spud') releases on or before April 23, 2026How: OpenAI ships GPT-5.5 to consumers and API on or before April 23, 2026; this is the model Diamandis nicknamed 'Spud'Source: OpenAI — Introducing GPT-5.5 (April 23, 2026)conf 99%Notes: HIT — GPT-5.5 ('Spud' codename) shipped on schedule.
- 2026-04-23hitGPT-5.5 leads Claude Opus 4.7 ('Mythos') on Terminal-Bench 2.0How: GPT-5.5 achieves Terminal-Bench 2.0 score >= Claude Opus 4.7 in published benchmark comparisonSource: Build Fast With AI — GPT-5.5 vs Claude (82.7% vs 69.4% on Terminal-Bench)conf 95%Notes: HIT — GPT-5.5 leads Terminal-Bench by 13+ points and FrontierMath by 8 points. Confirms 'equal or greater' in coding/math.
- 2026-04-23hitOpenAI publishes GPT-5.5 system card with capability/safety scoresHow: OpenAI Deployment Safety Hub publishes GPT-5.5 system card with autonomous capability evaluations, dangerous capability tests, and safety mitigationsSource: OpenAI Deployment Safety Hub — GPT-5.5 system cardconf 99%Notes: HIT.
- 2026-04-25partialMixed evaluation: Tom's Guide / blind testing shows Claude Opus 4.7 still wins broader categoriesHow: Independent blind comparison (e.g., Tom's Guide multi-category test) shows Claude Opus 4.7 winning on writing/reasoning/multimodal vs GPT-5.5Source: Build Fast With AI — Tom's Guide tested GPT-5.5 vs Claude Opus 4.7 across 7 categories; GPT-5.5 lost in all 7conf 95%Notes: PARTIAL — depending on benchmark, GPT-5.5 either leads (Terminal-Bench/FrontierMath) or loses (Tom's Guide qualitative). Diamandis claim 'equal capability or more' is partially supported.
- 2026-06-01 → 2026-11-30pendingAnthropic ships Claude Opus 4.8 / 5.0 reclaiming benchmark leadHow: Anthropic releases successor to Claude Opus 4.7 reclaiming Terminal-Bench / FrontierMath / OSWorld leadership over GPT-5.5Source: Anticipated — Anthropic's typical 6-month release cadenceconf 75%Notes: Cascade — direct competitor response to Spud closing the gap to Mythos.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| killer | TK09 Energy Grid Cap (Data Center Power Wall) | 35.0% | 0.050 | 0.600 | -0.088 |
| prereq | SEM_015 Nvidia agreed to remit 15% of China chip-sale revenue direct — Jensen Huang | 66.3% | 0.600 | 0.050 | -0.076 |
| prereq | SEM_027 Nvidia Data Center revenue +66% YoY, contributing ~90% of $5 — Joseph Moore | 68.3% | 0.600 | 0.050 | -0.075 |
| killer | TK05 Rate Regime Persistence (10y > 5% through 2028) | 30.0% | 0.050 | 0.600 | -0.060 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.600 | +0.050 |
Top outgoing (children)
Predictions THIS node influences
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 247_023 AI will be able to do everything a white collar worker does — Dave Blundin | 40.8% | 0.720 | 0.050 | -0.031 |
| prereq | 242_031 Most large companies' business models will be disrupted in 2 — Peter Diamandis | 36.1% | 0.650 | 0.050 | -0.018 |
| prereq | 232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis | 35.5% | 0.700 | 0.050 | +0.013 |
| prereq | 244_019 Peter's son won't need a driver's license in 2 years — Peter Diamandis | 48.4% | 0.920 | 0.050 | -0.010 |
| prereq | 230_020 Peter's 14-year-old son Milan will never get a driver's lice — Peter Diamandis | 34.7% | 0.650 | 0.050 | -0.004 |
Ticker exposure
Beneficiaries (24)
Adverse (6)
Prerequisites (10)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | SEM_011 | Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips. | Capital Markets | — |
| prereq | SEM_027 | Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon. | Capital Markets | — |
| prereq | SEM_014 | Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025). | Manufacturing | — |
| prereq | SEM_012 | Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering. | AI/Manufacturing | — |
| prereq | SEM_015 | Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans. | Policy/Semis | — |
| killer | TK09 | Energy Grid Cap (Data Center Power Wall) | — | — |
| killer | TK05 | Rate Regime Persistence (10y > 5% through 2028) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK02 | AI Compute Supply Shock (TSMC/Taiwan Disruption) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (5)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 244_019 | Peter's son won't need a driver's license in 2 years | Auto/Transport | — |
| prereq | 247_023 | AI will be able to do everything a white collar worker does imminently | AI | — |
| prereq | 232_055 | We're exiting the industrial age permanently as recursive self-improvement unfolds. | AI | — |
| prereq | 242_031 | Most large companies' business models will be disrupted in 2-5 years | Markets/Stocks | — |
| prereq | 230_020 | Peter's 14-year-old son Milan will never get a driver's license. | Auto/Transport | — |
Raw metadata
{
"nia": false,
"qty": "Equal or greater capability",
"url": "https://www.youtube.com/watch?v=cFI-SqnvQK8",
"mode": "CITED_PREDICTION",
"role": "Host",
"caveats": "Cited",
"context": "it's also been cited that spud will be of equal capability to mythos or more.",
"to_year": 2026,
"verbatim": "it's also been cited that spud will be of equal capability to mythos or more.",
"conv_cues": "cited",
"direction": "HAPPEN",
"from_year": 2026,
"timeframe": "Near-term 2026",
"conv_level": "MEDIUM",
"milestones": [
{
"kind": "llm_pre_event",
"label": "GPT-5.5 ('Spud') releases on or before April 23, 2026",
"notes": "HIT — GPT-5.5 ('Spud' codename) shipped on schedule.",
"source": "OpenAI — Introducing GPT-5.5 (April 23, 2026)",
"status": "hit",
"weight": 0.4,
"ordinal": -9,
"source_id": null,
"confidence": 0.99,
"source_url": "https://openai.com/index/introducing-gpt-5-5/",
"expected_date": "2026-04-23",
"observed_date": "2026-04-23",
"research_origin": "deep_research",
"measurement_criterion": "OpenAI ships GPT-5.5 to consumers and API on or before April 23, 2026; this is the model Diamandis nicknamed 'Spud'"
},
{
"kind": "llm_pre_event",
"label": "GPT-5.5 leads Claude Opus 4.7 ('Mythos') on Terminal-Bench 2.0",
"notes": "HIT — GPT-5.5 leads Terminal-Bench by 13+ points and FrontierMath by 8 points. Confirms 'equal or greater' in coding/math.",
"source": "Build Fast With AI — GPT-5.5 vs Claude (82.7% vs 69.4% on Terminal-Bench)",
"status": "hit",
"weight": 0.4,
"ordinal": -8,
"source_id": null,
"confidence": 0.95,
"source_url": "https://www.buildfastwithai.com/blogs/gpt-5-5-review-2026",
"expected_date": "2026-04-23",
"observed_date": "2026-04-23",
"research_origin": "deep_research",
"measurement_criterion": "GPT-5.5 achieves Terminal-Bench 2.0 score >= Claude Opus 4.7 in published benchmark comparison"
},
{
"kind": "llm_pre_event",
"label": "OpenAI publishes GPT-5.5 system card with capability/safety scores",
"notes": "HIT.",
"source": "OpenAI Deployment Safety Hub — GPT-5.5 system card",
"status": "hit",
"weight": 0.4,
"ordinal": -7,
"source_id": null,
"confidence": 0.99,
"source_url": "https://deploymentsafety.openai.com/gpt-5-5",
"expected_date": "2026-04-23",
"observed_date": "2026-04-23",
"research_origin": "deep_research",
"measurement_criterion": "OpenAI Deployment Safety Hub publishes GPT-5.5 system card with autonomous capability evaluations, dangerous capability tests, and safety mitigations"
},
{
"kind": "llm_pre_event",
"label": "Mixed evaluation: Tom's Guide / blind testing shows Claude Opus 4.7 still wins broader categories",
"notes": "PARTIAL — depending on benchmark, GPT-5.5 either leads (Terminal-Bench/FrontierMath) or loses (Tom's Guide qualitative). Diamandis claim 'equal capability or more' is partially supported.",
"source": "Build Fast With AI — Tom's Guide tested GPT-5.5 vs Claude Opus 4.7 across 7 categories; GPT-5.5 lost in all 7",
"status": "partial",
"weight": 0.4,
"ordinal": -6,
"source_id": null,
"confidence": 0.95,
"source_url": "https://www.buildfastwithai.com/blogs/gpt-5-5-review-2026",
"expected_date": "2026-04-25",
"observed_date": "2026-04-25",
"research_origin": "deep_research",
"measurement_criterion": "Independent blind comparison (e.g., Tom's Guide multi-category test) shows Claude Opus 4.7 winning on writing/reasoning/multimodal vs GPT-5.5"
},
{
"kind": "prereq",
"label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
"status": "hit",
"weight": 0.5,
"ordinal": -5,
"source_id": "SEM_011",
"expec
... (truncated)