Sam Altman predicted 40x year-over-year hyperdeflation of AI costs at constant capability.
Predictor: Sam Altman · ep#235 "Amazon's $35B AGI Ultimatum to OpenAI & Anthropic Drops AI Safety | EP #235" · source
Prediction text
Sam Altman predicted 40x year-over-year hyperdeflation of AI costs at constant capability. | This goes handinhand with what we've talked about in the past, Sam Alman's comment about 40x year-over-year hyperdelation of costs at constant capability.
Verbatim quote
This goes handinhand with what we've talked about in the past, Sam Alman's comment about 40x year-over-year hyperdelation of costs at constant capability.
Predictor: Sam Altman
Calibration plot (stated vs observed)
Evidence about this node from Sam Altman is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-04-30hitOpenAI / Anthropic / Google headline frontier-model token prices fall 60-80% YoYHow: Industry pricing trackers (TokenMix, IntuitionLabs, Silicon Data) confirm headline LLM API prices for frontier-class models down 60-80% year-over-year between 2025 and 2026Source: https://tokenmix.ai/blog/ai-api-pricing-war-2026; https://intuitionlabs.ai/articles/ai-api-pricing-comparison-grok-gemini-openai-claudeconf 90%Notes: HIT-direction - Industry pricing data shows ~80% YoY drop, far below Altman's 40x but consistent with capability-deflation directional claim. Headline GPT-5.2 input ~$1.75/MTok vs GPT-4 input ~$30/MTok in 2024.
- 2026-06-01 → 2027-06-30pendingSame-capability constant-quality token cost falls >=10x in a single yearHow: Independent benchmark (e.g., Artificial Analysis, ARC-AGI cost-per-task) shows the cheapest model that matches GPT-4-Turbo (Mar-2024) on key benchmarks costs >=10x less per output token in 2026-2027Source: https://artificialanalysis.ai; https://www.silicondata.com/blog/llm-cost-per-tokenconf 85%Notes: Constant-capability is the right Altman framing. 40x annual is the upper bar; 10x+ is the academic-consensus floor.
- 2026-06-01 → 2027-06-30pendingHyperscaler GPU rental prices (H100/H200/B200) fall >=50% YoYHow: AWS / GCP / Azure / Lambda / RunPod publish on-demand hourly GPU prices >=50% lower than the same SKU one year priorSource: AWS/GCP/Azure pricing pages; Lambda Labs and RunPod historical pricingconf 60%Notes: Cascade - underlying compute deflation is the supply-side mechanism for token-cost deflation.
- 2026-06-01 → 2027-09-30pendingGPT-4-class capability available on a sub-$0.10/MTok input modelHow: A frontier vendor publicly prices a model that scores within 5pp of GPT-4-Turbo on MMLU/BBH at <=$0.10/M input tokensSource: OpenAI / Anthropic / Google / xAI pricing pages; Artificial Analysis trackersconf 70%Notes: GPT-5 nano already at $0.05/MTok input (2026), but capability-match requires further tightening.
- 2027-04-01 → 2027-12-31pendingCumulative 2-year (2025->2027) constant-capability cost reduction reaches 1,600x (40x squared)How: Two-year compounded constant-capability deflation reaches >=1,600x as Altman's 40x/yr would require, measured by Artificial Analysis or independent academic benchmarkSource: Independent benchmark trackers; OpenAI pricing historyconf 25%Notes: Cascade - 40x/yr is aggressive; observed reality has been ~10x/yr. 1,600x in 2 years would fully validate Altman's literal claim.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | SEM_042 2025 will be the definitive year that agentic systems finall — Kevin Weil | 73.8% | 0.600 | 0.050 | -0.059 |
| prereq | SEM_012 Nvidia quadrupled chip production output while only doubling — Jensen Huang | 75.0% | 0.600 | 0.050 | -0.051 |
| prereq | SEM_008 Training runs costing $10 billion for a single model will co — Dario Amodei | 76.9% | 0.600 | 0.050 | -0.042 |
| prereq | 238_009 Recursive self-improvement is already happening now (no long — Alex Wissner-Gross | 78.1% | 0.600 | 0.050 | -0.035 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.600 | +0.035 |
Top outgoing (children)
Predictions THIS node influences
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis | 35.5% | 0.700 | 0.050 | +0.022 |
| prereq | SEM_034 True artificial general intelligence will be achieved betwee — Demis Hassabis | 28.7% | 0.550 | 0.050 | +0.015 |
| prereq | CMQ_002 By 2028, AI systems will reach 'independent researcher' leve — Sam Altman | 31.4% | 0.550 | 0.050 | -0.012 |
| prereq | 235_030 Ray Kurzweil predicts Longevity Escape Velocity (LEV) by 203 — Ray Kurzweil | 39.2% | 0.750 | 0.050 | +0.010 |
| prereq | 241_043 ASI will arrive within 2 years to 5 years to this next decad — Peter Diamandis | 35.9% | 0.650 | 0.050 | -0.007 |
Ticker exposure
Beneficiaries (23)
Adverse (6)
Prerequisites (7)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 238_009 | Recursive self-improvement is already happening now (no longer three years out) | AI | — |
| prereq | SEM_008 | Training runs costing $10 billion for a single model will commence sometime in 2025. | AI | — |
| prereq | SEM_042 | 2025 will be the definitive year that agentic systems finally hit the mainstream. | AI/Agents | — |
| prereq | SEM_012 | Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering. | AI/Manufacturing | — |
| killer | TK14 | Superbubble Pop (S&P 500 -40%, Moonshot Capital Evaporates) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (5)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 235_030 | Ray Kurzweil predicts Longevity Escape Velocity (LEV) by 2033. | Biotech/Longevity | — |
| prereq | 232_055 | We're exiting the industrial age permanently as recursive self-improvement unfolds. | AI | — |
| prereq | 241_043 | ASI will arrive within 2 years to 5 years to this next decade | AI | — |
| prereq | CMQ_002 | By 2028, AI systems will reach 'independent researcher' level — driving autonomous scientific discoveries without human intervention. | AI | — |
| prereq | SEM_034 | True artificial general intelligence will be achieved between 2032 and 2042 — 'first we solve AI, then use AI to solve everything else'. | AI/AGI | — |
Linked documents (4)
| Sim | Source | Title | Market prob | Polarity | Reviewed | Published |
|---|---|---|---|---|---|---|
| 0.613 | gdelt | — | mentions | pending | 2026-04-30 | |
| 0.599 | gdelt | meta chief zuckerberg doubles down on ai spending | — | mentions | pending | 2026-04-30 |
| 0.594 | manifold | Sam Altman is imprisoned for a crime before 2030 | 11% | mentions | pending | 2026-05-04 |
| 0.581 | arxiv | Running Vacuum in the expanding Universe: a unified QFT paradigm for Inflation and Dark Energy | — | mentions | pending | 2026-06-03 |
Raw metadata
{
"nia": false,
"qty": "40x per year cost reduction",
"url": "https://www.youtube.com/watch?v=T8X6kp-pcKs",
"mode": "CITED_PREDICTION",
"role": "Cited-Executive",
"context": "the capability density of models is increasing. This goes handinhand with what we've talked about in the past, Sam Alman's comment about 40x year-over-year hyperdelation of costs at constant capability.",
"to_year": 2027,
"cited_by": "Alex Wissner-Gross",
"verbatim": "This goes handinhand with what we've talked about in the past, Sam Alman's comment about 40x year-over-year hyperdelation of costs at constant capability.",
"direction": "DOWN",
"from_year": 2026,
"timeframe": "annual, ongoing",
"conv_level": "HIGH",
"milestones": [
{
"kind": "prereq",
"label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
"status": "hit",
"weight": 0.5,
"ordinal": -8,
"source_id": "SEM_012",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Training runs costing $10 billion for a single model will commence sometime in 2025.",
"status": "hit",
"weight": 0.5,
"ordinal": -7,
"source_id": "SEM_008",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "2025 will be the definitive year that agentic systems finally hit the mainstream.",
"status": "hit",
"weight": 0.5,
"ordinal": -6,
"source_id": "SEM_042",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Recursive self-improvement is already happening now (no longer three years out)",
"status": "hit",
"weight": 0.5,
"ordinal": -5,
"source_id": "238_009",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "llm_pre_event",
"label": "OpenAI / Anthropic / Google headline frontier-model token prices fall 60-80% YoY",
"notes": "HIT-direction - Industry pricing data shows ~80% YoY drop, far below Altman's 40x but consistent with capability-deflation directional claim. Headline GPT-5.2 input ~$1.75/MTok vs GPT-4 input ~$30/MTok in 2024.",
"source": "https://tokenmix.ai/blog/ai-api-pricing-war-2026; https://intuitionlabs.ai/articles/ai-api-pricing-comparison-grok-gemini-openai-claude",
"status": "hit",
"weight": 0.4,
"ordinal": -4,
"source_id": null,
"confidence": 0.9,
"source_url": "https://tokenmix.ai/blog/ai-api-pricing-war-2026",
"expected_date": "2026-04-30",
"observed_date": "2026-04-30",
"research_origin": "deep_research",
"measurement_criterion": "Industry pricing trackers (TokenMix, IntuitionLabs, Silicon Data) confirm headline LLM API prices for frontier-class models down 60-80% year-over-year between 2025 and 2026"
},
{
"kind": "llm_pre_event",
"label": "Same-capability constant-quality token cost falls >=10x in a single year",
"notes": "Constant-capability is the right Altman framing. 40x annual is the upper bar; 10x+ is the academic-consensus floor.",
"source": "https://artificialanalysis.ai; https://www.silicondata.com/blog/llm-cost-per-token",
"status": "pending",
"weight": 0.4,
"ordinal": -3,
"source_id": null,
"confidence": 0.85,
"source_url": "https://www.silicondata.com/blog/llm-cost-per-token",
"expected_date": "2026-12-15",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2027-06-30",
"from": "2026-06-01"
},
"measurement_criterion": "Independent benchmark (e.g., Artificial Analysis, ARC-AGI cost-per-task) shows the cheapest model that matches GPT-4-Turbo (Mar-2024) on key benchmarks costs >=10x less per outpu
... (truncated)