The inference inflection has arrived — industry transitioning from training-dominated capex (2023-2025) to inference-dominated economics (2026+).
Predictor: Jensen Huang
Prediction text
The inference inflection has arrived — industry transitioning from training-dominated capex (2023-2025) to inference-dominated economics (2026+). | the inference inflection has arrived | Hyperscaler inference revenue disclosures
Key catalyst: Hyperscaler inference revenue disclosures
Watch events: Hyperscaler inference revenue mix; token-consumption growth rates; NVIDIA inference-optimized product mix.
Verbatim quote
the inference inflection has arrived
Resolution evidence
Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.
Predictor: Jensen Huang
Calibration plot (stated vs observed)
Evidence about this node from Jensen Huang is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-01-30overdueQ1 window check-in (25%)
- 2026-03-01overdueQ2 window check-in (50%)
- 2026-03-30overdueQ3 window check-in (75%)
No downstream cascades — this prediction is a leaf in the dependency graph.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Raw metadata
{
"source": "backfill_resolution_history.py",
"status": "hit",
"bayesian_v2": false,
"outcome_prob": 1,
"evidence_kind": "resolution_terminal",
"posterior_prob": 1,
"delta_to_outcome": 0.07999999999999996,
"inside_posterior": 0.92,
"validation_notes": "Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.",
"validation_status": "hit",
"pre_resolution_prob": 0.92,
"resolution_evidence": "Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.",
"does_not_update_current_prob": true
}Network propagation neighbors
No propagation data yet. Run inference/.venv/bin/python scripts/ops/run_loopy_belief_propagation.py on the droplet, or wait for the Sunday 02:00 UTC weekly cron.
Ticker exposure
Beneficiaries (9)
Prerequisites (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No prerequisites | ||||
Dependents (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No dependents | ||||
Expected milestones (1)
| Expected by | Description | Status |
|---|---|---|
| 2026-12-31 | [Capability 2026-12] / Rubin successor memory capacity specs [CMQ_027] Hyperscaler inference revenue mix; token-consumption growth rates; NVIDIA inference-optimized produc [CMQ_043] Agentic workload latency breakdowns; agent-framework optimization announcements. | pending |
Validations (1)
| Observed at | Status | By | Notes |
|---|---|---|---|
| 2026-04-29 | hit | thesis_timeline_v1.0_import | Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training. |
Linked documents (10)
| Sim | Source | Title | Market prob | Polarity | Reviewed | Published |
|---|---|---|---|---|---|---|
| 0.611 | github_release | facebookresearch/balance 0.16.0 | — | mentions | pending | 2026-02-09 |
| 0.604 | arxiv | Rethinking Post-Training Recipes for Multimodal Time-Series Forecasting | — | mentions | pending | 2026-05-28 |
| 0.603 | gdelt | — | mentions | pending | 2026-04-30 | |
| 0.603 | github_release | facebookresearch/balance 0.18.0 | — | mentions | pending | 2026-03-24 |
| 0.598 | github_release | facebookresearch/balance 0.13.0 | — | mentions | pending | 2025-12-02 |
| 0.598 | arxiv | Rose-SQL: Role-State Evolution Guided Structured Reasoning for Multi-Turn Text-to-SQL | — | mentions | pending | 2026-05-05 |
| 0.595 | arxiv | TS-ICL: A Flexible Time-Indexed Foundation Model for Time Series via In-Context Learning | — | mentions | pending | 2026-06-04 |
| 0.586 | github_release | tensorflow/tensorflow v2.14.0-rc0 | — | mentions | pending | 2023-08-17 |
| 0.584 | github_release | tensorflow/tensorflow v2.14.0-rc1 | — | mentions | pending | 2023-08-31 |
| 0.581 | arxiv | Towards Multidisciplinary Summarization of Hospital Stays: Efficient Sentence-Level Clinical Provenance Categorization | — | mentions | pending | 2026-06-01 |
Raw metadata
{
"nia": false,
"qty": "inference-dominant",
"mode": "FORECAST",
"role": "Cited-Executive",
"context": "Huang declared at Morgan Stanley TMT Conference 2026; reframes entire compute demand curve around continuous inference load.",
"to_year": 2030,
"verbatim": "the inference inflection has arrived",
"conv_cues": "has arrived; CEO declaration",
"direction": "HAPPEN",
"from_year": 2026,
"timeframe": "2026+",
"conv_level": "HIGH",
"milestones": [
{
"kind": "quartile_checkpoint",
"label": "Q1 window check-in (25%)",
"status": "overdue",
"weight": 0.05,
"ordinal": -3,
"source_id": null,
"expected_date": "2026-01-30",
"observed_date": null,
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep"
},
{
"kind": "quartile_checkpoint",
"label": "Q2 window check-in (50%)",
"status": "overdue",
"weight": 0.05,
"ordinal": -2,
"source_id": null,
"expected_date": "2026-03-01",
"observed_date": null,
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep"
},
{
"kind": "quartile_checkpoint",
"label": "Q3 window check-in (75%)",
"status": "overdue",
"weight": 0.05,
"ordinal": -1,
"source_id": null,
"expected_date": "2026-03-30",
"observed_date": null,
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep"
},
{
"kind": "event",
"label": "The inference inflection has arrived — industry transitioning from training-dominated capex (2023-2025) to inference-dominated economics (20",
"status": "hit",
"weight": 1,
"ordinal": 0,
"source_id": "CMQ_027",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
}
],
"repeat_eps": 1,
"sub_domain": "Compute",
"affiliation": "NVIDIA",
"attribution": "FIRST_PERSON",
"granularity": "YEAR",
"resolved_at": "2026-04-29T22:23:18.098212+00:00",
"source_refs": "19, 20",
"target_date": "2026-06-15T00:00:00",
"display_date": "2026-04-29",
"episode_date": "2026-04-21T00:00:00",
"key_catalyst": "Hyperscaler inference revenue disclosures",
"parse_method": "Report midpoint",
"domain_bucket": "AI",
"episode_title": "The Global Architecture of Machine Intelligence: Exhaustive Synthesis of AI Compute, Memory & Quantum Predictions (2023-2026)",
"fault_line_id": "F002",
"flag_repeated": false,
"in_5yr_window": true,
"source_report": "AI_Chip__Compute__Memory__Quantum_Predictions.md (2026-04-21)",
"appears_in_eps": "CMQ-RPT",
"futurist_phase": "Phase 1 (2026)",
"is_macro_claim": false,
"total_mentions": 1,
"priority_weight": 5,
"ps_cluster_tags": [
"C5"
],
"report_evidence": "Drives hardware bifurcation (training → HBM-heavy; inference → efficient, low-latency).",
"active_end_month": "2026-12",
"recent_statement": "Huang MS TMT 2026 keynote; NVIDIA FY guidance reflects inference TAM expansion.",
"watch_events_raw": "Hyperscaler inference revenue mix; token-consumption growth rates; NVIDIA inference-optimized product mix.",
"months_from_today": 2,
"probability_layer": "Higher (in-flight)",
"active_start_month": "2026-01",
"flag_nia_bracketed": false,
"resolved_at_source": "validations_observed_at",
"track_record_grade": "A",
"track_record_notes": "Already playing out in 2026 Q1-Q2 cloud AI inference demand data.",
"contradicting_notes": "Training compute still growing in absolute terms; 'inflection' is mix-shift not absolute reduction.",
"flag_near_term_2027": true,
"flag_high_conviction": true,
"milestones_derived_at": "2026-05-02T03:08:50.616477+00:00",
"reference_class_match": {
"decision": "keyword_filtered",
"computed_at": "2026-04-30T01:49:13.796883+00:00",
"best_id_unfiltered": "recession_probability_2yr",