← Cockpit
CMQ_027predictionAI/Computeinference-shift

The inference inflection has arrived — industry transitioning from training-dominated capex (2023-2025) to inference-dominated economics (2026+).

Predictor: Jensen Huang

Prior probability
92.0%
Current probability
92.0%
evolves via intake + LBP
Conviction
5/5
Signal quality
A
Resolution
hit
Window
2026-01-01 – 2026-12-31
Edges in / out
0 / 0
Tickers exposed
9

Prediction text

The inference inflection has arrived — industry transitioning from training-dominated capex (2023-2025) to inference-dominated economics (2026+). | the inference inflection has arrived | Hyperscaler inference revenue disclosures

Key catalyst: Hyperscaler inference revenue disclosures

Watch events: Hyperscaler inference revenue mix; token-consumption growth rates; NVIDIA inference-optimized product mix.

Verbatim quote

From episode "The Global Architecture of Machine Intelligence: Exhaustive Synthesis of AI Compute, Memory & Quantum Predictions (2023-2026)"
the inference inflection has arrived

Resolution evidence

Status: hit

Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.

Predictor: Jensen Huang

κ + Brier as of 2026-05-22
κ (discount)
0.808
Brier
0.0128
excellent
Hits / Misses
6 / 0
of 8 resolved
Hit rate
75.0%
Calibration plot (stated vs observed)

Evidence about this node from Jensen Huang is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

1 prob_history rows
0%25%50%75%100%prior 92%2026-04-29
intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 92.0%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.
Leading chain: 3 overdue ⏱
  1. 2026-01-30overdueQ1 window check-in (25%)
  2. 2026-03-01overdueQ2 window check-in (50%)
  3. 2026-03-30overdueQ3 window check-in (75%)

No downstream cascades — this prediction is a leaf in the dependency graph.

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.
(live posterior: 92%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first
resolution_terminal2026-04-29T22:23:18Z100.0%+8.0pp
resolution_terminal hit outcome=1.0 pre_resolution=0.920
Raw metadata
{
  "source": "backfill_resolution_history.py",
  "status": "hit",
  "bayesian_v2": false,
  "outcome_prob": 1,
  "evidence_kind": "resolution_terminal",
  "posterior_prob": 1,
  "delta_to_outcome": 0.07999999999999996,
  "inside_posterior": 0.92,
  "validation_notes": "Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.",
  "validation_status": "hit",
  "pre_resolution_prob": 0.92,
  "resolution_evidence": "Agentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.",
  "does_not_update_current_prob": true
}

Network propagation neighbors

Top edges sorted by latest LBP cross-impact
All propagation →

No propagation data yet. Run inference/.venv/bin/python scripts/ops/run_loopy_belief_propagation.py on the droplet, or wait for the Sunday 02:00 UTC weekly cron.

Ticker exposure

9 ticker(s) linked

Beneficiaries (9)

NVDACRWVAPLDMSFTORCLAMZNSFTBYGOOGLMETA

Prerequisites (0)

Predictions that must hit first
TypePredTitleDomainLag
No prerequisites

Dependents (0)

Predictions enabled by this
TypePredTitleDomainLag
No dependents

Expected milestones (1)

From Sheet 17 Monitoring Triggers
Expected byDescriptionStatus
2026-12-31[Capability 2026-12] / Rubin successor memory capacity specs [CMQ_027] Hyperscaler inference revenue mix; token-consumption growth rates; NVIDIA inference-optimized produc [CMQ_043] Agentic workload latency breakdowns; agent-framework optimization announcements.pending

Validations (1)

Resolution events
Observed atStatusByNotes
2026-04-29hitthesis_timeline_v1.0_importAgentic AI workload explosion 2025-2026 shifts compute mix decisively toward inference; Anthropic/OpenAI capacity-constrained on inference not training.

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT
SimSourceTitleMarket probPolarityReviewedPublished
0.611github_releasefacebookresearch/balance 0.16.0mentionspending2026-02-09
0.604arxivRethinking Post-Training Recipes for Multimodal Time-Series Forecastingmentionspending2026-05-28
0.603gdeltmentionspending2026-04-30
0.603github_releasefacebookresearch/balance 0.18.0mentionspending2026-03-24
0.598github_releasefacebookresearch/balance 0.13.0mentionspending2025-12-02
0.598arxivRose-SQL: Role-State Evolution Guided Structured Reasoning for Multi-Turn Text-to-SQLmentionspending2026-05-05
0.595arxivTS-ICL: A Flexible Time-Indexed Foundation Model for Time Series via In-Context Learningmentionspending2026-06-04
0.586github_releasetensorflow/tensorflow v2.14.0-rc0mentionspending2023-08-17
0.584github_releasetensorflow/tensorflow v2.14.0-rc1mentionspending2023-08-31
0.581arxivTowards Multidisciplinary Summarization of Hospital Stays: Efficient Sentence-Level Clinical Provenance Categorizationmentionspending2026-06-01

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook
{
  "nia": false,
  "qty": "inference-dominant",
  "mode": "FORECAST",
  "role": "Cited-Executive",
  "context": "Huang declared at Morgan Stanley TMT Conference 2026; reframes entire compute demand curve around continuous inference load.",
  "to_year": 2030,
  "verbatim": "the inference inflection has arrived",
  "conv_cues": "has arrived; CEO declaration",
  "direction": "HAPPEN",
  "from_year": 2026,
  "timeframe": "2026+",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "quartile_checkpoint",
      "label": "Q1 window check-in (25%)",
      "status": "overdue",
      "weight": 0.05,
      "ordinal": -3,
      "source_id": null,
      "expected_date": "2026-01-30",
      "observed_date": null,
      "miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
      "miss_emitted_by": "metadata_milestone_sweep"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q2 window check-in (50%)",
      "status": "overdue",
      "weight": 0.05,
      "ordinal": -2,
      "source_id": null,
      "expected_date": "2026-03-01",
      "observed_date": null,
      "miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
      "miss_emitted_by": "metadata_milestone_sweep"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q3 window check-in (75%)",
      "status": "overdue",
      "weight": 0.05,
      "ordinal": -1,
      "source_id": null,
      "expected_date": "2026-03-30",
      "observed_date": null,
      "miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
      "miss_emitted_by": "metadata_milestone_sweep"
    },
    {
      "kind": "event",
      "label": "The inference inflection has arrived — industry transitioning from training-dominated capex (2023-2025) to inference-dominated economics (20",
      "status": "hit",
      "weight": 1,
      "ordinal": 0,
      "source_id": "CMQ_027",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    }
  ],
  "repeat_eps": 1,
  "sub_domain": "Compute",
  "affiliation": "NVIDIA",
  "attribution": "FIRST_PERSON",
  "granularity": "YEAR",
  "resolved_at": "2026-04-29T22:23:18.098212+00:00",
  "source_refs": "19, 20",
  "target_date": "2026-06-15T00:00:00",
  "display_date": "2026-04-29",
  "episode_date": "2026-04-21T00:00:00",
  "key_catalyst": "Hyperscaler inference revenue disclosures",
  "parse_method": "Report midpoint",
  "domain_bucket": "AI",
  "episode_title": "The Global Architecture of Machine Intelligence: Exhaustive Synthesis of AI Compute, Memory & Quantum Predictions (2023-2026)",
  "fault_line_id": "F002",
  "flag_repeated": false,
  "in_5yr_window": true,
  "source_report": "AI_Chip__Compute__Memory__Quantum_Predictions.md (2026-04-21)",
  "appears_in_eps": "CMQ-RPT",
  "futurist_phase": "Phase 1 (2026)",
  "is_macro_claim": false,
  "total_mentions": 1,
  "priority_weight": 5,
  "ps_cluster_tags": [
    "C5"
  ],
  "report_evidence": "Drives hardware bifurcation (training → HBM-heavy; inference → efficient, low-latency).",
  "active_end_month": "2026-12",
  "recent_statement": "Huang MS TMT 2026 keynote; NVIDIA FY guidance reflects inference TAM expansion.",
  "watch_events_raw": "Hyperscaler inference revenue mix; token-consumption growth rates; NVIDIA inference-optimized product mix.",
  "months_from_today": 2,
  "probability_layer": "Higher (in-flight)",
  "active_start_month": "2026-01",
  "flag_nia_bracketed": false,
  "resolved_at_source": "validations_observed_at",
  "track_record_grade": "A",
  "track_record_notes": "Already playing out in 2026 Q1-Q2 cloud AI inference demand data.",
  "contradicting_notes": "Training compute still growing in absolute terms; 'inflection' is mix-shift not absolute reduction.",
  "flag_near_term_2027": true,
  "flag_high_conviction": true,
  "milestones_derived_at": "2026-05-02T03:08:50.616477+00:00",
  "reference_class_match": {
    "decision": "keyword_filtered",
    "computed_at": "2026-04-30T01:49:13.796883+00:00",
    "best_id_unfiltered": "recession_probability_2yr",