INF_026predictionAIsoftware-3.0-utility

'Software 3.0' LLM infrastructure will operate like public utilities — requiring massive upfront capex (training compute, specialized hardware), specialized networking protocols for synchrony across hundreds of thousands of GPUs, and flawless uninterru...

Predictor: Andrej Karpathy

Prior probability

80.0%

Current probability

69.9%

evolves via intake + LBP

Conviction

4/5

Signal quality

Resolution

in_progress

Window

2026-01-01 – 2028-10-31

Edges in / out

3 / 0

Tickers exposed

Prediction text

'Software 3.0' LLM infrastructure will operate like public utilities — requiring massive upfront capex (training compute, specialized hardware), specialized networking protocols for synchrony across hundreds of thousands of GPUs, and flawless uninterrupted uptime; foundational software infrastructure will pivot from text-versioned Git to binary-weight / real-time inference platforms. | Next-gen kernel/hardware co-design releases

Key catalyst: Next-gen kernel/hardware co-design releases

Watch events: Claude / ChatGPT uptime SLAs; weights-format standardization; kernel-engineering open-source growth

Resolution evidence

Status: in_progress

ChatGPT/Claude/Gemini 99.9% uptime demands; Modal / Replicate / Anyscale / Anthropic Bedrock proliferation. Weights-as-binary tooling (LoRA, SafeTensors, ggml) mainstream.

Predictor: Andrej Karpathy

κ + Brier as of 2026-05-22

Full calibration →

κ (discount)

0.688

Brier

0.0067

excellent

Hits / Misses

3 / 0

of 3 resolved

Hit rate

100.0%

Calibration plot (stated vs observed)

Evidence about this node from Andrej Karpathy is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

8 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 69.9%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 8 pending

2026-06-20pendingQ1 window check-in (25%)
2026-07-01 → 2026-12-31pendingNVIDIA + Google Cloud announce Vera Rubin NVL72 rack-scale availability for H2 2026 cloud deployment
How: Google Cloud publicly opens Vera Rubin NVL72 rack-scale compute to customers in 2H 2026
Source: https://cloud.google.com/blog/products/compute/google-cloud-ai-infrastructure-at-nvidia-gtc-2026conf 85%
2026-12-07pendingQ2 window check-in (50%)
2026-06-01 → 2027-06-30pendingAMD MI400 series (CDNA Next) ships at 40 PFLOPS FP4 / 432 GB HBM4 / 19.6 TB/s
How: AMD publicly ships MI400 with stated peak specs (40 PFLOPS FP4, 432GB HBM4, 19.6 TB/s)
Source: https://www.tomshardware.com/tech-industry/semiconductors/nvidia-enterprise-roadmap-rubin-rubin-ultra-feynman-and-silicon-photonicsconf 65%
2026-06-01 → 2027-12-31pendingFullFlat optical NVLink network topology demonstrated for inter-node uniform bandwidth
How: NVIDIA / hyperscaler publishes data-center deployment using FullFlat or optical NVLink topology for LLM training/inference
Source: https://arxiv.org/html/2506.15006v2 — Scaling Intelligence, FullFlat optical topologyconf 60%
2026-06-01 → 2027-12-31pendingHardware-software co-design tooling (Nemotron-Flash / Liquid AI) hits production for inference
How: NVIDIA Nemotron-Flash or Liquid AI co-design framework adopted by ≥1 large model provider in production
Source: https://developer.nvidia.com/blog/how-nvidia-extreme-hardware-software-co-design-delivered-a-large-inference-boost-for-sarvam-ais-sovereign-models/conf 65%
2027-05-26pendingQ3 window check-in (75%)
2026-09-01 → 2028-10-31pendingHyperscaler announces dedicated 'AI utility' offering with SLA-backed inference uptime ≥99.99%
How: AWS / Azure / GCP launches inference platform with public utility-grade SLA on uptime + token throughput
Source: https://intuitionlabs.ai/articles/llm-inference-hardware-enterprise-guideconf 50%
2027-11-12pending'Software 3.0' LLM infrastructure will operate like public utilities — requiring massive upfront capex (training compute, specialized hardwa

No downstream cascades — this prediction is a leaf in the dependency graph.

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 70%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

LBP2026-05-24T02:00:02Z69.9%-2.6pp

Network propagation: 72.5% → 69.9%

4-iter LBP, residual 0.01000 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 806b02f8

LBP2026-05-17T02:00:01Z72.5%-4.8pp

Network propagation: 77.2% → 72.5%

5-iter LBP, residual 0.00689 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e607fa96

LBP2026-05-10T02:00:02Z77.2%-7.7pp

Network propagation: 84.9% → 77.2%

6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29

auto_consensus2026-05-05T22:11:01Z84.9%+5.5pp

auto_consensus:dc47127b-c217-49d2-97c6-ce58983ee599 polarity=corroborates sources=4

Raw metadata

{
  "actor": "system:auto_corroborate",
  "method": "lbp",
  "cred_avg": 0.79,
  "polarity": "corroborates",
  "doc_count": 5,
  "applied_llr": 0.3765,
  "evidence_kind": "auto_consensus",
  "evidence_origin": "auto_corroborate",
  "predictor_kappa": 0.6875,
  "n_distinct_sources": 4
}

LBP2026-05-03T02:00:01Z79.4%-8.5pp

Network propagation: 87.9% → 79.4%

6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9

legacy v12026-04-30T19:17:54Z87.9%+17.3pp

intake:99aa73db-75b1-4b1e-8470-a11f87b23937 bayesian_v2 inside=0.879 blend=0.879 LLR=1.106 κ=0.69 no_blend

LBP2026-04-30T16:39:51Z70.7%-3.4pp

Network propagation: 74.1% → 70.7%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z74.1%-5.9pp

Network propagation: 80.0% → 74.1%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
killer	TK09 Energy Grid Cap (Data Center Power Wall)	35.0%	0.050	0.800	-0.161
killer	TK06 China-Taiwan Military Conflict	8.0%	0.050	0.800	+0.041
killer	TK02 AI Compute Supply Shock (TSMC/Taiwan Disruption)	12.0%	0.050	0.800	+0.011

Top outgoing (children)

Predictions THIS node influences

No outgoing edges.

Ticker exposure

9 ticker(s) linked

Beneficiaries (9)

NVDA CRWV APLD MSFT ORCL AMZN SFTBY GOOGL META

Prerequisites (3)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
killer	TK09	Energy Grid Cap (Data Center Power Wall)	—	—
killer	TK02	AI Compute Supply Shock (TSMC/Taiwan Disruption)	—	—
killer	TK06	China-Taiwan Military Conflict	—	—

Dependents (0)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
No dependents

Expected milestones (2)

From Sheet 17 Monitoring Triggers

Expected by	Description	Status
2026-12-31	Vera Rubin partner availability target for H2 2026	pending
2027-03-31	Thinking Machines/NVIDIA 1 GW Vera Rubin deployment target early 2027	pending

Validations (1)

Resolution events

Observed at	Status	By	Notes
2026-04-29	partial	thesis_timeline_v1.0_import	ChatGPT/Claude/Gemini 99.9% uptime demands; Modal / Replicate / Anyscale / Anthropic Bedrock proliferation. Weights-as-binary tooling (LoRA, SafeTensors, ggml) mainstream.

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.900	codex_research_pack	SoftBank - OpenAI, Oracle, and SoftBank expand Stargate with five new AI data center sites	—	corroborates	✓	2025-09-24
0.900	codex_research_pack	NVIDIA - Vera Rubin Opens Agentic AI Frontier	—	corroborates	✓	2026-03-16
0.900	codex_research_pack	OpenAI - Announcing The Stargate Project	—	corroborates	✓	2025-01-21
0.900	codex_research_pack	NVIDIA - Rubin Platform, Open Models, Autonomous Driving at CES	—	corroborates	✓	2026-01-06
0.900	codex_research_pack	OpenAI - Stargate advances with 4.5 GW partnership with Oracle	—	corroborates	✓	2025-07-22
0.678	arxiv	Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions	—	mentions	pending	2026-05-07
0.656	arxiv	Concurrency without Model Changes: Future-based Asynchronous Function Calling for LLMs	—	mentions	pending	2026-05-14
0.643	arxiv	MinT: Managed Infrastructure for Training and Serving Millions of LLMs	—	mentions	pending	2026-05-13
0.633	arxiv	Do Proactive Agents Really Need an LLM to Decide When to Wake and What to Anchor?	—	mentions	pending	2026-05-28
0.632	arxiv	Latent Performance Profiling of Large Language Models	—	mentions	pending	2026-05-28

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "mode": "FORECAST",
  "role": "Cited-Other",
  "context": "Maximum hardware utilization will depend on engineers 'tuning the hardware itself' — Tensor Core manipulation, custom kernel engineering to extract fractional-percent performance gains.",
  "to_year": 2028,
  "conv_cues": "explicit paradigm framing; utility analogy",
  "direction": "HAPPEN",
  "from_year": 2026,
  "timeframe": "2026-2028",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "quartile_checkpoint",
      "label": "Q1 window check-in (25%)",
      "status": "pending",
      "weight": 0.05,
      "ordinal": -8,
      "source_id": null,
      "expected_date": "2026-06-20",
      "observed_date": null
    },
    {
      "kind": "llm_pre_event",
      "label": "NVIDIA + Google Cloud announce Vera Rubin NVL72 rack-scale availability for H2 2026 cloud deployment",
      "source": "https://cloud.google.com/blog/products/compute/google-cloud-ai-infrastructure-at-nvidia-gtc-2026",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -7,
      "source_id": null,
      "confidence": 0.85,
      "expected_date": "2026-09-30",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2026-12-31",
        "from": "2026-07-01"
      },
      "measurement_criterion": "Google Cloud publicly opens Vera Rubin NVL72 rack-scale compute to customers in 2H 2026"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q2 window check-in (50%)",
      "status": "pending",
      "weight": 0.05,
      "ordinal": -6,
      "source_id": null,
      "expected_date": "2026-12-07",
      "observed_date": null
    },
    {
      "kind": "llm_pre_event",
      "label": "AMD MI400 series (CDNA Next) ships at 40 PFLOPS FP4 / 432 GB HBM4 / 19.6 TB/s",
      "source": "https://www.tomshardware.com/tech-industry/semiconductors/nvidia-enterprise-roadmap-rubin-rubin-ultra-feynman-and-silicon-photonics",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -5,
      "source_id": null,
      "confidence": 0.65,
      "expected_date": "2026-12-15",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-06-30",
        "from": "2026-06-01"
      },
      "measurement_criterion": "AMD publicly ships MI400 with stated peak specs (40 PFLOPS FP4, 432GB HBM4, 19.6 TB/s)"
    },
    {
      "kind": "llm_pre_event",
      "label": "FullFlat optical NVLink network topology demonstrated for inter-node uniform bandwidth",
      "source": "https://arxiv.org/html/2506.15006v2 — Scaling Intelligence, FullFlat optical topology",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -4,
      "source_id": null,
      "confidence": 0.6,
      "expected_date": "2027-03-17",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-12-31",
        "from": "2026-06-01"
      },
      "measurement_criterion": "NVIDIA / hyperscaler publishes data-center deployment using FullFlat or optical NVLink topology for LLM training/inference"
    },
    {
      "kind": "llm_pre_event",
      "label": "Hardware-software co-design tooling (Nemotron-Flash / Liquid AI) hits production for inference",
      "source": "https://developer.nvidia.com/blog/how-nvidia-extreme-hardware-software-co-design-delivered-a-large-inference-boost-for-sarvam-ais-sovereign-models/",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -3,
      "source_id": null,
      "confidence": 0.65,
      "expected_date": "2027-03-17",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-12-31",
        "from": "2026-06-01"
      },
      "measurement_criterion": "NVIDIA Nemotron-Flash or Liquid AI co-design framework adopted by ≥1 large model provider in production"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q3 window check-in (75%)",
      "status": "pending",
      "weight": 0.05,
      "ordinal": -2,
      "sou
... (truncated)