SEM_022predictionAI/ArchitectureAI-scaling

FP4 / ternary-weight architectures decouple AI capability from raw transistor density — embargoed nations maintain competitive development.

Predictor: Dave Blundin

Prior probability

78.0%

Current probability

64.5%

evolves via intake + LBP

Conviction

4/5

Signal quality

Resolution

partial

Window

2026-01-01 – 2029-12-31

Edges in / out

10 / 5

Tickers exposed

Prediction text

FP4 / ternary-weight architectures decouple AI capability from raw transistor density — embargoed nations maintain competitive development. | Ternary/FP4/sub-bit model production deployments

Key catalyst: Ternary/FP4/sub-bit model production deployments

Watch events: BitNet-class ternary deployment scale; FP2 / 1-bit model research milestones

Resolution evidence

Status: partial

FP4 training + INT8/INT4 inference now standard at Chinese labs. DeepSeek, Qwen, MiniMax all using sub-8-bit quantization in production.

Predictor: Dave Blundin

κ + Brier as of 2026-05-22

Full calibration →

κ (discount)

0.821

Brier

0.0491

excellent

Hits / Misses

3 / 2

of 9 resolved

Hit rate

33.3%

Calibration plot (stated vs observed)

Evidence about this node from Dave Blundin is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

5 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 64.5%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 6 fired ✓

2026-04-15hitFP4 inference lands in mainstream open-source frameworks
How: FP4 (NVFP4 or MXFP4) inference support added to llama.cpp, vLLM, or TensorRT-LLM with public benchmarks showing minimal accuracy loss vs FP16
Source: https://insiderllm.com/guides/fp4-inference-llamacpp-nvfp4-mxfp4/ — FP4 already landed in llama.cpp by April 2026conf 99%
Notes: HIT — FP4 (NVFP4 + MXFP4) confirmed in llama.cpp by April 2026 per InsiderLLM. Decouples capability from FP16 transistor density.
2026-04-24hitDeepSeek V4 KV cache quantization confirms architecture decoupling
How: DeepSeek V4-Flash demonstrates KV cache quantization at 384K context window with material memory reduction
Source: https://dev.to/soytuber/deepseek-v4-flash-gemmaqwen-kv-cache-quantization-384k-context-2m0conf 95%
2026-04-29hitNvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.
2026-04-29hitNvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.
2026-04-29hitNvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).
2026-04-29hitNvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a
2026-05-01partialFP4 / ternary-weight architectures decouple AI capability from raw transistor density — embargoed nations maintain competitive development.
2026-09-01 → 2027-12-31pendingTernary-weight production model released by major lab
How: Major lab (Microsoft Research BitNet, peer) releases production-grade ternary-weight model with capability matching FP16 at 4x+ memory reduction
Source: Microsoft Research BitNet papers, arXiv quantization papersconf 50%
Notes: BitNet b1.58 (Microsoft 2024) demonstrated ternary feasibility at 3B scale; production at frontier scale is the bar.
2026-09-01 → 2028-06-30pendingEmbargoed-nation AI lab demonstrates frontier capability via FP4/ternary stack
How: Chinese, Russian, or Iranian AI lab demonstrates frontier-class model trained primarily on FP4/quantized stack on domestic chips
Source: Chinese AI lab papers, Reuters trackingconf 55%
2030-04-18pendingDario Amodei will solve most/all neurological diseases by end of decade
2030-10-10pendingEuropa Clipper will arrive at Jupiter in 2030, conducting 50 passes near Europa.
2034-09-16pendingDragonfly nuclear-powered octicopter arrives at Titan in 2034.
2040-07-06pendingTEPCO's restarted reactor will support 20% of Japan's electric needs by 2040
2063-06-21pendingPeter's 14-year-old son Milan will never get a driver's license.

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 65%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

LBP2026-05-10T02:00:02Z64.5%-1.7pp

Network propagation: 66.2% → 64.5%

6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29

LBP2026-05-03T02:00:01Z66.2%-2.9pp

Network propagation: 69.2% → 66.2%

6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9

resolution_terminal2026-05-01T00:00:00Z50.0%-19.2pp

resolution_terminal partial outcome=0.5 pre_resolution=0.692

Raw metadata

{
  "source": "backfill_resolution_history.py",
  "status": "partial",
  "bayesian_v2": false,
  "outcome_prob": 0.5,
  "evidence_kind": "resolution_terminal",
  "posterior_prob": 0.5,
  "delta_to_outcome": -0.19189,
  "inside_posterior": 0.69189,
  "validation_notes": "FP4 training + INT8/INT4 inference now standard at Chinese labs. DeepSeek, Qwen, MiniMax all using sub-8-bit quantization in production.",
  "validation_status": "hit",
  "pre_resolution_prob": 0.69189,
  "resolution_evidence": "FP4 training + INT8/INT4 inference now standard at Chinese labs. DeepSeek, Qwen, MiniMax all using sub-8-bit quantization in production.",
  "does_not_update_current_prob": true
}

LBP2026-04-30T16:39:51Z69.2%-3.8pp

Network propagation: 73.0% → 69.2%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z73.0%-5.0pp

Network propagation: 78.0% → 73.0%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
killer	TK09 Energy Grid Cap (Data Center Power Wall)	35.0%	0.050	0.780	-0.121
prereq	SEM_015 Nvidia agreed to remit 15% of China chip-sale revenue direct — Jensen Huang	66.3%	0.780	0.050	-0.105
prereq	SEM_027 Nvidia Data Center revenue +66% YoY, contributing ~90% of $5 — Joseph Moore	68.3%	0.780	0.050	-0.104
killer	TK05 Rate Regime Persistence (10y > 5% through 2028)	30.0%	0.050	0.780	-0.084
killer	TK03 AI Regulatory Moratorium (EU/US Capability Freeze)	10.0%	0.050	0.780	+0.062

Top outgoing (children)

Predictions THIS node influences

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
prereq	240_036 TEPCO's restarted reactor will support 20% of Japan's electr — Peter Diamandis	34.3%	0.650	0.050	+0.089
prereq	230_020 Peter's 14-year-old son Milan will never get a driver's lice — Peter Diamandis	34.7%	0.650	0.050	+0.084
prereq	247_035 Dario Amodei will solve most/all neurological diseases by en — Dario Amodei	38.8%	0.700	0.050	+0.075
prereq	246_016 Dragonfly nuclear-powered octicopter arrives at Titan in 203 — Peter Diamandis	35.6%	0.650	0.050	+0.075
prereq	246_017 Europa Clipper will arrive at Jupiter in 2030, conducting 50 — Peter Diamandis	37.7%	0.650	0.050	+0.054

Ticker exposure

37 ticker(s) linked

Beneficiaries (24)

MU WULF IREN EQIX ALAB APLD ASMIY ASML PLAB NVDA NBIS CRWV AAPL AMT AMZN DELL GOOGL IRM LNVGY META MSFT ORCL SFTBY STX

Adverse (6)

ACN GEN CHGG IBM WNS LRN

Prerequisites (10)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
prereq	SEM_011	Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.	Capital Markets	—
prereq	SEM_027	Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.	Capital Markets	—
prereq	SEM_014	Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).	Manufacturing	—
prereq	SEM_012	Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering.	AI/Manufacturing	—
prereq	SEM_015	Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans.	Policy/Semis	—
killer	TK09	Energy Grid Cap (Data Center Power Wall)	—	—
killer	TK05	Rate Regime Persistence (10y > 5% through 2028)	—	—
killer	TK01	AGI Capability Plateau (2026-27 Training Stall)	—	—
killer	TK02	AI Compute Supply Shock (TSMC/Taiwan Disruption)	—	—
killer	TK03	AI Regulatory Moratorium (EU/US Capability Freeze)	—	—

Dependents (5)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
prereq	247_035	Dario Amodei will solve most/all neurological diseases by end of decade	Biotech/Longevity	—
prereq	230_020	Peter's 14-year-old son Milan will never get a driver's license.	Auto/Transport	—
prereq	246_017	Europa Clipper will arrive at Jupiter in 2030, conducting 50 passes near Europa.	Space	—
prereq	246_016	Dragonfly nuclear-powered octicopter arrives at Titan in 2034.	Space	—
prereq	240_036	TEPCO's restarted reactor will support 20% of Japan's electric needs by 2040	Energy	—

Expected milestones (1)

From Sheet 17 Monitoring Triggers

Expected by	Description	Status
2029-12-31	[Capability 2029-12] ; trillion-dollar-cluster announcements [SEM_022] BitNet-class ternary deployment scale; FP2 / 1-bit model research milestones [INF_071] Frontier model benchmark evaluations; GDPval / ARC-AGI progression; enterprise agentic deployment	pending

Validations (1)

Resolution events

Observed at	Status	By	Notes
2026-04-29	hit	thesis_timeline_v1.0_import	FP4 training + INT8/INT4 inference now standard at Chinese labs. DeepSeek, Qwen, MiniMax all using sub-8-bit quantization in production.

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.760	arxiv	Litespark Inference on Consumer CPUs: Custom SIMD Kernels for Ternary Neural Networks	—	mentions	pending	2026-05-07
0.711	arxiv	ITP-STDP: An Intrinsic-Timing Power-of-Two Learning Engine for On-Chip SNN Training	—	mentions	pending	2026-06-04
0.703	arxiv	A Hardware-Aware, Per-Layer Methodology for Post-Training Quantization of Large Language Models	—	mentions	pending	2026-05-14
0.696	arxiv	Buffer-Parameterized Machine Learning Surrogate Models for Cross-Technology Signal Integrity Analysis and Optimization	—	mentions	pending	2026-05-18
0.695	arxiv	Grid Games: The Power of Multiple Grids for Quantizing Large Language Models	—	mentions	pending	2026-05-12
0.690	arxiv	The Complexity of Verifying Feedforward Neural Networks in Quantised Settings	—	mentions	pending	2026-05-28
0.686	arxiv	LLMForge: Multi-Backend Hardware-Aware Neural Architecture Search with Infinite-Head Attention for Edge Language Models	—	mentions	pending	2026-05-17
0.686	arxiv	PoTAcc: A Pipeline for End-to-End Acceleration of Power-of-Two Quantized DNNs	—	mentions	pending	2026-05-07
0.685	arxiv	The Symmetries of Three-Layer ReLU Networks	—	mentions	pending	2026-05-18
0.682	arxiv	Phasor Memory Networks: Stable Backpropagation Through Time for Scalable Explicit Memory	—	mentions	pending	2026-05-13

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "mode": "THESIS",
  "role": "Host-VC",
  "context": "Shift from FP32 to ternary weights reduces hardware requirements by >1 order of magnitude; proves AI progression not purely hardware-limited.",
  "to_year": 2029,
  "conv_cues": "mathematically decouple; proving",
  "direction": "HAPPEN",
  "from_year": 2026,
  "timeframe": "2026+",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "llm_pre_event",
      "label": "FP4 inference lands in mainstream open-source frameworks",
      "notes": "HIT — FP4 (NVFP4 + MXFP4) confirmed in llama.cpp by April 2026 per InsiderLLM. Decouples capability from FP16 transistor density.",
      "source": "https://insiderllm.com/guides/fp4-inference-llamacpp-nvfp4-mxfp4/ — FP4 already landed in llama.cpp by April 2026",
      "status": "hit",
      "weight": 0.4,
      "ordinal": -6,
      "source_id": null,
      "confidence": 0.99,
      "source_url": "https://insiderllm.com/guides/fp4-inference-llamacpp-nvfp4-mxfp4/",
      "expected_date": "2026-04-01",
      "observed_date": "2026-04-15",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2026-06-30",
        "from": "2026-01-01"
      },
      "measurement_criterion": "FP4 (NVFP4 or MXFP4) inference support added to llama.cpp, vLLM, or TensorRT-LLM with public benchmarks showing minimal accuracy loss vs FP16"
    },
    {
      "kind": "llm_pre_event",
      "label": "DeepSeek V4 KV cache quantization confirms architecture decoupling",
      "source": "https://dev.to/soytuber/deepseek-v4-flash-gemmaqwen-kv-cache-quantization-384k-context-2m0",
      "status": "hit",
      "weight": 0.4,
      "ordinal": -5,
      "source_id": null,
      "confidence": 0.95,
      "source_url": "https://dev.to/soytuber/deepseek-v4-flash-gemmaqwen-kv-cache-quantization-384k-context-2m0",
      "expected_date": "2026-04-24",
      "observed_date": "2026-04-24",
      "research_origin": "deep_research",
      "measurement_criterion": "DeepSeek V4-Flash demonstrates KV cache quantization at 384K context window with material memory reduction"
    },
    {
      "kind": "prereq",
      "label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -4,
      "source_id": "SEM_011",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -3,
      "source_id": "SEM_027",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -2,
      "source_id": "SEM_014",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -1,
      "source_id": "SEM_012",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "event",
      "label": "FP4 / ternary-weight architectures decouple AI capability from raw transistor density — embargoed nations maintain competitive development.",
      "status": "partial",
      "weight": 1,
      "ordinal": 0,
      "source_id": "SEM_022",
      "expected_date": "2026-05-01",
      "observed_date": "2026-05-01"
    },
    {
      "kind": "llm_pre_event",
      "label": "Ternary-weight production model released by 
... (truncated)