Largest AI models will see a 100x leap in size by 2026, driven largely by Chinese research efforts using quantization breakthroughs.
Predictor: Dave Blundin
Key catalyst: DeepSeek V5 / Qwen 3 benchmark releases
Watch events: DeepSeek V5 release; Qwen 3 scale; MoE ternary-weight model benchmark releases
Resolution evidence
DeepSeek V4 (Mar 2026) used FP4 training + ternary inference achieving GPT-4-class performance at ~1/40th the cost. Qwen/DeepSeek scaling laws validate quantization-first Chinese approach.
Calibration plot (stated vs observed)
Evidence about this node from Dave Blundin is multiplied by κ in /api/intake. A lower κ gives the evidence less weight: κ is floored at 0.10 (effectively silenced) and capped at 1.00 (full weight).
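The κ weighting described above can be sketched as a clamp followed by a multiplication. This is a minimal illustration, not the actual /api/intake code, which the page does not show: the function names `clamp_kappa` and `weight_evidence` are hypothetical, and treating "evidence" as a single float is an assumption.

```python
# Hypothetical sketch of the kappa weighting; the real /api/intake logic is not shown on the page.
KAPPA_FLOOR = 0.10  # stated floor: evidence is effectively silenced
KAPPA_CAP = 1.00    # stated cap: evidence gets full weight


def clamp_kappa(kappa: float) -> float:
    """Restrict kappa to the stated range [0.10, 1.00]."""
    return max(KAPPA_FLOOR, min(KAPPA_CAP, kappa))


def weight_evidence(evidence: float, kappa: float) -> float:
    """Scale an evidence value (assumed here to be a plain float) by the clamped kappa."""
    return evidence * clamp_kappa(kappa)


if __name__ == "__main__":
    for raw_kappa in (0.02, 0.50, 1.70):
        print(raw_kappa, clamp_kappa(raw_kappa), weight_evidence(2.0, raw_kappa))
```

Under these assumptions, a raw κ below the floor still passes 10% of the evidence through rather than zeroing it out, which matches the "effectively silenced" wording.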
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-04-24 (hit): DeepSeek V4 release confirmed
  How: DeepSeek-AI publicly releases V4 with V4-Pro at 1.6T total parameters, 49B activated per token (vs V3 at ~671B/37B)
  Source: https://medium.com/@leucopsis/deepseek-v4-review-a23ce940151c and BentoML guide. V4 Pro exposed via API on April 24, 2026.
  Conf: 99%
  Notes: HIT. DeepSeek V4 launched. Total params went from V3's 671B → V4-Pro's 1.6T (~2.4x), not 100x. The prediction's 100x claim is overstated; partial signal.
- 2026-04-24 (hit): Quantization breakthrough: V4 reduces inference FLOPs to 27% of V3.2
  How: DeepSeek V4 paper/post claims FLOPs/token reduced to ≤30% of prior generation via Compressed Sparse Attention or peer architecture
  Source: DeepSeek V4 technical paper, Medium reviews
  Conf: 99%
  Notes: HIT. V4-Pro reduces single-token FLOPs to 27% of V3.2. Confirms Blundin's quantization-breakthrough theme.
- 2026-04-15 (hit): Qwen 3.6 release with sub-10B competitive frontier
  How: Alibaba Qwen 3.6 release with smaller (≤10B) variants matching frontier benchmarks
  Source: https://lushbinary.com/blog/qwen-3-6-vs-gemma-4-llama-4-glm-5-1-deepseek-v4-open-source-comparison/
  Conf: 90%
  Notes: HIT. Qwen 3.6 released. Qwen3.6-35B-A3B carries 35B total / 3B active with 262K context.
- 2026-09-01 → 2026-12-31 (pending): Chinese frontier model crosses ≥10T total parameters
  How: DeepSeek V5 or Qwen 4 or peer Chinese model crosses 10T total parameters (would be ~15x V3 and ~6x V4-Pro; ~6x GPT-4)
  Source: DeepSeek-AI blog, Alibaba Cloud announcements
  Conf: 30%
  Notes: Path to the '100x leap' hinges on next-gen V5/V6 by year-end. Currently behind the original prediction trajectory.
- 2026-10-01 → 2026-12-31 (pending): Retrospective verdict on '100x leap by 2026', likely PARTIAL
  How: Industry analyst consensus that V4 (~2.4x V3) and Qwen 3.6 represent a meaningful but sub-100x scale gain, which supports a PARTIAL classification
  Source: Stratechery, SemiAnalysis, ChinaTalk newsletter
  Conf: 75%
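The scale arithmetic behind the milestone notes can be checked with a few lines. This is a rough sketch using only the total-parameter figures quoted in the milestones; taking DeepSeek V3 as the baseline for the "100x" claim is an assumption, since the prediction does not name a baseline, and the variable names are illustrative.

```python
# Total-parameter counts as quoted in the milestone chain.
V3_TOTAL = 671e9           # DeepSeek V3, ~671B
V4_PRO_TOTAL = 1.6e12      # DeepSeek V4-Pro, 1.6T
PENDING_THRESHOLD = 10e12  # pending milestone: a model at or above 10T


def scale_ratio(new_params: float, old_params: float) -> float:
    """How many times larger new_params is than old_params."""
    return new_params / old_params


# Observed generation-over-generation growth.
observed = scale_ratio(V4_PRO_TOTAL, V3_TOTAL)

# Size a model would need to reach for a 100x leap over V3 (assumed baseline).
needed_for_100x = 100 * V3_TOTAL

# How the pending 10T threshold compares with each generation.
threshold_vs_v4 = scale_ratio(PENDING_THRESHOLD, V4_PRO_TOTAL)
threshold_vs_v3 = scale_ratio(PENDING_THRESHOLD, V3_TOTAL)

if __name__ == "__main__":
    print(f"V3 -> V4-Pro: {observed:.2f}x")
    print(f"100x over V3 would need {needed_for_100x / 1e12:.1f}T parameters")
    print(f"10T threshold: {threshold_vs_v4:.2f}x V4-Pro, {threshold_vs_v3:.1f}x V3")
```

On these figures, even the pending 10T milestone would fall well short of a 100x leap over V3, which is consistent with the PARTIAL classification.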
Evidence chain
Raw metadata
{
"source": "backfill_resolution_history.py",
"status": "partial",
"bayesian_v2": false,
"outcome_prob": 0.5,
"evidence_kind": "resolution_terminal",
"posterior_prob": 0.5,
"delta_to_outcome": 0.01482,
"inside_posterior": 0.48518,
"validation_notes": "DeepSeek V4 (Mar 2026) used FP4 training + ternary inference achieving GPT-4-class performance at ~1/40th the cost. Qwen/DeepSeek scaling laws validate quantization-first Chinese approach.",
"validation_status": "hit",
"pre_resolution_prob": 0.48518,
"resolution_evidence": "DeepSeek V4 (Mar 2026) used FP4 training + ternary inference achieving GPT-4-class performance at ~1/40th the cost. Qwen/DeepSeek scaling laws validate quantization-first Chinese approach.",
"does_not_update_current_prob": true
}
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| killer | TK09 Energy Grid Cap (Data Center Power Wall) | 35.0% | 0.050 | 0.550 | -0.079 |
| prereq | SEM_015 Nvidia agreed to remit 15% of China chip-sale revenue direct — Jensen Huang | 66.3% | 0.550 | 0.050 | -0.069 |
| prereq | SEM_027 Nvidia Data Center revenue +66% YoY, contributing ~90% of $5 — Joseph Moore | 68.3% | 0.550 | 0.050 | -0.068 |
| killer | TK05 Rate Regime Persistence (10y > 5% through 2028) | 30.0% | 0.050 | 0.550 | -0.054 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.550 | +0.046 |
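One way to read the conditional columns is through the law of total probability: if s is the listed node and c is this node, each row implies a marginal P(c) = P(s)·P(c|s=T) + (1 − P(s))·P(c|s=F). The sketch below applies that reading to three rows of the table. It is an interpretation only: the page does not state how the Δ implied column is derived, so this does not attempt to reproduce it, and the function name `implied_marginal` is hypothetical.

```python
def implied_marginal(p_source: float, p_c_given_true: float, p_c_given_false: float) -> float:
    """Marginal P(c) implied by one edge, via the law of total probability."""
    return p_source * p_c_given_true + (1.0 - p_source) * p_c_given_false


# (node id, their prob, P(c|s=T), P(c|s=F)) copied from the incoming-edge table.
incoming_rows = [
    ("TK09", 0.350, 0.050, 0.550),     # killer
    ("SEM_015", 0.663, 0.550, 0.050),  # prereq
    ("TK03", 0.100, 0.050, 0.550),     # killer
]

if __name__ == "__main__":
    for node_id, p_source, p_true, p_false in incoming_rows:
        print(node_id, round(implied_marginal(p_source, p_true, p_false), 4))
```

Read this way, a killer edge lowers the implied marginal as the killer becomes more likely, while a prereq edge raises it as the prerequisite becomes more likely.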
Top outgoing (children)
Predictions THIS node influences
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 247_023 AI will be able to do everything a white collar worker does — Dave Blundin | 40.8% | 0.720 | 0.050 | -0.058 |
| prereq | 244_019 Peter's son won't need a driver's license in 2 years — Peter Diamandis | 48.4% | 0.920 | 0.050 | -0.045 |
| prereq | 242_031 Most large companies' business models will be disrupted in 2 — Peter Diamandis | 36.1% | 0.650 | 0.050 | -0.042 |
| prereq | 230_020 Peter's 14-year-old son Milan will never get a driver's lice — Peter Diamandis | 34.7% | 0.650 | 0.050 | -0.028 |
| prereq | 232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis | 35.5% | 0.700 | 0.050 | -0.014 |
Ticker exposure
Beneficiaries (24)
Adverse (6)
Prerequisites (10)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | SEM_011 | Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips. | Capital Markets | — |
| prereq | SEM_027 | Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon. | Capital Markets | — |
| prereq | SEM_014 | Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025). | Manufacturing | — |
| prereq | SEM_012 | Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering. | AI/Manufacturing | — |
| prereq | SEM_015 | Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans. | Policy/Semis | — |
| killer | TK09 | Energy Grid Cap (Data Center Power Wall) | — | — |
| killer | TK05 | Rate Regime Persistence (10y > 5% through 2028) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK02 | AI Compute Supply Shock (TSMC/Taiwan Disruption) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (5)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 244_019 | Peter's son won't need a driver's license in 2 years | Auto/Transport | — |
| prereq | 247_023 | AI will be able to do everything a white collar worker does imminently | AI | — |
| prereq | 232_055 | We're exiting the industrial age permanently as recursive self-improvement unfolds. | AI | — |
| prereq | 242_031 | Most large companies' business models will be disrupted in 2-5 years | Markets/Stocks | — |
| prereq | 230_020 | Peter's 14-year-old son Milan will never get a driver's license. | Auto/Transport | — |
Expected milestones (1)
| Expected by | Description | Status |
|---|---|---|
| 2026-12-31 | [Geopolitics 2026-12] r loopholes; DeepSeek/Qwen/Kimi frontie [SEM_021] DeepSeek V5 release; Qwen 3 scale; MoE ternary-weight model benchmark releases [235_007] AI regulation window is this calendar year 2026 before chaos breaks out. | pending |
Validations (1)
| Observed at | Status | By | Notes |
|---|---|---|---|
| 2026-04-29 | hit | thesis_timeline_v1.0_import | DeepSeek V4 (Mar 2026) used FP4 training + ternary inference achieving GPT-4-class performance at ~1/40th the cost. Qwen/DeepSeek scaling laws validate quantization-first Chinese approach. |
Linked documents (10)
Raw metadata
{
"nia": false,
"qty": "100x model size",
"mode": "FORECAST",
"role": "Host-VC",
"context": "Blundin forecasts 100x model-size leap by 2026 via FP4/ternary quantization, proving embargoed nations can stay competitive via algorithmic bypasses.",
"to_year": 2026,
"conv_cues": "staggering; predicts",
"direction": "NUMERIC_TARGET",
"from_year": 2026,
"timeframe": "2026",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "DeepSeek V4 release confirmed",
"notes": "HIT — DeepSeek V4 launched. Total params went from V3's 671B → V4-Pro's 1.6T (~2.4x), not 100x. Prediction's 100x claim is OVER-stated; partial signal.",
"source": "https://medium.com/@leucopsis/deepseek-v4-review-a23ce940151c and BentoML guide. V4 Pro exposed via API on April 24, 2026.",
"status": "hit",
"weight": 0.4,
"ordinal": -6,
"source_id": null,
"confidence": 0.99,
"source_url": "https://medium.com/@leucopsis/deepseek-v4-review-a23ce940151c",
"expected_date": "2026-04-24",
"observed_date": "2026-04-24",
"research_origin": "deep_research",
"measurement_criterion": "DeepSeek-AI publicly releases V4 with V4-Pro at 1.6T total parameters, 49B activated per token (vs V3 at ~671B/37B)"
},
{
"kind": "llm_pre_event",
"label": "Quantization breakthrough — V4 reduces inference FLOPs to 27% of V3.2",
"notes": "HIT — V4-Pro reduces single-token FLOPs to 27% of V3.2. Confirms Blundin's quantization-breakthrough theme.",
"source": "DeepSeek V4 technical paper, Medium reviews",
"status": "hit",
"weight": 0.4,
"ordinal": -5,
"source_id": null,
"confidence": 0.99,
"source_url": "https://medium.com/@leucopsis/deepseek-v4-review-a23ce940151c",
"expected_date": "2026-04-24",
"observed_date": "2026-04-24",
"research_origin": "deep_research",
"measurement_criterion": "DeepSeek V4 paper/post claims FLOPs/token reduced to ≤30% of prior generation via Compressed Sparse Attention or peer architecture"
},
{
"kind": "prereq",
"label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
"status": "hit",
"weight": 0.5,
"ordinal": -4,
"source_id": "SEM_011",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
"status": "hit",
"weight": 0.5,
"ordinal": -3,
"source_id": "SEM_027",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
"status": "hit",
"weight": 0.5,
"ordinal": -2,
"source_id": "SEM_014",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
"status": "hit",
"weight": 0.5,
"ordinal": -1,
"source_id": "SEM_012",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "event",
"label": "Largest AI models will see a 100x leap in size by 2026, driven largely by Chinese research efforts using quantization breakthroughs.",
"status": "partial",
"weight": 1,
"ordinal": 0,
"source_id": "SEM_021",
"expected_date": "2026-05-01",
"observed_date": "2026-05-01"
},
{
"kind": "llm_pre_event",
"label": "Qwen 3.6 release with sub-10B competitive frontier",
"note
... (truncated)