← Cockpit
238_070predictionAIAI-scaling

Small language models will be where most algorithmic breakthroughs come from (crowdsourced)

Predictor: Alex Wissner-Gross · ep#238 "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238" · source

Prior probability
50.0%
Current probability
41.9%
evolves via intake + LBP
Conviction
4/5
Signal quality
B
Resolution
pending
Window
2027-01-01 – 2030-08-31
Edges in / out
10 / 5
Tickers exposed
37

Prediction text

Small language models will be where most algorithmic breakthroughs come from (crowdsourced) | At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings... But at the small end, I'm pretty sure that we'll look back in a few years time and we'll see at the small end... that's where the algorithmic innovations are going to come from and those can be crowdsourced.

Verbatim quote

From episode "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238"
At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings... But at the small end, I'm pretty sure that we'll look back in a few years time and we'll see at the small end... that's where the algorithmic innovations are going to come from and those can be crowdsourced.

Predictor: Alex Wissner-Gross

κ + Brier as of 2026-05-22
κ (discount)
0.844
Brier
0.0341
excellent
Hits / Misses
6 / 1
of 11 resolved
Hit rate
54.5%
Calibration plot (stated vs observed)

Evidence about this node from Alex Wissner-Gross is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

4 prob_history rows
0%25%50%75%100%prior 50%2026-04-302026-05-032026-05-10
intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 41.9%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.
Leading chain: 5 fired ✓ · 4 pending
  1. 2026-12-31pendingOpen-source SLM <=10B parameters matches GPT-4-class score on MMLU/HellaSwag
    How: Public benchmark report (HuggingFace Open LLM Leaderboard, Stanford HELM, lmsys-chat) shows model <=10B params scoring within 5% of GPT-4 (Mar-2024) on standardized MMLU and HellaSwag
    Source: Phi-4-mini 3.8B / Mistral / Gemma-2 trajectory 2026conf 85%
  2. 2026-04-30 → 2027-12-31pendingCrowdsourced fine-tune of <13B base model tops a major task leaderboard
    How: Community fine-tune (HuggingFace, Together AI, Modal community) of open SLM tops a SOTA task leaderboard (HumanEval, MATH, GSM8K, ARC) above any closed >100B model
    Source: Wissner-Gross crowdsourcing thesisconf 55%
  3. 2026-04-30 → 2028-12-31pendingMajor published architectural breakthrough originates from SLM research
    How: Top-tier peer-reviewed venue (NeurIPS/ICML/ICLR) accepts paper introducing novel architecture demonstrated first in <13B model that subsequently improves frontier; >=500 citations within 12 months
    Source: Mamba/Mistral pattern + Wissner-Gross small-end innovation thesisconf 50%
  4. 2027-12-31pendingSLMs power >50% of enterprise production AI inference workloads
    How: Gartner / Forrester / IDC enterprise AI workload survey reports >50% of production inference traffic served by SLMs (<=20B params), validates SLM dominance thesis
    Source: Gartner 2027 SLM 3x usage predictionconf 60%
  5. 2029-09-09pendingSLM crowdsourced algorithmic dominance recognized by 2029
    How: AI research community (Andrej Karpathy, Yann LeCun, or NeurIPS retrospective) explicitly states algorithmic breakthroughs of 2026-2029 era originated predominantly from open SLM research; not closed frontier labs
    Source: Wissner-Gross small-end crowdsource quoteconf 45%

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.
(live posterior: 42%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first
LBP2026-05-10T02:00:02Z41.9%-1.0pp
Network propagation: 42.9% → 41.9%
6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29
LBP2026-05-03T02:00:01Z42.9%-1.6pp
Network propagation: 44.5% → 42.9%
6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9
LBP2026-04-30T16:39:51Z44.5%-2.4pp
Network propagation: 46.8% → 44.5%
5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3
LBP2026-04-30T02:18:57Z46.8%-3.2pp
Network propagation: 50.0% → 46.8%
5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact
All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

KindNodeTheir probP(c|s=T)P(c|s=F)Δ implied
killerTK09
Energy Grid Cap (Data Center Power Wall)
35.0%0.0500.500-0.077
prereqSEM_027
Nvidia Data Center revenue +66% YoY, contributing ~90% of $5Joseph Moore
68.3%0.5000.050-0.067
killerTK05
Rate Regime Persistence (10y > 5% through 2028)
30.0%0.0500.500-0.054
killerTK03
AI Regulatory Moratorium (EU/US Capability Freeze)
10.0%0.0500.500+0.036
prereqSEM_012
Nvidia quadrupled chip production output while only doublingJensen Huang
75.0%0.5000.050-0.035

Top outgoing (children)

Predictions THIS node influences

KindNodeTheir probP(c|s=T)P(c|s=F)Δ implied
prereq231_038
TSMC fabs will come online in 5-6-7 years; Elon won't wait tDave Blundin
44.0%0.6000.050-0.163
prereq232_057
First person to arrive on moon/Mars will find bed made by OpDave Blundin
34.8%0.6000.050-0.071
prereq246_016
Dragonfly nuclear-powered octicopter arrives at Titan in 203Peter Diamandis
35.6%0.6500.050-0.059
prereq230_020
Peter's 14-year-old son Milan will never get a driver's licePeter Diamandis
34.7%0.6500.050-0.049
prereq240_036
TEPCO's restarted reactor will support 20% of Japan's electrPeter Diamandis
34.3%0.6500.050-0.045

Ticker exposure

37 ticker(s) linked

Beneficiaries (24)

MUWULFIRENEQIXALABAPLDASMIYASMLPLABNVDANBISCRWVAAPLAMTAMZNDELLGOOGLIRMLNVGYMETAMSFTORCLSFTBYSTX

Adverse (6)

ACNGENCHGGIBMWNSLRN

Prerequisites (10)

Predictions that must hit first
TypePredTitleDomainLag
prereqSEM_011Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.Capital Markets
prereqSEM_027Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.Capital Markets
prereqSEM_014Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).Manufacturing
prereqSEM_029Blackwell RTX PRO 5000 (72GB) engineered with 50% memory boost over previous generation — deliberate architectural concession for larger AI training.Semis/Products
prereqSEM_012Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering.AI/Manufacturing
killerTK09Energy Grid Cap (Data Center Power Wall)
killerTK05Rate Regime Persistence (10y > 5% through 2028)
killerTK01AGI Capability Plateau (2026-27 Training Stall)
killerTK02AI Compute Supply Shock (TSMC/Taiwan Disruption)
killerTK03AI Regulatory Moratorium (EU/US Capability Freeze)

Dependents (5)

Predictions enabled by this
TypePredTitleDomainLag
prereq230_020Peter's 14-year-old son Milan will never get a driver's license.Auto/Transport
prereq246_016Dragonfly nuclear-powered octicopter arrives at Titan in 2034.Space
prereq240_036TEPCO's restarted reactor will support 20% of Japan's electric needs by 2040Energy
prereq232_057First person to arrive on moon/Mars will find bed made by Optimus robots; robots not astronauts lead exploration.Robotics
prereq231_038TSMC fabs will come online in 5-6-7 years; Elon won't wait that long.AI

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook
{
  "nia": false,
  "url": "https://www.youtube.com/watch?v=d__HRChE2ZE",
  "mode": "PREDICTION",
  "role": "Host",
  "context": "At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings. we'll just build bigger and better and more post-trained models. But at the small end, I'm pretty sure that we'll look back in a few years time... that's where the algorithmic innovations are going to come from and those can be crowdsourced.",
  "to_year": 2030,
  "verbatim": "At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings... But at the small end, I'm pretty sure that we'll look back in a few years time and we'll see at the small end... that's where the algorithmic innovations are going to come from and those can be crowdsourced.",
  "conv_cues": "pretty sure",
  "direction": "HAPPEN",
  "from_year": 2027,
  "timeframe": "Few years",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "prereq",
      "label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -9,
      "source_id": "SEM_011",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -8,
      "source_id": "SEM_027",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -7,
      "source_id": "SEM_014",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Blackwell RTX PRO 5000 (72GB) engineered with 50% memory boost over previous generation — deliberate architectural concession for larger AI ",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -6,
      "source_id": "SEM_029",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -5,
      "source_id": "SEM_012",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "llm_pre_event",
      "label": "Open-source SLM <=10B parameters matches GPT-4-class score on MMLU/HellaSwag",
      "source": "Phi-4-mini 3.8B / Mistral / Gemma-2 trajectory 2026",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -4,
      "source_id": null,
      "confidence": 0.85,
      "source_url": "https://localaimaster.com/blog/small-language-models-guide-2026",
      "expected_date": "2026-12-31",
      "research_origin": "deep_research",
      "measurement_criterion": "Public benchmark report (HuggingFace Open LLM Leaderboard, Stanford HELM, lmsys-chat) shows model <=10B params scoring within 5% of GPT-4 (Mar-2024) on standardized MMLU and HellaSwag"
    },
    {
      "kind": "llm_pre_event",
      "label": "Crowdsourced fine-tune of <13B base model tops a major task leaderboard",
      "source": "Wissner-Gross crowdsourcing thesis",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -3,
      "source_id": null,
      "confidence": 0.55,
      "source_url": "https://www.bentoml.com/blog/the-best-open-source-small-language-models",
      "expected_date": "2027-03-01",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-12-31",
        "from": 
... (truncated)