238_070predictionAIAI-scaling

Small language models will be where most algorithmic breakthroughs come from (crowdsourced)

Predictor: Alex Wissner-Gross · ep#238 "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238" · source

Prior probability

50.0%

Current probability

41.9%

evolves via intake + LBP

Conviction

4/5

Signal quality

Resolution

pending

Window

2027-01-01 – 2030-08-31

Edges in / out

10 / 5

Tickers exposed

Prediction text

Small language models will be where most algorithmic breakthroughs come from (crowdsourced) | At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings... But at the small end, I'm pretty sure that we'll look back in a few years time and we'll see at the small end... that's where the algorithmic innovations are going to come from and those can be crowdsourced.

Verbatim quote

From episode "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238"

At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings... But at the small end, I'm pretty sure that we'll look back in a few years time and we'll see at the small end... that's where the algorithmic innovations are going to come from and those can be crowdsourced.

Predictor: Alex Wissner-Gross

κ + Brier as of 2026-05-22

Full calibration →

κ (discount)

0.844

Brier

0.0341

excellent

Hits / Misses

6 / 1

of 11 resolved

Hit rate

54.5%

Calibration plot (stated vs observed)

Evidence about this node from Alex Wissner-Gross is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

4 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 41.9%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 5 fired ✓ · 4 pending

2026-04-29hitNvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.
2026-04-29hitNvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.
2026-04-29hitNvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).
2026-04-29hitBlackwell RTX PRO 5000 (72GB) engineered with 50% memory boost over previous generation — deliberate architectural concession for larger AI
2026-04-29hitNvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a
2026-12-31pendingOpen-source SLM <=10B parameters matches GPT-4-class score on MMLU/HellaSwag
How: Public benchmark report (HuggingFace Open LLM Leaderboard, Stanford HELM, lmsys-chat) shows model <=10B params scoring within 5% of GPT-4 (Mar-2024) on standardized MMLU and HellaSwag
Source: Phi-4-mini 3.8B / Mistral / Gemma-2 trajectory 2026conf 85%
2026-04-30 → 2027-12-31pendingCrowdsourced fine-tune of <13B base model tops a major task leaderboard
How: Community fine-tune (HuggingFace, Together AI, Modal community) of open SLM tops a SOTA task leaderboard (HumanEval, MATH, GSM8K, ARC) above any closed >100B model
Source: Wissner-Gross crowdsourcing thesisconf 55%
2026-04-30 → 2028-12-31pendingMajor published architectural breakthrough originates from SLM research
How: Top-tier peer-reviewed venue (NeurIPS/ICML/ICLR) accepts paper introducing novel architecture demonstrated first in <13B model that subsequently improves frontier; >=500 citations within 12 months
Source: Mamba/Mistral pattern + Wissner-Gross small-end innovation thesisconf 50%
2027-12-31pendingSLMs power >50% of enterprise production AI inference workloads
How: Gartner / Forrester / IDC enterprise AI workload survey reports >50% of production inference traffic served by SLMs (<=20B params), validates SLM dominance thesis
Source: Gartner 2027 SLM 3x usage predictionconf 60%
2029-09-09pendingSmall language models will be where most algorithmic breakthroughs come from (crowdsourced)
2029-09-09pendingSLM crowdsourced algorithmic dominance recognized by 2029
How: AI research community (Andrej Karpathy, Yann LeCun, or NeurIPS retrospective) explicitly states algorithmic breakthroughs of 2026-2029 era originated predominantly from open SLM research; not closed frontier labs
Source: Wissner-Gross small-end crowdsource quoteconf 45%
2031-06-17pendingFirst person to arrive on moon/Mars will find bed made by Optimus robots; robots not astronauts lead exploration.
2032-12-02pendingTSMC fabs will come online in 5-6-7 years; Elon won't wait that long.
2034-09-16pendingDragonfly nuclear-powered octicopter arrives at Titan in 2034.
2040-07-06pendingTEPCO's restarted reactor will support 20% of Japan's electric needs by 2040
2063-06-21pendingPeter's 14-year-old son Milan will never get a driver's license.

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 42%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

LBP2026-05-10T02:00:02Z41.9%-1.0pp

Network propagation: 42.9% → 41.9%

6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29

LBP2026-05-03T02:00:01Z42.9%-1.6pp

Network propagation: 44.5% → 42.9%

6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9

LBP2026-04-30T16:39:51Z44.5%-2.4pp

Network propagation: 46.8% → 44.5%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z46.8%-3.2pp

Network propagation: 50.0% → 46.8%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
killer	TK09 Energy Grid Cap (Data Center Power Wall)	35.0%	0.050	0.500	-0.077
prereq	SEM_027 Nvidia Data Center revenue +66% YoY, contributing ~90% of $5 — Joseph Moore	68.3%	0.500	0.050	-0.067
killer	TK05 Rate Regime Persistence (10y > 5% through 2028)	30.0%	0.050	0.500	-0.054
killer	TK03 AI Regulatory Moratorium (EU/US Capability Freeze)	10.0%	0.050	0.500	+0.036
prereq	SEM_012 Nvidia quadrupled chip production output while only doubling — Jensen Huang	75.0%	0.500	0.050	-0.035

Top outgoing (children)

Predictions THIS node influences

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
prereq	231_038 TSMC fabs will come online in 5-6-7 years; Elon won't wait t — Dave Blundin	44.0%	0.600	0.050	-0.163
prereq	232_057 First person to arrive on moon/Mars will find bed made by Op — Dave Blundin	34.8%	0.600	0.050	-0.071
prereq	246_016 Dragonfly nuclear-powered octicopter arrives at Titan in 203 — Peter Diamandis	35.6%	0.650	0.050	-0.059
prereq	230_020 Peter's 14-year-old son Milan will never get a driver's lice — Peter Diamandis	34.7%	0.650	0.050	-0.049
prereq	240_036 TEPCO's restarted reactor will support 20% of Japan's electr — Peter Diamandis	34.3%	0.650	0.050	-0.045

Ticker exposure

37 ticker(s) linked

Beneficiaries (24)

MU WULF IREN EQIX ALAB APLD ASMIY ASML PLAB NVDA NBIS CRWV AAPL AMT AMZN DELL GOOGL IRM LNVGY META MSFT ORCL SFTBY STX

Adverse (6)

ACN GEN CHGG IBM WNS LRN

Prerequisites (10)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
prereq	SEM_011	Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.	Capital Markets	—
prereq	SEM_027	Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.	Capital Markets	—
prereq	SEM_014	Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).	Manufacturing	—
prereq	SEM_029	Blackwell RTX PRO 5000 (72GB) engineered with 50% memory boost over previous generation — deliberate architectural concession for larger AI training.	Semis/Products	—
prereq	SEM_012	Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering.	AI/Manufacturing	—
killer	TK09	Energy Grid Cap (Data Center Power Wall)	—	—
killer	TK05	Rate Regime Persistence (10y > 5% through 2028)	—	—
killer	TK01	AGI Capability Plateau (2026-27 Training Stall)	—	—
killer	TK02	AI Compute Supply Shock (TSMC/Taiwan Disruption)	—	—
killer	TK03	AI Regulatory Moratorium (EU/US Capability Freeze)	—	—

Dependents (5)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
prereq	230_020	Peter's 14-year-old son Milan will never get a driver's license.	Auto/Transport	—
prereq	246_016	Dragonfly nuclear-powered octicopter arrives at Titan in 2034.	Space	—
prereq	240_036	TEPCO's restarted reactor will support 20% of Japan's electric needs by 2040	Energy	—
prereq	232_057	First person to arrive on moon/Mars will find bed made by Optimus robots; robots not astronauts lead exploration.	Robotics	—
prereq	231_038	TSMC fabs will come online in 5-6-7 years; Elon won't wait that long.	AI	—

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.748	arxiv	Can Crowdsourcing Survive the LLM Era? A Community Survey on Human Data Collection	—	mentions	pending	2026-06-03
0.711	arxiv	Anticipating Innovation Using Large Language Models	—	mentions	pending	2026-05-06
0.711	arxiv	Ex Ante Evaluation of AI-Induced Idea Diversity Collapse	—	mentions	pending	2026-05-07
0.702	arxiv	A Hierarchical Language Model with Predictable Scaling Laws and Provable Benefits of Reasoning	—	mentions	pending	2026-05-13
0.693	arxiv	How to Scale Mixture-of-Experts: From muP to the Maximally Scale-Stable Parameterization	—	mentions	pending	2026-05-13
0.693	arxiv	When is Warmstarting Effective for Scaling Language Models?	—	mentions	pending	2026-05-13
0.687	arxiv	ProbeScale: Probing Analysis to Optimize Neural Scaling Laws for Efficient Small Language Model Inference	—	mentions	pending	2026-06-01
0.686	arxiv	Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention	—	mentions	pending	2026-05-28
0.683	arxiv	Interestingness as an Inductive Heuristic for Future Compression Progress	—	mentions	pending	2026-05-14
0.682	arxiv	On Language Generation in the Limit with Bounded Memory	—	mentions	pending	2026-05-28

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "url": "https://www.youtube.com/watch?v=d__HRChE2ZE",
  "mode": "PREDICTION",
  "role": "Host",
  "context": "At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings. we'll just build bigger and better and more post-trained models. But at the small end, I'm pretty sure that we'll look back in a few years time... that's where the algorithmic innovations are going to come from and those can be crowdsourced.",
  "to_year": 2030,
  "verbatim": "At the high end, scaling hypothesis seems to continue to hold. There are no glass ceilings... But at the small end, I'm pretty sure that we'll look back in a few years time and we'll see at the small end... that's where the algorithmic innovations are going to come from and those can be crowdsourced.",
  "conv_cues": "pretty sure",
  "direction": "HAPPEN",
  "from_year": 2027,
  "timeframe": "Few years",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "prereq",
      "label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -9,
      "source_id": "SEM_011",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -8,
      "source_id": "SEM_027",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025).",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -7,
      "source_id": "SEM_014",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Blackwell RTX PRO 5000 (72GB) engineered with 50% memory boost over previous generation — deliberate architectural concession for larger AI ",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -6,
      "source_id": "SEM_029",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -5,
      "source_id": "SEM_012",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "llm_pre_event",
      "label": "Open-source SLM <=10B parameters matches GPT-4-class score on MMLU/HellaSwag",
      "source": "Phi-4-mini 3.8B / Mistral / Gemma-2 trajectory 2026",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -4,
      "source_id": null,
      "confidence": 0.85,
      "source_url": "https://localaimaster.com/blog/small-language-models-guide-2026",
      "expected_date": "2026-12-31",
      "research_origin": "deep_research",
      "measurement_criterion": "Public benchmark report (HuggingFace Open LLM Leaderboard, Stanford HELM, lmsys-chat) shows model <=10B params scoring within 5% of GPT-4 (Mar-2024) on standardized MMLU and HellaSwag"
    },
    {
      "kind": "llm_pre_event",
      "label": "Crowdsourced fine-tune of <13B base model tops a major task leaderboard",
      "source": "Wissner-Gross crowdsourcing thesis",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -3,
      "source_id": null,
      "confidence": 0.55,
      "source_url": "https://www.bentoml.com/blog/the-best-open-source-small-language-models",
      "expected_date": "2027-03-01",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-12-31",
        "from": 
... (truncated)