238_071predictionAIAI-scaling

Future AI models may compress all human knowledge into megabytes via post-transformer breakthroughs

Predictor: Alex Wissner-Gross · ep#238 "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238" · source

Prior probability

45.0%

Current probability

34.7%

evolves via intake + LBP

Conviction

3/5

Signal quality

Resolution

pending

Window

2026-04-30 – 2029-03-31

Edges in / out

6 / 0

Tickers exposed

Prediction text

Future AI models may compress all human knowledge into megabytes via post-transformer breakthroughs | if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller... will be something maybe even in the megabytes.

Verbatim quote

From episode "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238"

if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller... will be something maybe even in the megabytes.

Predictor: Alex Wissner-Gross

κ + Brier as of 2026-05-22

Full calibration →

κ (discount)

0.844

Brier

0.0341

excellent

Hits / Misses

6 / 1

of 11 resolved

Hit rate

54.5%

Calibration plot (stated vs observed)

Evidence about this node from Alex Wissner-Gross is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

4 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 34.7%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 7 pending

2026-10-05pendingQ1 window check-in (25%)
2026-06-01 → 2027-09-30pendingMajor academic paper demonstrates state-of-the-art LLM at <1B parameters on a benchmark within 5 points of frontier
How: ICLR/NeurIPS-tier paper or arXiv preprint demonstrates a model at <1B parameters reaching within 5 points of GPT-5/Gemini-3/Claude on at least one major benchmark (MMLU, HumanEval, GSM8K)
Source: Microsoft Phi-4 series; Google Gemma 3; Mistral Small 2026; ongoing model-compression literatureconf 55%
2027-03-12pendingQ2 window check-in (50%)
2026-06-01 → 2027-12-31pendingMainstream non-transformer architecture (Mamba, RWKV, Hyena, RetNet) wins on ChatBot Arena top 20
How: A model based on a fundamentally non-transformer architecture (state-space, recurrent, etc.) appears in ChatBot Arena top-20 leaderboard. Required precondition for 'post-transformer breakthrough' framing.
Source: Mamba (Albert Gu/Tri Dao); Phi-4-mini-flash hybrid attention 2025-2026conf 45%
2026-09-01 → 2028-06-30pendingFrontier lab announces 'next-generation architecture' departure from transformer scaling
How: OpenAI, Anthropic, Google DeepMind, Meta, or xAI publicly announces a new model framed as fundamentally post-transformer, with technical paper showing architectural change
Source: AI scaling literature 2025-2026; LeCun JEPA / world-models thesisconf 40%
2026-09-01 → 2028-06-30pendingKolmogorov-complexity / compression-prize benchmark shows AI compressing knowledge below transformer-floor
How: Hutter Prize-style enwik9 compression benchmark or equivalent shows AI-driven compression at <90% of best transformer, supporting 'all knowledge in megabytes' framing
Source: Hutter Prize (enwik9 compression); information-theoretic AI boundsconf 30%
2027-08-17pendingQ3 window check-in (75%)
2028-01-23pendingFuture AI models may compress all human knowledge into megabytes via post-transformer breakthroughs
2027-06-01 → 2029-06-30pendingFirst production LLM ships at <100MB total size with utility comparable to GPT-3
How: Public production model under 100MB (post-quantization) used in a shipping product, reaching GPT-3-class quality on standard benchmarks. Wissner-Gross specifically pointed to 'megabytes' scale.
Source: Quantization research (GPTQ, AWQ, EETQ); on-device LLM trends Apple Intelligenceconf 50%
2028-06-01 → 2030-12-31pendingCascade: On-device AGI-class models ship with major OS
How: Apple, Google Android, or Microsoft ships major OS update with frontier-capable AI model running fully on-device (no cloud)
Source: Cascade from compression milestonesconf 35%

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 35%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

LBP2026-05-10T02:00:02Z34.7%-1.4pp

Network propagation: 36.1% → 34.7%

6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29

LBP2026-05-03T02:00:01Z36.1%-2.8pp

Network propagation: 38.9% → 36.1%

6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9

LBP2026-04-30T16:39:51Z38.9%-2.1pp

Network propagation: 40.9% → 38.9%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z40.9%-4.1pp

Network propagation: 45.0% → 40.9%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
prereq	S_AGI_MID_2029 AGI mid: Kurzweil 2029 path	35.0%	0.450	0.050	-0.157
killer	TK03 AI Regulatory Moratorium (EU/US Capability Freeze)	10.0%	0.050	0.450	+0.063
killer	TK02 AI Compute Supply Shock (TSMC/Taiwan Disruption)	12.0%	0.050	0.450	+0.055
killer	TK01 AGI Capability Plateau (2026-27 Training Stall)	15.0%	0.050	0.450	+0.043
killer	TK09 Energy Grid Cap (Data Center Power Wall)	35.0%	0.050	0.450	-0.037

Top outgoing (children)

Predictions THIS node influences

No outgoing edges.

Ticker exposure

37 ticker(s) linked

Beneficiaries (24)

MU WULF IREN EQIX ALAB APLD ASMIY ASML PLAB NVDA NBIS CRWV AAPL AMT AMZN DELL GOOGL IRM LNVGY META MSFT ORCL SFTBY STX

Adverse (6)

ACN GEN CHGG IBM WNS LRN

Prerequisites (6)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
prereq	S_AGI_MID_2029	AGI mid: Kurzweil 2029 path	agi_general_capability	—
killer	TK09	Energy Grid Cap (Data Center Power Wall)	—	—
killer	TK05	Rate Regime Persistence (10y > 5% through 2028)	—	—
killer	TK01	AGI Capability Plateau (2026-27 Training Stall)	—	—
killer	TK02	AI Compute Supply Shock (TSMC/Taiwan Disruption)	—	—
killer	TK03	AI Regulatory Moratorium (EU/US Capability Freeze)	—	—

Dependents (0)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
No dependents

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.712	arxiv	Human-AI Co-Evolution and Epistemic Collapse: A Dynamical Systems Perspective	—	mentions	pending	2026-05-07
0.685	manifold	Will an AI system discover information theory from first principles before 2031?	35%	mentions	pending	2026-05-01
0.684	arxiv	Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention	—	mentions	pending	2026-05-28
0.682	arxiv	On What We Can Learn from Low-Resolution Data	—	mentions	pending	2026-05-12
0.679	arxiv	Anticipating Innovation Using Large Language Models	—	mentions	pending	2026-05-06
0.677	arxiv	Anti Mode-Collapse in Mean-Field Transformer via Auxiliary Variables	—	mentions	pending	2026-05-28
0.676	arxiv	Lower bounds for one-layer transformers that compute parity	—	mentions	pending	2026-05-12
0.671	arxiv	The Impossibility Triangle of Long-Context Modeling	—	mentions	pending	2026-05-06
0.668	arxiv	The Expressive Power of Low Precision Softmax Transformers with (Summarized) Chain-of-Thought	—	mentions	pending	2026-05-18
0.668	arxiv	When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop	—	mentions	pending	2026-05-28

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "qty": "megabytes",
  "url": "https://www.youtube.com/watch?v=d__HRChE2ZE",
  "mode": "BET",
  "role": "Host",
  "caveats": "bet",
  "context": "if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller and we took all of the internet and we compressed it down to single gigabytes or tens or hundreds of gigabytes, compresses down even further. There's some phase transition out there that's waiting to be discovered... will be something maybe even in the megabytes.",
  "verbatim": "if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller... will be something maybe even in the megabytes.",
  "conv_cues": "if I had to bet",
  "direction": "DOWN",
  "timeframe": "Future",
  "conv_level": "MEDIUM",
  "milestones": [
    {
      "kind": "quartile_checkpoint",
      "label": "Q1 window check-in (25%)",
      "status": "pending",
      "weight": 0.05,
      "ordinal": -7,
      "source_id": null,
      "expected_date": "2026-10-05",
      "observed_date": null
    },
    {
      "kind": "llm_pre_event",
      "label": "Major academic paper demonstrates state-of-the-art LLM at <1B parameters on a benchmark within 5 points of frontier",
      "source": "Microsoft Phi-4 series; Google Gemma 3; Mistral Small 2026; ongoing model-compression literature",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -6,
      "source_id": null,
      "confidence": 0.55,
      "expected_date": "2027-01-30",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-09-30",
        "from": "2026-06-01"
      },
      "measurement_criterion": "ICLR/NeurIPS-tier paper or arXiv preprint demonstrates a model at <1B parameters reaching within 5 points of GPT-5/Gemini-3/Claude on at least one major benchmark (MMLU, HumanEval, GSM8K)"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q2 window check-in (50%)",
      "status": "pending",
      "weight": 0.05,
      "ordinal": -5,
      "source_id": null,
      "expected_date": "2027-03-12",
      "observed_date": null
    },
    {
      "kind": "llm_pre_event",
      "label": "Mainstream non-transformer architecture (Mamba, RWKV, Hyena, RetNet) wins on ChatBot Arena top 20",
      "source": "Mamba (Albert Gu/Tri Dao); Phi-4-mini-flash hybrid attention 2025-2026",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -4,
      "source_id": null,
      "confidence": 0.45,
      "expected_date": "2027-03-17",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2027-12-31",
        "from": "2026-06-01"
      },
      "measurement_criterion": "A model based on a fundamentally non-transformer architecture (state-space, recurrent, etc.) appears in ChatBot Arena top-20 leaderboard. Required precondition for 'post-transformer breakthrough' framing."
    },
    {
      "kind": "llm_pre_event",
      "label": "Frontier lab announces 'next-generation architecture' departure from transformer scaling",
      "source": "AI scaling literature 2025-2026; LeCun JEPA / world-models thesis",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -3,
      "source_id": null,
      "confidence": 0.4,
      "expected_date": "2027-08-01",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2028-06-30",
        "from": "2026-09-01"
      },
      "measurement_criterion": "OpenAI, Anthropic, Google DeepMind, Meta, or xAI publicly announces a new model framed as fundamentally post-transformer, with technical paper showing architectural change"
    },
    {
      "kind": "llm_pre_event",
      "label": "Kolmogorov-complexity / compression-prize benchmark shows AI compressing knowledge below transformer-floor",
      "source": "Hutter Prize (enwik9 compression); information-theoretic AI bounds",
      "status": "pending",
      "weight": 0.4,
      "ordina
... (truncated)