Future AI models may compress all human knowledge into megabytes via post-transformer breakthroughs
Predictor: Alex Wissner-Gross · ep#238 "Meta Buys Moltbook, GPT 5.4, and Fruitfly Brain Upload | Moonshots Live at The Abundance Summit 238" · source
Prediction text
Future AI models may compress all human knowledge into megabytes via post-transformer breakthroughs | if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller... will be something maybe even in the megabytes.
Verbatim quote
if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller... will be something maybe even in the megabytes.
Predictor: Alex Wissner-Gross
Calibration plot (stated vs observed)
Evidence about this node from Alex Wissner-Gross is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-10-05pendingQ1 window check-in (25%)
- 2026-06-01 → 2027-09-30pendingMajor academic paper demonstrates state-of-the-art LLM at <1B parameters on a benchmark within 5 points of frontierHow: ICLR/NeurIPS-tier paper or arXiv preprint demonstrates a model at <1B parameters reaching within 5 points of GPT-5/Gemini-3/Claude on at least one major benchmark (MMLU, HumanEval, GSM8K)Source: Microsoft Phi-4 series; Google Gemma 3; Mistral Small 2026; ongoing model-compression literatureconf 55%
- 2027-03-12pendingQ2 window check-in (50%)
- 2026-06-01 → 2027-12-31pendingMainstream non-transformer architecture (Mamba, RWKV, Hyena, RetNet) wins on ChatBot Arena top 20How: A model based on a fundamentally non-transformer architecture (state-space, recurrent, etc.) appears in ChatBot Arena top-20 leaderboard. Required precondition for 'post-transformer breakthrough' framing.Source: Mamba (Albert Gu/Tri Dao); Phi-4-mini-flash hybrid attention 2025-2026conf 45%
- 2026-09-01 → 2028-06-30pendingFrontier lab announces 'next-generation architecture' departure from transformer scalingHow: OpenAI, Anthropic, Google DeepMind, Meta, or xAI publicly announces a new model framed as fundamentally post-transformer, with technical paper showing architectural changeSource: AI scaling literature 2025-2026; LeCun JEPA / world-models thesisconf 40%
- 2026-09-01 → 2028-06-30pendingKolmogorov-complexity / compression-prize benchmark shows AI compressing knowledge below transformer-floorHow: Hutter Prize-style enwik9 compression benchmark or equivalent shows AI-driven compression at <90% of best transformer, supporting 'all knowledge in megabytes' framingSource: Hutter Prize (enwik9 compression); information-theoretic AI boundsconf 30%
- 2027-08-17pendingQ3 window check-in (75%)
- 2027-06-01 → 2029-06-30pendingFirst production LLM ships at <100MB total size with utility comparable to GPT-3How: Public production model under 100MB (post-quantization) used in a shipping product, reaching GPT-3-class quality on standard benchmarks. Wissner-Gross specifically pointed to 'megabytes' scale.Source: Quantization research (GPTQ, AWQ, EETQ); on-device LLM trends Apple Intelligenceconf 50%
- 2028-06-01 → 2030-12-31pendingCascade: On-device AGI-class models ship with major OSHow: Apple, Google Android, or Microsoft ships major OS update with frontier-capable AI model running fully on-device (no cloud)Source: Cascade from compression milestonesconf 35%
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | S_AGI_MID_2029 AGI mid: Kurzweil 2029 path | 35.0% | 0.450 | 0.050 | -0.157 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.450 | +0.063 |
| killer | TK02 AI Compute Supply Shock (TSMC/Taiwan Disruption) | 12.0% | 0.050 | 0.450 | +0.055 |
| killer | TK01 AGI Capability Plateau (2026-27 Training Stall) | 15.0% | 0.050 | 0.450 | +0.043 |
| killer | TK09 Energy Grid Cap (Data Center Power Wall) | 35.0% | 0.050 | 0.450 | -0.037 |
Top outgoing (children)
Predictions THIS node influences
No outgoing edges.
Ticker exposure
Beneficiaries (24)
Adverse (6)
Prerequisites (6)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | S_AGI_MID_2029 | AGI mid: Kurzweil 2029 path | agi_general_capability | — |
| killer | TK09 | Energy Grid Cap (Data Center Power Wall) | — | — |
| killer | TK05 | Rate Regime Persistence (10y > 5% through 2028) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK02 | AI Compute Supply Shock (TSMC/Taiwan Disruption) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No dependents | ||||
Linked documents (10)
Raw metadata
{
"nia": false,
"qty": "megabytes",
"url": "https://www.youtube.com/watch?v=d__HRChE2ZE",
"mode": "BET",
"role": "Host",
"caveats": "bet",
"context": "if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller and we took all of the internet and we compressed it down to single gigabytes or tens or hundreds of gigabytes, compresses down even further. There's some phase transition out there that's waiting to be discovered... will be something maybe even in the megabytes.",
"verbatim": "if I had to bet, I'd bet that it's some sort of radical post transformer advance where the models get even smaller... will be something maybe even in the megabytes.",
"conv_cues": "if I had to bet",
"direction": "DOWN",
"timeframe": "Future",
"conv_level": "MEDIUM",
"milestones": [
{
"kind": "quartile_checkpoint",
"label": "Q1 window check-in (25%)",
"status": "pending",
"weight": 0.05,
"ordinal": -7,
"source_id": null,
"expected_date": "2026-10-05",
"observed_date": null
},
{
"kind": "llm_pre_event",
"label": "Major academic paper demonstrates state-of-the-art LLM at <1B parameters on a benchmark within 5 points of frontier",
"source": "Microsoft Phi-4 series; Google Gemma 3; Mistral Small 2026; ongoing model-compression literature",
"status": "pending",
"weight": 0.4,
"ordinal": -6,
"source_id": null,
"confidence": 0.55,
"expected_date": "2027-01-30",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2027-09-30",
"from": "2026-06-01"
},
"measurement_criterion": "ICLR/NeurIPS-tier paper or arXiv preprint demonstrates a model at <1B parameters reaching within 5 points of GPT-5/Gemini-3/Claude on at least one major benchmark (MMLU, HumanEval, GSM8K)"
},
{
"kind": "quartile_checkpoint",
"label": "Q2 window check-in (50%)",
"status": "pending",
"weight": 0.05,
"ordinal": -5,
"source_id": null,
"expected_date": "2027-03-12",
"observed_date": null
},
{
"kind": "llm_pre_event",
"label": "Mainstream non-transformer architecture (Mamba, RWKV, Hyena, RetNet) wins on ChatBot Arena top 20",
"source": "Mamba (Albert Gu/Tri Dao); Phi-4-mini-flash hybrid attention 2025-2026",
"status": "pending",
"weight": 0.4,
"ordinal": -4,
"source_id": null,
"confidence": 0.45,
"expected_date": "2027-03-17",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2027-12-31",
"from": "2026-06-01"
},
"measurement_criterion": "A model based on a fundamentally non-transformer architecture (state-space, recurrent, etc.) appears in ChatBot Arena top-20 leaderboard. Required precondition for 'post-transformer breakthrough' framing."
},
{
"kind": "llm_pre_event",
"label": "Frontier lab announces 'next-generation architecture' departure from transformer scaling",
"source": "AI scaling literature 2025-2026; LeCun JEPA / world-models thesis",
"status": "pending",
"weight": 0.4,
"ordinal": -3,
"source_id": null,
"confidence": 0.4,
"expected_date": "2027-08-01",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2028-06-30",
"from": "2026-09-01"
},
"measurement_criterion": "OpenAI, Anthropic, Google DeepMind, Meta, or xAI publicly announces a new model framed as fundamentally post-transformer, with technical paper showing architectural change"
},
{
"kind": "llm_pre_event",
"label": "Kolmogorov-complexity / compression-prize benchmark shows AI compressing knowledge below transformer-floor",
"source": "Hutter Prize (enwik9 compression); information-theoretic AI bounds",
"status": "pending",
"weight": 0.4,
"ordina
... (truncated)