Post-transformer architectures will make a 1000x cost reduction look like child's play
Predictor: Alex Wissner-Gross · ep#240 "NVIDIA's $1 Trillion Prediction, Anthropic Beats OpenAI, Tesla vs. TSMC & The CS Job Collapse" · source
Prediction text
Post-transformer architectures will make a 1000x cost reduction look like child's play | We're going to see post transformer architectures that make a thousandx reduction in cost look like child's play.
Verbatim quote
We're going to see post transformer architectures that make a thousandx reduction in cost look like child's play.
Predictor: Alex Wissner-Gross
Calibration plot (stated vs observed)
Evidence about this node from Alex Wissner-Gross is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2025-11-30hitIBM Granite 4.0 hybrid Mamba-2 ships to enterprise with >70% RAM reductionHow: IBM publicly releases Granite 4.0 hybrid Mamba/Transformer with documented >70% RAM reduction for long-contextSource: https://www.infoq.com/news/2025/11/ibm-granite-mamba2-enterprise/conf 95%Notes: HIT — Granite 4 with hybrid arch released Nov 2025.
- 2026-02-15hitDeepSeek V4 1T-param model claims 10-40x cost reduction vs Western peersHow: DeepSeek V4 release with documented benchmarks showing >10x cost-per-token reduction vs comparable Western frontier modelSource: https://introl.com/blog/deepseek-v4-trillion-parameter-coding-model-february-2026conf 90%
- 2026-05-31hitMamba-3 published at ICLR 2026 with sub-Transformer latency at 1.5B scaleHow: Mamba-3 paper accepted at ICLR 2026 demonstrating beat-Transformer latency at scaleSource: https://openreview.net/pdf?id=HwCvaJOiCjconf 95%Notes: HIT — Mamba-3 SISO beats Mamba-2/Llama-3.2 on prefill+decode latency.
- 2026-06-01 → 2027-06-30pendingHybrid attention-SSM becomes default architecture in major frontier modelHow: OpenAI, Anthropic, Google, or Meta releases frontier-class model using hybrid Mamba/SSM as primary architectureSource: https://www.askaibrain.com/en/posts/end-of-transformers-hybrids-attention-state-space-2025conf 45%
- 2026-06-01 → 2027-12-31pendingFrontier post-Transformer architecture demonstrates 1000x+ inference cost reduction at frontier scaleHow: Major lab publishes benchmark showing post-Transformer model achieves >1,000x cost reduction at GPT-4-class capabilitySource: https://venturebeat.com/technology/open-source-mamba-3-arrives-to-surpass-transformer-architecture-with-nearlyconf 40%
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 234_012 Anthropic revenue will cross OpenAI revenue in middle of 202 — Peter Diamandis | 67.1% | 0.500 | 0.050 | -0.074 |
| prereq | SEM_042 2025 will be the definitive year that agentic systems finall — Kevin Weil | 73.8% | 0.500 | 0.050 | -0.045 |
| prereq | SEM_012 Nvidia quadrupled chip production output while only doubling — Jensen Huang | 75.0% | 0.500 | 0.050 | -0.039 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.500 | +0.032 |
| prereq | SEM_008 Training runs costing $10 billion for a single model will co — Dario Amodei | 76.9% | 0.500 | 0.050 | -0.030 |
Top outgoing (children)
Predictions THIS node influences
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 231_013 Math is cooked (will be solved), physics cooked, biology cha — Alex Wissner-Gross | 35.4% | 0.620 | 0.050 | -0.067 |
| prereq | 241_043 ASI will arrive within 2 years to 5 years to this next decad — Peter Diamandis | 35.9% | 0.650 | 0.050 | -0.059 |
| prereq | CMQ_002 By 2028, AI systems will reach 'independent researcher' leve — Sam Altman | 31.4% | 0.550 | 0.050 | -0.056 |
| prereq | 235_030 Ray Kurzweil predicts Longevity Escape Velocity (LEV) by 203 — Ray Kurzweil | 39.2% | 0.750 | 0.050 | -0.051 |
| prereq | 232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis | 35.5% | 0.700 | 0.050 | -0.034 |
Ticker exposure
Beneficiaries (23)
Adverse (6)
Prerequisites (8)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | SEM_008 | Training runs costing $10 billion for a single model will commence sometime in 2025. | AI | — |
| prereq | 238_009 | Recursive self-improvement is already happening now (no longer three years out) | AI | — |
| prereq | 234_012 | Anthropic revenue will cross OpenAI revenue in middle of 2026 | Markets/Stocks | — |
| prereq | SEM_012 | Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering. | AI/Manufacturing | — |
| prereq | SEM_042 | 2025 will be the definitive year that agentic systems finally hit the mainstream. | AI/Agents | — |
| killer | TK14 | Superbubble Pop (S&P 500 -40%, Moonshot Capital Evaporates) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (5)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 235_030 | Ray Kurzweil predicts Longevity Escape Velocity (LEV) by 2033. | Biotech/Longevity | — |
| prereq | 232_055 | We're exiting the industrial age permanently as recursive self-improvement unfolds. | AI | — |
| prereq | 241_043 | ASI will arrive within 2 years to 5 years to this next decade | AI | — |
| prereq | 231_013 | Math is cooked (will be solved), physics cooked, biology char broiled. | AI | — |
| prereq | CMQ_002 | By 2028, AI systems will reach 'independent researcher' level — driving autonomous scientific discoveries without human intervention. | AI | — |
Linked documents (3)
| Sim | Source | Title | Market prob | Polarity | Reviewed | Published |
|---|---|---|---|---|---|---|
| 0.640 | github_release | huggingface/transformers v5.0.0rc0 | — | mentions | pending | 2025-12-01 |
| 0.636 | github_release | huggingface/transformers v5.0.0 | — | mentions | pending | 2026-01-26 |
| 0.603 | github_release | pytorch/pytorch v2.0.0 | — | mentions | pending | 2023-03-15 |
Raw metadata
{
"nia": false,
"qty": ">1000x",
"url": "https://www.youtube.com/watch?v=uOGHXAfvK8w",
"mode": "PREDICTION",
"role": "Host",
"context": "We're going to see post transformer architectures that make a thousandx reduction in cost look like child's play.",
"to_year": 2026,
"verbatim": "We're going to see post transformer architectures that make a thousandx reduction in cost look like child's play.",
"conv_cues": "going to see",
"direction": "DOWN",
"from_year": 2026,
"timeframe": "Future (near-term)",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "IBM Granite 4.0 hybrid Mamba-2 ships to enterprise with >70% RAM reduction",
"notes": "HIT — Granite 4 with hybrid arch released Nov 2025.",
"source": "https://www.infoq.com/news/2025/11/ibm-granite-mamba2-enterprise/",
"status": "hit",
"weight": 0.4,
"ordinal": -8,
"source_id": null,
"confidence": 0.95,
"source_url": "https://www.infoq.com/news/2025/11/ibm-granite-mamba2-enterprise/",
"expected_date": "2025-11-30",
"observed_date": "2025-11-30",
"research_origin": "deep_research",
"measurement_criterion": "IBM publicly releases Granite 4.0 hybrid Mamba/Transformer with documented >70% RAM reduction for long-context"
},
{
"kind": "llm_pre_event",
"label": "DeepSeek V4 1T-param model claims 10-40x cost reduction vs Western peers",
"source": "https://introl.com/blog/deepseek-v4-trillion-parameter-coding-model-february-2026",
"status": "hit",
"weight": 0.4,
"ordinal": -7,
"source_id": null,
"confidence": 0.9,
"source_url": "https://introl.com/blog/deepseek-v4-trillion-parameter-coding-model-february-2026",
"expected_date": "2026-02-28",
"observed_date": "2026-02-15",
"research_origin": "deep_research",
"measurement_criterion": "DeepSeek V4 release with documented benchmarks showing >10x cost-per-token reduction vs comparable Western frontier model"
},
{
"kind": "prereq",
"label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
"status": "hit",
"weight": 0.5,
"ordinal": -6,
"source_id": "SEM_012",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Training runs costing $10 billion for a single model will commence sometime in 2025.",
"status": "hit",
"weight": 0.5,
"ordinal": -5,
"source_id": "SEM_008",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Anthropic revenue will cross OpenAI revenue in middle of 2026",
"status": "hit",
"weight": 0.5,
"ordinal": -4,
"source_id": "234_012",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "2025 will be the definitive year that agentic systems finally hit the mainstream.",
"status": "hit",
"weight": 0.5,
"ordinal": -3,
"source_id": "SEM_042",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "prereq",
"label": "Recursive self-improvement is already happening now (no longer three years out)",
"status": "hit",
"weight": 0.5,
"ordinal": -2,
"source_id": "238_009",
"expected_date": "2026-04-29",
"observed_date": "2026-04-29"
},
{
"kind": "llm_pre_event",
"label": "Mamba-3 published at ICLR 2026 with sub-Transformer latency at 1.5B scale",
"notes": "HIT — Mamba-3 SISO beats Mamba-2/Llama-3.2 on prefill+decode latency.",
"source": "https://openreview.net/pdf?id=HwCvaJOiCj",
"status": "hit",
"weight": 0.4,
"ordinal": -1,
"source_id": null,
"confiden
... (truncated)