Algorithmic efficiencies will deliver ~0.5 OOMs per year of additional effective compute through 2027 — pure multiplier on raw FLOPs.
Predictor: Leopold Aschenbrenner
Prediction text
Algorithmic efficiencies will deliver ~0.5 OOMs per year of additional effective compute through 2027 — pure multiplier on raw FLOPs. | Frontier training efficiency benchmarks
Key catalyst: Frontier training efficiency benchmarks
Watch events: MoE paper throughput; FP4/ternary training adoption; distillation pipeline efficiency.
Resolution evidence
MoE architectures, DeepSeek-class distillation, FP4 training, Chinese quantization breakthroughs (see SEM_021/022) all contributing measurable efficiency gains.
Predictor: Leopold Aschenbrenner
Calibration plot (stated vs observed)
Evidence about this node from Leopold Aschenbrenner is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2024-08-31overdueQ1 window check-in (25%)
- 2025-05-02overdueQ2 window check-in (50%)
- 2025-12-31overdueQ3 window check-in (75%)
- 2026-03-15hitFlashAttention-4 published with asymmetric hardware co-design improvementsHow: Peer-reviewed or arXiv preprint published demonstrating FlashAttention-4 with measurable training-throughput speedup over FA-3Source: arXiv 2603.05451 — FlashAttention-4: Algorithm and Kernel Pipelining Co-Designconf 95%
- 2026-01-01 → 2026-06-30overdueGPT-5.1 / Grok 4.20 / Opus 4.5 architectural innovation delivers compute-efficient frontierHow: At least one frontier lab publishes architecture (Smart Router, multi-agent debate, or equivalent) achieving SOTA at <50% training compute of prior generationSource: Frontier AI Models Technical Deep Dive (Huang)conf 85%
- 2026-04-30hitInference-time scaling / RLVR / MoE deliver measurable algorithmic compute multiplierHow: Independent analysis (Epoch AI, METR, or peer-reviewed) confirms >=0.5 OOM/year algorithmic-efficiency improvement trend continuing through 2025-2026Source: Medium: Situational Awareness, Two Years Later (April 2026)conf 85%
- 2026-06-01 → 2027-06-30pendingFP4 / FP8 mixed-precision training adoption across all frontier labsHow: Public model cards from 3+ frontier labs disclose FP4/FP8 mixed-precision training as primary regimeSource: Composite — Aschenbrenner OOM thesis derivationconf 70%
- 2027-08-31pendingCumulative algorithmic gains 2024-2027 reach ~1.5 OOMs per Aschenbrenner trajectoryHow: Quarterly Epoch AI / Situational Awareness retrospective confirms cumulative ~1.5 OOM algorithmic-efficiency gain over 2024-2027 windowSource: Aschenbrenner — Situational Awareness original frameworkconf 55%
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Raw metadata
{
"trf": 0.3625397793120952,
"kappa": 0.6875,
"base_rate": null,
"predictor": "Leopold Aschenbrenner",
"total_llr": -1.6218604324326575,
"grace_days": 7,
"bayesian_v2": true,
"prior_logit": 0.395510903045804,
"bayes_factor": "2.9:1 against",
"blend_reason": "no reference_class linked",
"inside_prior": 0.597608630112356,
"kappa_source": "predictor_table",
"n_milestones": 4,
"blend_applied": false,
"contributions": [
{
"llr": -0.4054651081081644,
"kind": "quartile_checkpoint",
"kappa": 0.6875,
"label": "Q1 window check-in (25%)",
"weight": 0.05,
"strength": "weak",
"confidence": null,
"source_url": null,
"adjusted_llr": -0.278757261824363,
"expected_date": "2024-08-31",
"measurement_criterion": null
},
{
"llr": -0.4054651081081644,
"kind": "quartile_checkpoint",
"kappa": 0.6875,
"label": "Q2 window check-in (50%)",
"weight": 0.05,
"strength": "weak",
"confidence": null,
"source_url": null,
"adjusted_llr": -0.278757261824363,
"expected_date": "2025-05-02",
"measurement_criterion": null
},
{
"llr": -0.4054651081081644,
"kind": "quartile_checkpoint",
"kappa": 0.6875,
"label": "Q3 window check-in (75%)",
"weight": 0.05,
"strength": "weak",
"confidence": null,
"source_url": null,
"adjusted_llr": -0.278757261824363,
"expected_date": "2025-12-31",
"measurement_criterion": null
},
{
"llr": -0.4054651081081644,
"kind": "llm_pre_event",
"kappa": 0.584375,
"label": "GPT-5.1 / Grok 4.20 / Opus 4.5 architectural innovation delivers compute-efficient frontier",
"weight": 0.4,
"strength": "weak",
"confidence": 0.85,
"source_url": "https://kenhuangus.substack.com/p/frontier-ai-models-technical-deep",
"adjusted_llr": -0.23694367255070856,
"expected_date": "2026-04-01",
"measurement_criterion": "At least one frontier lab publishes architecture (Smart Router, multi-agent debate, or equivalent) achieving SOTA at <50% training compute of prior generation"
}
],
"evidence_kind": "metadata_milestone_miss_sweep",
"inside_source": "history_v2",
"inside_weight": 0.7462221544815333,
"outside_weight": 0.2537778455184667,
"posterior_prob": 0.33677381454559646,
"posterior_logit": -0.6777045549779936,
"predictor_brier": 0.04167,
"inside_posterior": 0.33677381454559646,
"blended_posterior": 0.33677381454559646,
"reference_class_id": null,
"total_adjusted_llr": -1.0732154580237976,
"predictor_n_resolved": 3
}Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| killer | TK02 AI Compute Supply Shock (TSMC/Taiwan Disruption) | 12.0% | 0.050 | 0.650 | +0.076 |
Top outgoing (children)
Predictions THIS node influences
No outgoing edges.
Prerequisites (2)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| correlate | S_COMPUTE_100GW_2030 | Compute: 100GW national-scale by Dec 2030 | compute_scale | — |
| killer | TK02 | AI Compute Supply Shock (TSMC/Taiwan Disruption) | — | — |
Dependents (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No dependents | ||||
Linked documents (10)
Raw metadata
{
"nia": false,
"qty": "~0.5 OOMs/yr",
"mode": "FORECAST",
"role": "Cited-Researcher",
"context": "Second OOM vector: improvements in architectures, data curation, training methods. Historically consistent (MoE, FlashAttention, FP8/FP4, etc).",
"to_year": 2027,
"conv_cues": "quantitative rate",
"direction": "NUMERIC_TARGET",
"from_year": 2024,
"timeframe": "2024-2027",
"conv_level": "HIGH",
"milestones": [
{
"kind": "quartile_checkpoint",
"label": "Q1 window check-in (25%)",
"status": "overdue",
"weight": 0.05,
"ordinal": -6,
"source_id": null,
"expected_date": "2024-08-31",
"observed_date": null,
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep"
},
{
"kind": "quartile_checkpoint",
"label": "Q2 window check-in (50%)",
"status": "overdue",
"weight": 0.05,
"ordinal": -5,
"source_id": null,
"expected_date": "2025-05-02",
"observed_date": null,
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep"
},
{
"kind": "quartile_checkpoint",
"label": "Q3 window check-in (75%)",
"status": "overdue",
"weight": 0.05,
"ordinal": -4,
"source_id": null,
"expected_date": "2025-12-31",
"observed_date": null,
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep"
},
{
"kind": "llm_pre_event",
"label": "FlashAttention-4 published with asymmetric hardware co-design improvements",
"source": "arXiv 2603.05451 — FlashAttention-4: Algorithm and Kernel Pipelining Co-Design",
"status": "hit",
"weight": 0.4,
"ordinal": -3,
"source_id": null,
"confidence": 0.95,
"source_url": "https://arxiv.org/html/2603.05451v1",
"expected_date": "2026-03-15",
"observed_date": "2026-03-15",
"research_origin": "deep_research",
"measurement_criterion": "Peer-reviewed or arXiv preprint published demonstrating FlashAttention-4 with measurable training-throughput speedup over FA-3"
},
{
"kind": "llm_pre_event",
"label": "GPT-5.1 / Grok 4.20 / Opus 4.5 architectural innovation delivers compute-efficient frontier",
"source": "Frontier AI Models Technical Deep Dive (Huang)",
"status": "overdue",
"weight": 0.4,
"ordinal": -2,
"source_id": null,
"confidence": 0.85,
"source_url": "https://kenhuangus.substack.com/p/frontier-ai-models-technical-deep",
"expected_date": "2026-04-01",
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2026-06-30",
"from": "2026-01-01"
},
"measurement_criterion": "At least one frontier lab publishes architecture (Smart Router, multi-agent debate, or equivalent) achieving SOTA at <50% training compute of prior generation"
},
{
"kind": "llm_pre_event",
"label": "Inference-time scaling / RLVR / MoE deliver measurable algorithmic compute multiplier",
"source": "Medium: Situational Awareness, Two Years Later (April 2026)",
"status": "hit",
"weight": 0.4,
"ordinal": -1,
"source_id": null,
"confidence": 0.85,
"source_url": "https://medium.com/data-science-collective/situational-awareness-two-years-later-4b941d052ef9",
"expected_date": "2026-04-30",
"observed_date": "2026-04-30",
"research_origin": "deep_research",
"measurement_criterion": "Independent analysis (Epoch AI, METR, or peer-reviewed) confirms >=0.5 OOM/year algorithmic-efficiency improvement trend continuing through 2025-2026"
},
{
"kind": "event",
"label": "Algorithmic efficiencies will deliver ~0.5 OOMs per year of ad
... (truncated)