Unit cost of intelligence will drop at a rate far exceeding Moore's Law, driving explosive demand growth that filters out to edge devices and localized small models.
Predictor: Marc Andreessen
Prediction text
Unit cost of intelligence will drop at a rate far exceeding Moore's Law, driving explosive demand growth that filters out to edge devices and localized small models. | Next API token-pricing drop from frontier labs
Key catalyst: Next API token-pricing drop from frontier labs
Watch events: OpenAI / Anthropic / Google API token pricing; Epoch AI cost-per-FLOP tracker
Resolution evidence
OpenAI API token pricing fell ~40x 2023-2026 per Altman; DeepSeek-V3 / Kimi K2 / Claude Haiku 4.5 delivered frontier capability at 10-30x lower cost per token 2025-2026.
Predictor: Marc Andreessen
Evidence about this node from Marc Andreessen is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-04-15hitDeepSeek-driven 80%+ price drop across frontier LLMs (2024-2026)How: Industry pricing trackers (PriceperToken, BenchLM, Iternal) document that headline frontier-LLM input-token API prices have dropped >=80% from early-2024 levelsSource: TLDL/BenchLM 2026: LLM API prices dropped 80% across the board 2024->2026; GPT-4-class went from $30/M to $2-3/M (10x drop)conf 99%Notes: HIT — direct measurement of cost-of-intelligence trajectory exceeding Moore's Law.
- 2026-03-31hitAnthropic Opus pricing cut 67%How: Anthropic announces >=50% reduction in Opus or equivalent flagship model API pricing alongside context-window expansionSource: Anthropic dropped Opus pricing 67% and expanded context window to 1M tokens (cited in 2026 pricing analyses)conf 95%
- 2026-06-30pendingApple Intelligence runs 3B model on >=500M devicesHow: Apple discloses Apple Intelligence (3B-param on-device model) deployed on >=500M iPhone 15 Pro / iPhone 16+ active devicesSource: Apple Intelligence 3B on-device model on iPhone 15 Pro+; over 2B smartphones run local SLMs (2026 edge-AI literature)conf 95%Notes: Confirms 'filters out to edge devices and localized small models' — direct mechanism for Andreessen claim.
- 2027-02-15pendingQ1 window check-in (25%)
- 2026-06-01 → 2027-12-31pendingSub-1B parameter SLMs match GPT-3.5 performance benchmarksHow: Public benchmarks (MMLU, HellaSwag, Big-Bench) show sub-1B-param model (Gemma 3 270M, SmolLM2, Phi-mini) matching or exceeding GPT-3.5 (175B) baselineSource: Edge AI Vision Alliance 2026: distilled small models outperform much larger base models on math/reasoningconf 85%
- 2027-12-31pendingGartner: orgs use task-specific SLMs 3x more than general LLMsHow: Gartner CIO survey or equivalent industry benchmark shows enterprise deployments of small task-specific models exceed general-purpose LLM deployments by 3xSource: Gartner forecast cited in 2026 SLM literature: by 2027 orgs will use small task-specific models 3x more than general-purpose LLMsconf 55%Notes: Cascade — validates 'filters out' end of Andreessen's claim with quantitative bench.
- 2028-03-31pendingQ2 window check-in (50%)
- 2027-01-01 → 2029-06-30pendingFrontier-class API pricing crosses below $0.50/M input tokensHow: At least one frontier-tier (GPT-5/Claude Opus/Gemini Ultra equivalent) API offers input-token pricing below $0.50/MSource: Trajectory: $30/M (2024) -> $2.50/M (2026 GPT-5.4) -> sub-$0.50 implied by continued Wright's-Law-like declineconf 50%Notes: Cascade — direct extension of observed 10x/2yr trajectory.
- 2029-05-15pendingQ3 window check-in (75%)
No downstream cascades — this prediction is a leaf in the dependency graph.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
No propagation data yet. Run inference/.venv/bin/python scripts/ops/run_loopy_belief_propagation.py on the droplet, or wait for the Sunday 02:00 UTC weekly cron.
Prerequisites (1)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| correlate | S_AGI_FAST_2027 | AGI fast: drop-in remote worker by 2027-09 | agi_general_capability | — |
Dependents (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No dependents | ||||
Validations (1)
| Observed at | Status | By | Notes |
|---|---|---|---|
| 2026-04-29 | partial | thesis_timeline_v1.0_import | OpenAI API token pricing fell ~40x 2023-2026 per Altman; DeepSeek-V3 / Kimi K2 / Claude Haiku 4.5 delivered frontier capability at 10-30x lower cost per token 2025-2026. |
Linked documents (10)
Raw metadata
{
"nia": false,
"qty": ">Moore pace",
"mode": "FORECAST",
"role": "Cited-VC",
"context": "Pairs with Altman's 40x/yr hyperdeflation call (235_014) and Huang tokens-per-watt KPI (CMQ_024). Directly downstream consequence of the oversupply thesis (INF_042).",
"to_year": 2031,
"conv_cues": "exceeds Moore's Law framing; universally-diffusing prediction",
"direction": "DOWN",
"from_year": 2026,
"timeframe": "2026-2031",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "DeepSeek-driven 80%+ price drop across frontier LLMs (2024-2026)",
"notes": "HIT — direct measurement of cost-of-intelligence trajectory exceeding Moore's Law.",
"source": "TLDL/BenchLM 2026: LLM API prices dropped 80% across the board 2024->2026; GPT-4-class went from $30/M to $2-3/M (10x drop)",
"status": "hit",
"weight": 0.4,
"ordinal": -10,
"source_id": null,
"confidence": 0.99,
"source_url": "https://benchlm.ai/llm-pricing",
"expected_date": "2026-04-30",
"observed_date": "2026-04-15",
"research_origin": "deep_research",
"measurement_criterion": "Industry pricing trackers (PriceperToken, BenchLM, Iternal) document that headline frontier-LLM input-token API prices have dropped >=80% from early-2024 levels"
},
{
"kind": "llm_pre_event",
"label": "Anthropic Opus pricing cut 67%",
"source": "Anthropic dropped Opus pricing 67% and expanded context window to 1M tokens (cited in 2026 pricing analyses)",
"status": "hit",
"weight": 0.4,
"ordinal": -9,
"source_id": null,
"confidence": 0.95,
"source_url": "https://intuitionlabs.ai/articles/ai-api-pricing-comparison-grok-gemini-openai-claude",
"expected_date": "2026-04-30",
"observed_date": "2026-03-31",
"research_origin": "deep_research",
"measurement_criterion": "Anthropic announces >=50% reduction in Opus or equivalent flagship model API pricing alongside context-window expansion"
},
{
"kind": "llm_pre_event",
"label": "Apple Intelligence runs 3B model on >=500M devices",
"notes": "Confirms 'filters out to edge devices and localized small models' — direct mechanism for Andreessen claim.",
"source": "Apple Intelligence 3B on-device model on iPhone 15 Pro+; over 2B smartphones run local SLMs (2026 edge-AI literature)",
"status": "pending",
"weight": 0.4,
"ordinal": -8,
"source_id": null,
"confidence": 0.95,
"source_url": "https://semiengineering.com/the-on-device-llm-revolution/",
"expected_date": "2026-06-30",
"research_origin": "deep_research",
"measurement_criterion": "Apple discloses Apple Intelligence (3B-param on-device model) deployed on >=500M iPhone 15 Pro / iPhone 16+ active devices"
},
{
"kind": "quartile_checkpoint",
"label": "Q1 window check-in (25%)",
"status": "pending",
"weight": 0.05,
"ordinal": -7,
"source_id": null,
"expected_date": "2027-02-15",
"observed_date": null
},
{
"kind": "llm_pre_event",
"label": "Sub-1B parameter SLMs match GPT-3.5 performance benchmarks",
"source": "Edge AI Vision Alliance 2026: distilled small models outperform much larger base models on math/reasoning",
"status": "pending",
"weight": 0.4,
"ordinal": -6,
"source_id": null,
"confidence": 0.85,
"source_url": "https://www.edge-ai-vision.com/2026/01/on-device-llms-in-2026-what-changed-what-matters-whats-next/",
"expected_date": "2027-03-17",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2027-12-31",
"from": "2026-06-01"
},
"measurement_criterion": "Public benchmarks (MMLU, HellaSwag, Big-Bench) show sub-1B-param model (Gemma 3 270M, SmolLM2, Phi-mini) matching or exceeding GPT-3.5 (175B) baseline"
},
{
"kind": "scenario_signal",
... (truncated)