CMQ_058predictionAI/Computelocal-compute

Localized hardware setups (multiple Apple Mac Studios, dedicated 'AI Max 300' silicon) will allow developers to run powerful inference workloads directly on-premises — reducing cloud dependency.

Predictor: Alex Finn

Prior probability

80.0%

Current probability

58.9%

evolves via intake + LBP

Conviction

4/5

Signal quality

Resolution

in_progress

Window

2026-01-01 – 2026-12-31

Edges in / out

3 / 0

Tickers exposed

Prediction text

Localized hardware setups (multiple Apple Mac Studios, dedicated 'AI Max 300' silicon) will allow developers to run powerful inference workloads directly on-premises — reducing cloud dependency. | Local-LLM adoption metrics; open-source model capability curves

Key catalyst: Local-LLM adoption metrics; open-source model capability curves

Watch events: Local-LLM hardware market size; open-source model capability progression.

Resolution evidence

Status: in_progress

Apple M3 Ultra Mac Studio 512GB unified memory running 70B+ models; open-source Qwen 2.5 / DeepSeek R1 / Llama 3.3 capable on local hardware.

Predictor: Alex Finn

κ + Brier as of 2026-05-22

Full calibration →

κ (discount)

0.643

Brier

0.0122

excellent

Hits / Misses

1 / 0

of 2 resolved

Hit rate

50.0%

Calibration plot (stated vs observed)

Evidence about this node from Alex Finn is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class

Not linked

This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.

Probability over time

4 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 58.9%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 2 overdue ⏱ · 3 pending

2026-03-07overdueQ1 window check-in (25%)
2026-05-12overdueQ2 window check-in (50%)
2026-07-17pendingQ3 window check-in (75%)
2026-07-31pending70B-parameter open-weight LLMs run usably on single Mac Studio at >20 tokens/sec
How: Public benchmarks (MLX, llama.cpp) show Llama 3.x 70B Q4 quantized achieving >=20 tok/s on Mac Studio M-series Ultra
Source: https://machinelearning.apple.com/research/exploring-llms-mlx-m5conf 90%
Notes: Apple's MLX research shows neural-accelerator throughput already meets this bar on M5.
2026-04-01 → 2026-12-31pendingAMD AI Max 300/Ryzen AI Max+ 395 ships in workstation form for local inference
How: AMD ships dedicated 'AI Max' silicon (Ryzen AI Max+ or successor) with >=128GB unified memory at retail
Source: AMD product roadmap 2026; reviews of AI Max 300 seriesconf 70%
2026-09-21pendingLocalized hardware setups (multiple Apple Mac Studios, dedicated 'AI Max 300' silicon) will allow developers to run powerful inference workl
2026-09-01 → 2026-12-31pendingApple Mac Studio M5 Ultra ships with 192GB-256GB unified memory for local LLM inference
How: Apple ships Mac Studio with M5 Ultra chip publicly available for purchase with >=192GB unified memory at >=800 GB/s bandwidth
Source: https://wccftech.com/m5-ultra-mac-studio-delayed-to-q4-2026-apple-llm-revenue-to-suffer/conf 85%
Notes: Apple delayed M5 Ultra to ~Q4 2026; older M3 Ultra configs already sold out due to local-LLM demand.
2026-12-31pendingLocal-LLM developer-tooling adoption: Ollama/LM Studio/MLX combined >5M MAU
How: Combined monthly active users across major local-LLM runtimes (Ollama, LM Studio, MLX, llama.cpp) exceed 5 million per public telemetry/disclosures
Source: Ollama/LM Studio public telemetry, GitHub statsconf 60%
2027-01-01 → 2027-12-31pendingCloud-inference revenue growth decelerates in 2027 as enterprise on-prem migration accelerates
How: Tier-1 inference cloud providers (OpenAI/Anthropic/Together/Fireworks) report decelerating API token-revenue growth (<50% YoY vs. 2026 baseline) attributable to local-inference shift
Source: Cloud provider revenue disclosures; Gartner enterprise AI surveysconf 45%

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 59%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

metadata_milestone_miss_sweep2026-05-30T22:15:00Z58.9%-6.1pp

metadata_milestone_miss_sweep bayesian_v2 n=1 inside=0.589 blend=0.589 LLR=-0.261 κ=0.64 no_blend

Raw metadata

{
  "trf": 0.5881123843731557,
  "kappa": 0.6429,
  "base_rate": null,
  "predictor": "Alex Finn",
  "total_llr": -0.4054651081081644,
  "grace_days": 7,
  "bayesian_v2": true,
  "prior_logit": 0.619747969836153,
  "bayes_factor": "1.3:1 against",
  "blend_reason": "no reference_class linked",
  "inside_prior": 0.6501612260779359,
  "kappa_source": "predictor_table",
  "n_milestones": 1,
  "blend_applied": false,
  "contributions": [
    {
      "llr": -0.4054651081081644,
      "kind": "quartile_checkpoint",
      "kappa": 0.6429,
      "label": "Q2 window check-in (50%)",
      "weight": 0.05,
      "strength": "weak",
      "confidence": null,
      "source_url": null,
      "adjusted_llr": -0.2606735180027389,
      "expected_date": "2026-05-12",
      "measurement_criterion": null
    }
  ],
  "evidence_kind": "metadata_milestone_miss_sweep",
  "inside_source": "history_v2",
  "inside_weight": 0.5883213309387909,
  "outside_weight": 0.4116786690612091,
  "posterior_prob": 0.5888163664972889,
  "posterior_logit": 0.3590744518334141,
  "predictor_brier": 0.0122,
  "inside_posterior": 0.5888163664972889,
  "blended_posterior": 0.5888163664972889,
  "reference_class_id": null,
  "total_adjusted_llr": -0.2606735180027389,
  "predictor_n_resolved": 2
}

metadata_milestone_miss_sweep2026-05-02T22:07:21Z65.0%-5.7pp

metadata_milestone_miss_sweep bayesian_v2 n=1 inside=0.650 blend=0.650 LLR=-0.261 κ=0.64 no_blend

Raw metadata

{
  "trf": 0.6650500679109432,
  "kappa": 0.6429,
  "base_rate": null,
  "predictor": "Alex Finn",
  "total_llr": -0.4054651081081644,
  "grace_days": 7,
  "bayesian_v2": true,
  "prior_logit": 0.8804214878388924,
  "bayes_factor": "1.3:1 against",
  "blend_reason": "no reference_class linked",
  "inside_prior": 0.7069095561147094,
  "kappa_source": "predictor_table",
  "n_milestones": 1,
  "blend_applied": false,
  "contributions": [
    {
      "llr": -0.4054651081081644,
      "kind": "quartile_checkpoint",
      "kappa": 0.6429,
      "label": "Q1 window check-in (25%)",
      "weight": 0.05,
      "strength": "weak",
      "confidence": null,
      "source_url": null,
      "adjusted_llr": -0.2606735180027389,
      "expected_date": "2026-03-07",
      "measurement_criterion": null
    }
  ],
  "evidence_kind": "metadata_milestone_miss_sweep",
  "inside_source": "history_v2",
  "inside_weight": 0.5344649524623397,
  "outside_weight": 0.4655350475376603,
  "posterior_prob": 0.6501612260779359,
  "posterior_logit": 0.6197479698361534,
  "predictor_brier": 0.0122,
  "inside_posterior": 0.6501612260779359,
  "blended_posterior": 0.6501612260779359,
  "reference_class_id": null,
  "total_adjusted_llr": -0.2606735180027389,
  "predictor_n_resolved": 2
}

LBP2026-04-30T16:39:51Z70.7%-3.4pp

Network propagation: 74.1% → 70.7%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z74.1%-5.9pp

Network propagation: 80.0% → 74.1%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
killer	TK06 China-Taiwan Military Conflict	8.0%	0.050	0.800	+0.151
killer	TK02 AI Compute Supply Shock (TSMC/Taiwan Disruption)	12.0%	0.050	0.800	+0.121
killer	TK09 Energy Grid Cap (Data Center Power Wall)	35.0%	0.050	0.800	-0.051

Top outgoing (children)

Predictions THIS node influences

No outgoing edges.

Prerequisites (3)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
killer	TK09	Energy Grid Cap (Data Center Power Wall)	—	—
killer	TK02	AI Compute Supply Shock (TSMC/Taiwan Disruption)	—	—
killer	TK06	China-Taiwan Military Conflict	—	—

Dependents (0)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
No dependents

Validations (1)

Resolution events

Observed at	Status	By	Notes
2026-04-29	partial	thesis_timeline_v1.0_import	Apple M3 Ultra Mac Studio 512GB unified memory running 70B+ models; open-source Qwen 2.5 / DeepSeek R1 / Llama 3.3 capable on local hardware.

Linked documents (10)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.727	arxiv	When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems	—	mentions	pending	2026-05-28
0.717	arxiv	Litespark Inference on Consumer CPUs: Custom SIMD Kernels for Ternary Neural Networks	—	mentions	pending	2026-05-07
0.708	arxiv	ROMER: Expert Replacement and Router Calibration for Robust MoE LLMs on Analog Compute-in-Memory Systems	—	mentions	pending	2026-05-12
0.695	arxiv	EnergyLens: Predictive Energy-Aware Exploration for Multi-GPU LLM Inference Optimization	—	mentions	pending	2026-05-14
0.695	arxiv	Benchmarking Local LLMs for Natural-Language-to-SQL Querying in Biopharmaceutical Manufacturing: An Empirical Benchmark on Consumer-Grade Hardware	—	mentions	pending	2026-05-31
0.692	arxiv	Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren't Worth Training	—	mentions	pending	2026-05-04
0.691	arxiv	LLMForge: Multi-Backend Hardware-Aware Neural Architecture Search with Infinite-Head Attention for Edge Language Models	—	mentions	pending	2026-05-17
0.687	arxiv	Correct Code, Vulnerable Dependencies: A Large Scale Measurement Study of LLM-Specified Library Versions	—	mentions	pending	2026-05-07
0.687	arxiv	Fingerprinting Inference Systems of Large Language Models	—	mentions	pending	2026-05-28
0.685	arxiv	More Is Not Always Better: Cross-Component Interference in LLM Agent Scaffolding	—	mentions	pending	2026-05-07

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "qty": "local inference mainstream",
  "mode": "FORECAST",
  "role": "Cited-Analyst",
  "context": "Demonstrated workflow: 3+ Mac Studios running Qwen/DeepSeek/Llama class models for cost of electricity only.",
  "to_year": 2028,
  "conv_cues": "predicts; demonstrated workflow",
  "direction": "HAPPEN",
  "from_year": 2026,
  "timeframe": "2026+",
  "conv_level": "MEDIUM",
  "milestones": [
    {
      "kind": "quartile_checkpoint",
      "label": "Q1 window check-in (25%)",
      "status": "overdue",
      "weight": 0.05,
      "ordinal": -5,
      "source_id": null,
      "expected_date": "2026-03-07",
      "observed_date": null,
      "miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
      "miss_emitted_by": "metadata_milestone_sweep"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q2 window check-in (50%)",
      "status": "overdue",
      "weight": 0.05,
      "ordinal": -4,
      "source_id": null,
      "expected_date": "2026-05-12",
      "observed_date": null,
      "miss_emitted_at": "2026-05-30T22:15:00.756418+00:00",
      "miss_emitted_by": "metadata_milestone_sweep"
    },
    {
      "kind": "quartile_checkpoint",
      "label": "Q3 window check-in (75%)",
      "status": "pending",
      "weight": 0.05,
      "ordinal": -3,
      "source_id": null,
      "expected_date": "2026-07-17",
      "observed_date": null
    },
    {
      "kind": "llm_pre_event",
      "label": "70B-parameter open-weight LLMs run usably on single Mac Studio at >20 tokens/sec",
      "notes": "Apple's MLX research shows neural-accelerator throughput already meets this bar on M5.",
      "source": "https://machinelearning.apple.com/research/exploring-llms-mlx-m5",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -2,
      "source_id": null,
      "confidence": 0.9,
      "source_url": "https://machinelearning.apple.com/research/exploring-llms-mlx-m5",
      "expected_date": "2026-07-31",
      "research_origin": "deep_research",
      "measurement_criterion": "Public benchmarks (MLX, llama.cpp) show Llama 3.x 70B Q4 quantized achieving >=20 tok/s on Mac Studio M-series Ultra"
    },
    {
      "kind": "llm_pre_event",
      "label": "AMD AI Max 300/Ryzen AI Max+ 395 ships in workstation form for local inference",
      "source": "AMD product roadmap 2026; reviews of AI Max 300 series",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -1,
      "source_id": null,
      "confidence": 0.7,
      "expected_date": "2026-08-16",
      "research_origin": "training",
      "expected_date_range": {
        "to": "2026-12-31",
        "from": "2026-04-01"
      },
      "measurement_criterion": "AMD ships dedicated 'AI Max' silicon (Ryzen AI Max+ or successor) with >=128GB unified memory at retail"
    },
    {
      "kind": "event",
      "label": "Localized hardware setups (multiple Apple Mac Studios, dedicated 'AI Max 300' silicon) will allow developers to run powerful inference workl",
      "status": "pending",
      "weight": 1,
      "ordinal": 0,
      "source_id": "CMQ_058",
      "expected_date": "2026-09-21",
      "observed_date": null
    },
    {
      "kind": "llm_pre_event",
      "label": "Apple Mac Studio M5 Ultra ships with 192GB-256GB unified memory for local LLM inference",
      "notes": "Apple delayed M5 Ultra to ~Q4 2026; older M3 Ultra configs already sold out due to local-LLM demand.",
      "source": "https://wccftech.com/m5-ultra-mac-studio-delayed-to-q4-2026-apple-llm-revenue-to-suffer/",
      "status": "pending",
      "weight": 0.4,
      "ordinal": 1,
      "source_id": null,
      "confidence": 0.85,
      "source_url": "https://wccftech.com/m5-ultra-mac-studio-delayed-to-q4-2026-apple-llm-revenue-to-suffer/",
      "expected_date": "2026-10-31",
      "research_origin": "deep_research",
      "expected_date_range": {
        "to": "2026-12-31",
        "from": "2026-09-01"
      },
      "measurement_criterion": "Apple ships Mac Studio w
... (truncated)