Ternary (sub-bit) parameter precision will be optimal for AI models.
Predictor: Dave Blundin · ep#248 "Sam Altman's Attack, Amazon vs. Starlink, and What Opus 4.7 Actually Means | #248" · source
Prediction text
Ternary (sub-bit) parameter precision will be optimal for AI models. | Well, I I am 90% sure that turnary is the optimal now. I've got simulations running all the time.
Verbatim quote
Well, I I am 90% sure that turnary is the optimal now. I've got simulations running all the time.
Predictor: Dave Blundin
Calibration plot (stated vs observed)
Evidence about this node from Dave Blundin is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2024-02-27hitMicrosoft BitNet b1.58 (ternary) matches FP16 LLaMA 2 perplexityHow: Microsoft research paper demonstrates BitNet b1.58 with ternary weights {-1,0,1} matches FP16 LLaMA 2 perplexity from 3B+ sizeSource: arXiv 2402.17764 — The Era of 1-bit LLMsconf 99%Notes: HIT — Foundation paper. BitNet b1.58 with 3.9B is 2.4x faster, 3.32x less memory than LLaMA 3B.
- 2024-12-01hitHugging Face fine-tuning blog — ternary quantization 'made easy'How: Hugging Face publishes practitioner-facing blog/tutorial on fine-tuning to 1.58-bit precisionSource: Hugging Face blog — 1.58-bit extreme quantization made easyconf 95%Notes: HIT — Mainstream ML community has accepted ternary as practical, not just theoretical.
- 2024-10-15hitMicrosoft open-sources BitNet inference frameworkHow: Microsoft releases open-source inference framework supporting 1.58-bit ternary modelsSource: GitHub — microsoft/BitNetconf 99%Notes: HIT — Open framework enables 100B+ parameter inference on single CPU per Glen Rhodes coverage.
- 2025-04-15hitMicrosoft releases bitnet-b1.58-2B-4T on Hugging FaceHow: Microsoft publishes production-grade ternary 2B parameter model (4T tokens) on Hugging FaceSource: Hugging Face — microsoft/bitnet-b1.58-2B-4Tconf 99%Notes: HIT — Production-quality ternary 2B model. Sub-bit precision is now beyond research.
- 2025-06-01 → 2027-06-30pendingFirst commercial LLM API deploys ternary weights at scaleHow: Major LLM API provider (Microsoft, Anthropic, OpenAI, Google) ships ternary-weight model in production with public latency/cost benefitsSource: Vendor product announcementsconf 55%Notes: Cascade — Required for Blundin's 'optimal' claim to be validated by market.
- 2025-12-01 → 2027-12-31pendingFrontier model trained natively in ternary precision (>50B params)How: Public research or product announcement of frontier-class (>50B params) LLM trained natively in ternary precision matching FP16 SOTASource: arXiv; Meta/Microsoft/Anthropic research blogsconf 50%Notes: Cascade — Strong signal that ternary is endpoint of quantization, validating Blundin's 90% confidence.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| killer | TK09 Energy Grid Cap (Data Center Power Wall) | 35.0% | 0.050 | 0.600 | -0.088 |
| prereq | SEM_015 Nvidia agreed to remit 15% of China chip-sale revenue direct — Jensen Huang | 66.3% | 0.600 | 0.050 | -0.076 |
| prereq | SEM_027 Nvidia Data Center revenue +66% YoY, contributing ~90% of $5 — Joseph Moore | 68.3% | 0.600 | 0.050 | -0.075 |
| killer | TK05 Rate Regime Persistence (10y > 5% through 2028) | 30.0% | 0.050 | 0.600 | -0.060 |
| killer | TK03 AI Regulatory Moratorium (EU/US Capability Freeze) | 10.0% | 0.050 | 0.600 | +0.050 |
Top outgoing (children)
Predictions THIS node influences
| Kind | Node | Their prob | P(c|s=T) | P(c|s=F) | Δ implied |
|---|---|---|---|---|---|
| prereq | 248_040 Pausing AI will fail and only accelerate race dynamics. — Alex Wissner-Gross | 53.0% | 0.920 | 0.050 | -0.056 |
| prereq | 247_023 AI will be able to do everything a white collar worker does — Dave Blundin | 40.8% | 0.720 | 0.050 | -0.031 |
| prereq | 242_031 Most large companies' business models will be disrupted in 2 — Peter Diamandis | 36.1% | 0.650 | 0.050 | -0.018 |
| prereq | 232_055 We're exiting the industrial age permanently as recursive se — Peter Diamandis | 35.5% | 0.700 | 0.050 | +0.013 |
| prereq | 244_019 Peter's son won't need a driver's license in 2 years — Peter Diamandis | 48.4% | 0.920 | 0.050 | -0.010 |
Ticker exposure
Beneficiaries (24)
Adverse (6)
Prerequisites (10)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | SEM_011 | Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips. | Capital Markets | — |
| prereq | SEM_027 | Nvidia Data Center revenue +66% YoY, contributing ~90% of $57B fiscal Q3 revenue; >$4.5T market cap entirely underpinned by AI silicon. | Capital Markets | — |
| prereq | SEM_014 | Nvidia's Arizona-based TSMC factory successfully fabricated cutting-edge semiconductors on US soil for first time in decades (October 2025). | Manufacturing | — |
| prereq | SEM_012 | Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering. | AI/Manufacturing | — |
| prereq | SEM_015 | Nvidia agreed to remit 15% of China chip-sale revenue directly to US government in exchange for reversing specific AI chip export bans. | Policy/Semis | — |
| killer | TK09 | Energy Grid Cap (Data Center Power Wall) | — | — |
| killer | TK05 | Rate Regime Persistence (10y > 5% through 2028) | — | — |
| killer | TK01 | AGI Capability Plateau (2026-27 Training Stall) | — | — |
| killer | TK02 | AI Compute Supply Shock (TSMC/Taiwan Disruption) | — | — |
| killer | TK03 | AI Regulatory Moratorium (EU/US Capability Freeze) | — | — |
Dependents (5)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| prereq | 244_019 | Peter's son won't need a driver's license in 2 years | Auto/Transport | — |
| prereq | 248_040 | Pausing AI will fail and only accelerate race dynamics. | AI | — |
| prereq | 247_023 | AI will be able to do everything a white collar worker does imminently | AI | — |
| prereq | 232_055 | We're exiting the industrial age permanently as recursive self-improvement unfolds. | AI | — |
| prereq | 242_031 | Most large companies' business models will be disrupted in 2-5 years | Markets/Stocks | — |
Linked documents (10)
Raw metadata
{
"nia": false,
"qty": "90% confidence ternary optimal",
"url": "https://www.youtube.com/watch?v=LVvleNtllPk",
"mode": "PREDICTION",
"role": "Host",
"context": "I am 90% sure that turnary is the optimal now. I've got simulations running all the time.",
"to_year": 2026,
"verbatim": "Well, I I am 90% sure that turnary is the optimal now. I've got simulations running all the time.",
"conv_cues": "90% sure",
"direction": "HAPPEN",
"from_year": 2026,
"timeframe": "present/near-term",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "Microsoft BitNet b1.58 (ternary) matches FP16 LLaMA 2 perplexity",
"notes": "HIT — Foundation paper. BitNet b1.58 with 3.9B is 2.4x faster, 3.32x less memory than LLaMA 3B.",
"source": "arXiv 2402.17764 — The Era of 1-bit LLMs",
"status": "hit",
"weight": 0.4,
"ordinal": -9,
"source_id": null,
"confidence": 0.99,
"source_url": "https://arxiv.org/html/2402.17764v1",
"expected_date": "2024-02-27",
"observed_date": "2024-02-27",
"research_origin": "deep_research",
"measurement_criterion": "Microsoft research paper demonstrates BitNet b1.58 with ternary weights {-1,0,1} matches FP16 LLaMA 2 perplexity from 3B+ size"
},
{
"kind": "llm_pre_event",
"label": "Hugging Face fine-tuning blog — ternary quantization 'made easy'",
"notes": "HIT — Mainstream ML community has accepted ternary as practical, not just theoretical.",
"source": "Hugging Face blog — 1.58-bit extreme quantization made easy",
"status": "hit",
"weight": 0.4,
"ordinal": -8,
"source_id": null,
"confidence": 0.95,
"source_url": "https://huggingface.co/blog/1_58_llm_extreme_quantization",
"expected_date": "2025-01-30",
"observed_date": "2024-12-01",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2025-06-30",
"from": "2024-09-01"
},
"measurement_criterion": "Hugging Face publishes practitioner-facing blog/tutorial on fine-tuning to 1.58-bit precision"
},
{
"kind": "llm_pre_event",
"label": "Microsoft open-sources BitNet inference framework",
"notes": "HIT — Open framework enables 100B+ parameter inference on single CPU per Glen Rhodes coverage.",
"source": "GitHub — microsoft/BitNet",
"status": "hit",
"weight": 0.4,
"ordinal": -7,
"source_id": null,
"confidence": 0.99,
"source_url": "https://github.com/microsoft/BitNet",
"expected_date": "2025-02-14",
"observed_date": "2024-10-15",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2025-06-30",
"from": "2024-10-01"
},
"measurement_criterion": "Microsoft releases open-source inference framework supporting 1.58-bit ternary models"
},
{
"kind": "llm_pre_event",
"label": "Microsoft releases bitnet-b1.58-2B-4T on Hugging Face",
"notes": "HIT — Production-quality ternary 2B model. Sub-bit precision is now beyond research.",
"source": "Hugging Face — microsoft/bitnet-b1.58-2B-4T",
"status": "hit",
"weight": 0.4,
"ordinal": -6,
"source_id": null,
"confidence": 0.99,
"source_url": "https://huggingface.co/microsoft/bitnet-b1.58-2B-4T",
"expected_date": "2025-07-02",
"observed_date": "2025-04-15",
"research_origin": "deep_research",
"expected_date_range": {
"to": "2025-12-31",
"from": "2025-01-01"
},
"measurement_criterion": "Microsoft publishes production-grade ternary 2B parameter model (4T tokens) on Hugging Face"
},
{
"kind": "prereq",
"label": "Nvidia became the world's first $5 trillion company (late 2025), operating a near-monopoly on advanced AI chips.",
"status": "hit",
"weight": 0.5,
"ordinal": -5,
"source_id": "SEM_011",
... (truncated)