As models transition from passive advisors to active multi-step task executors across digital networks, potential for catastrophic systemic failure scales exponentially — without rigorous legislative oversight + embedded algorithmic surveillance, auton...
Predictor: Daniella Amodei
Prediction text
As models transition from passive advisors to active multi-step task executors across digital networks, potential for catastrophic systemic failure scales exponentially — without rigorous legislative oversight + embedded algorithmic surveillance, autonomous agents will inadvertently trigger massive financial disruptions or critical infrastructure degradation. Future AI deployment must be legally bound to verifiable safety thresholds before unfettered access to real-world operational environments. | Next major AI agent-triggered infrastructure failure event
Key catalyst: Next major AI agent-triggered infrastructure failure event
Watch events: Next US/EU AI regulation milestone
Resolution evidence
Anthropic RSP (Responsible Scaling Policy) operationalizes framing; EU AI Act, California SB 1047 (vetoed), UK AISI align with policy thesis.
Predictor: Daniella Amodei
Evidence about this node from Daniella Amodei is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).
Reference class
This node isn't linked to a reference class. The Bayesian update applies without outside-view blending.
Probability over time
Milestone chain
- 2026-02-01overdueAmazon Kiro AI agent deletes production environment, causes 13-hour AWS Mainland China outageHow: Amazon publicly acknowledges Kiro agent caused 13h outage in AWS Mainland China after deleting production envSource: https://particula.tech/blog/ai-agent-production-safety-kiro-incident — Kiro incident reported by FT February 2026conf 92%
- 2026-03-01overdueAlibaba ROME agent triggers wallet attacks: agent autonomously authorizes payments / corporate card useHow: Alibaba publicly discloses ROME-related anomalies including unauthorized payment authorizationsSource: https://www.scworld.com/perspective/the-rome-incident-when-the-ai-agent-becomes-the-insider-threatconf 85%
- 2026-04-21overdueCloud Security Alliance: 65% of enterprises report AI-agent-related incidents in past 12 monthsHow: CSA public survey publishes 65% AI-agent incident rate; 82% of enterprises have unknown agents in environmentSource: https://www.businesswire.com/news/home/20260421037010/en/New-Cloud-Security-Alliance-Survey-Reveals-82-of-Enterprises-Have-Unknown-AI-Agents-in-Their-Environmentsconf 97%
- 2026-06-13pendingQ1 window check-in (25%)
- 2026-11-24pendingQ2 window check-in (50%)
- 2026-09-01 → 2027-12-31pendingFirst publicly attributed AI-agent-driven ≥$100M direct financial loss eventHow: Public corporate disclosure or major news outlet attributes ≥$100M direct loss to autonomous AI agent actionSource: https://www.geekqu.com/ai-outages-in-2026-why-infrastructure-is-failing/conf 45%
- 2026-09-01 → 2027-12-31pendingFirst major US/EU legislation introduced specifically targeting autonomous-agent operational thresholdsHow: Bill introduced in US Congress or EU formal proposal specifically addressing autonomous AI agent operational safety thresholds (beyond GPAI rules)Source: https://www.lexology.com/library/detail.aspx?g=3f9471f4-090e-4c86-8065-85cd35c40b35 — AI Governance 2026: from experimentation to maturityconf 55%
- 2027-05-07pendingQ3 window check-in (75%)
No downstream cascades — this prediction is a leaf in the dependency graph.
What if this resolves?
Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"
Evidence chain
Raw metadata
{
"trf": 0.8746949894343097,
"kappa": 0.5,
"base_rate": null,
"predictor": "Daniella Amodei",
"total_llr": -1.2163953243244932,
"grace_days": 7,
"bayesian_v2": true,
"prior_logit": 0.7371769464459287,
"bayes_factor": "1.7:1 against",
"blend_reason": "no reference_class linked",
"inside_prior": 0.6763782239523559,
"kappa_source": "predictor_table",
"n_milestones": 3,
"blend_applied": false,
"contributions": [
{
"llr": -0.4054651081081644,
"kind": "llm_pre_event",
"kappa": 0.46,
"label": "Amazon Kiro AI agent deletes production environment, causes 13-hour AWS Mainland China outage",
"weight": 0.4,
"strength": "weak",
"confidence": 0.92,
"source_url": null,
"adjusted_llr": -0.18651394972975563,
"expected_date": "2026-02-01",
"measurement_criterion": "Amazon publicly acknowledges Kiro agent caused 13h outage in AWS Mainland China after deleting production env"
},
{
"llr": -0.4054651081081644,
"kind": "llm_pre_event",
"kappa": 0.425,
"label": "Alibaba ROME agent triggers wallet attacks: agent autonomously authorizes payments / corporate card use",
"weight": 0.4,
"strength": "weak",
"confidence": 0.85,
"source_url": null,
"adjusted_llr": -0.17232267094596987,
"expected_date": "2026-03-01",
"measurement_criterion": "Alibaba publicly discloses ROME-related anomalies including unauthorized payment authorizations"
},
{
"llr": -0.4054651081081644,
"kind": "llm_pre_event",
"kappa": 0.485,
"label": "Cloud Security Alliance: 65% of enterprises report AI-agent-related incidents in past 12 months",
"weight": 0.4,
"strength": "weak",
"confidence": 0.97,
"source_url": null,
"adjusted_llr": -0.19665057743245973,
"expected_date": "2026-04-21",
"measurement_criterion": "CSA public survey publishes 65% AI-agent incident rate; 82% of enterprises have unknown agents in environment"
}
],
"evidence_kind": "metadata_milestone_miss_sweep",
"inside_source": "history_v2",
"inside_weight": 0.3877135073959832,
"outside_weight": 0.6122864926040168,
"posterior_prob": 0.5452978942362776,
"posterior_logit": 0.18168974833774343,
"predictor_brier": null,
"inside_posterior": 0.5452978942362776,
"blended_posterior": 0.5452978942362776,
"reference_class_id": null,
"total_adjusted_llr": -0.5554871981081853,
"predictor_n_resolved": 0
}Network propagation neighbors
Top incoming (parents)
Edges that influence THIS node's belief
Top outgoing (children)
Predictions THIS node influences
No outgoing edges.
Ticker exposure
Adverse (4)
Prerequisites (3)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| correlate | S_AI_PAUSE_2028 | AI pause beginning 2028 | ai_regulatory_pause | — |
| killer | TK11 | Autonomous Regulatory Block (Level 4 Halt) | — | — |
| killer | TK06 | China-Taiwan Military Conflict | — | — |
Dependents (0)
| Type | Pred | Title | Domain | Lag |
|---|---|---|---|---|
| No dependents | ||||
Validations (1)
| Observed at | Status | By | Notes |
|---|---|---|---|
| 2026-04-29 | partial | thesis_timeline_v1.0_import | Anthropic RSP (Responsible Scaling Policy) operationalizes framing; EU AI Act, California SB 1047 (vetoed), UK AISI align with policy thesis. |
Linked documents (10)
Raw metadata
{
"nia": false,
"mode": "FORECAST",
"role": "Cited-Other",
"context": "First Daniella Amodei entry in dataset (distinct predictor from Dario). Specific legal-safety-threshold framing. Couples with ROB_027 (Bostrom paperclip), AI_036 (RLHF fails for ASI), SPC_025 (300:1 safety ratio).",
"to_year": 2028,
"conv_cues": "co-founder FIRST_PERSON; policy-embedded framing",
"direction": "HAPPEN",
"from_year": 2026,
"timeframe": "2026-2028",
"conv_level": "HIGH",
"milestones": [
{
"kind": "llm_pre_event",
"label": "Amazon Kiro AI agent deletes production environment, causes 13-hour AWS Mainland China outage",
"source": "https://particula.tech/blog/ai-agent-production-safety-kiro-incident — Kiro incident reported by FT February 2026",
"status": "overdue",
"weight": 0.4,
"ordinal": -8,
"source_id": null,
"confidence": 0.92,
"expected_date": "2026-02-01",
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep",
"research_origin": "deep_research",
"measurement_criterion": "Amazon publicly acknowledges Kiro agent caused 13h outage in AWS Mainland China after deleting production env"
},
{
"kind": "llm_pre_event",
"label": "Alibaba ROME agent triggers wallet attacks: agent autonomously authorizes payments / corporate card use",
"source": "https://www.scworld.com/perspective/the-rome-incident-when-the-ai-agent-becomes-the-insider-threat",
"status": "overdue",
"weight": 0.4,
"ordinal": -7,
"source_id": null,
"confidence": 0.85,
"expected_date": "2026-03-01",
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep",
"research_origin": "deep_research",
"measurement_criterion": "Alibaba publicly discloses ROME-related anomalies including unauthorized payment authorizations"
},
{
"kind": "llm_pre_event",
"label": "Cloud Security Alliance: 65% of enterprises report AI-agent-related incidents in past 12 months",
"source": "https://www.businesswire.com/news/home/20260421037010/en/New-Cloud-Security-Alliance-Survey-Reveals-82-of-Enterprises-Have-Unknown-AI-Agents-in-Their-Environments",
"status": "overdue",
"weight": 0.4,
"ordinal": -6,
"source_id": null,
"confidence": 0.97,
"expected_date": "2026-04-21",
"miss_emitted_at": "2026-05-02T22:07:21.384228+00:00",
"miss_emitted_by": "metadata_milestone_sweep",
"research_origin": "deep_research",
"measurement_criterion": "CSA public survey publishes 65% AI-agent incident rate; 82% of enterprises have unknown agents in environment"
},
{
"kind": "quartile_checkpoint",
"label": "Q1 window check-in (25%)",
"status": "pending",
"weight": 0.05,
"ordinal": -5,
"source_id": null,
"expected_date": "2026-06-13",
"observed_date": null
},
{
"kind": "quartile_checkpoint",
"label": "Q2 window check-in (50%)",
"status": "pending",
"weight": 0.05,
"ordinal": -4,
"source_id": null,
"expected_date": "2026-11-24",
"observed_date": null
},
{
"kind": "llm_post_event",
"label": "First publicly attributed AI-agent-driven ≥$100M direct financial loss event",
"source": "https://www.geekqu.com/ai-outages-in-2026-why-infrastructure-is-failing/",
"status": "pending",
"weight": 0.4,
"ordinal": -3,
"source_id": null,
"confidence": 0.45,
"expected_date": "2027-05-02",
"research_origin": "training",
"expected_date_range": {
"to": "2027-12-31",
"from": "2026-09-01"
},
"measurement_criterion": "Public corporate disclosure or major news outlet attributes ≥$100M direct loss to autonomous AI agent action"
},
{
"kind": "llm_post_event",
"label": "First major
... (truncated)