SEM_036predictionAI/Cognitionjobs

World-class AI programmers will appear within 1-2 years (by 2027-2028).

Predictor: Eric Schmidt

Prior probability

72.0%

Current probability

69.7%

evolves via intake + LBP

Conviction

4/5

Signal quality

Resolution

in_progress

Window

2026-01-01 – 2028-12-31

Edges in / out

12 / 1

Tickers exposed

Prediction text

World-class AI programmers will appear within 1-2 years (by 2027-2028). | SWE-Bench + enterprise code-gen share reports

Key catalyst: SWE-Bench + enterprise code-gen share reports

Watch events: SWE-Bench / METR evals; Claude Code / Cursor enterprise adoption metrics

Resolution evidence

Status: in_progress

Claude Opus 4.7 / Claude Code already at 42-54% global code-gen market share per workbook Thematic_Findings. SWE-Bench scores from frontier models already matching median professional programmers.

Predictor: Eric Schmidt

κ + Brier as of 2026-05-22

Full calibration →

κ (discount)

0.688

Brier

0.0064

excellent

Hits / Misses

3 / 0

of 3 resolved

Hit rate

100.0%

Calibration plot (stated vs observed)

Evidence about this node from Eric Schmidt is multiplied by κ in /api/intake. Lower κ = less weight; floors at 0.10 (effectively silenced) and caps at 1.00 (full weight).

Reference class: ai_capability_milestone_2y

Linked

All classes →

AI reaches specific named capability (intern-level / world-class programmer / etc) within 2y of stated target

Base rate

—

5/15 historical

Inside weight

—

Outside weight

—

no pull

inside 69.7% → blend 69.7% (Δ 0.0pp)

Tetlock-style outside view: at TRF=1 (just predicted), outside view dominates (w_in=0.3). At TRF=0 (deadline), inside view dominates (w_in=1.0). The blend regularizes overconfident inside views toward the historical base rate.

Probability over time

8 prob_history rows

intake v2milestone miss sweeplbp propagationreference class assignedlegacy v1prior_prob (analyst seed)current = 69.7%

Milestone chain

Pre-event signals (upstream prereqs + window checkpoints) → resolution event → downstream cascades. Status/dates update from linked nodes; re-derive nightly via scripts/ops/derive_milestones.py.

Leading chain: 6 fired ✓ · 4 pending

2026-04-29hitDavid Sinclair begins partial epigenetic reprogramming trials with Life Biosciences in March 2026.
2026-04-29hitNvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a
2026-04-29hitTraining runs costing $10 billion for a single model will commence sometime in 2025.
2026-04-29hit2025 will be the definitive year that agentic systems finally hit the mainstream.
2026-04-29hitJury selection begins April 27, 2026 for Musk v OpenAI trial
2026-04-30hitArtemis 2 launches April 1, 2026 - circumlunar mission
2026-08-01 → 2027-03-31pendingSWE-Bench Verified reaches ≥95%
How: Top model score on SWE-Bench Verified ≥95% — within margin-of-error of 'world-class human engineer' baseline
Source: Anthropic blog, OpenAI evals, Papers With Code, METRconf 65%
2026-09-01 → 2027-08-31pendingAI achieves Codeforces Grandmaster (2400+) or top-100 ICPC equivalent
How: AI lab demonstrates an AI system competing in Codeforces Round at Grandmaster division (rating ≥2400) OR scoring at top-100 ICPC World Finals level on equivalent problem set
Source: Codeforces.com, ICPC.global, lab blog posts (DeepMind AlphaCode follow-ups)conf 70%
Notes: DeepMind AlphaCode 2 already at top-15% in 2023; 'Grandmaster' is a specific public rating threshold.
2027-01-01 → 2027-12-31pendingFrontier lab uses 'world-class programmer' framing in official communications
How: Anthropic, OpenAI, or Google explicitly characterizes a model as 'world-class', 'expert-level', or 'top-1%' programmer in official lab communication, citing specific benchmarks
Source: Lab blog posts, conference keynotes (Davos, OpenAI DevDay, Google I/O), CEO interviewsconf 65%
Notes: Schmidt's claim (the prediction author) is itself a directional bet — labs will validate via marketing language well before consensus.
2027-04-01 → 2028-06-30pendingAI solves research-level (PhD-difficulty) coding problem unaided
How: Public demonstration of AI agent solving a problem from RE-Bench, FrontierMath-coding, or similar research-level eval suite with no human intervention
Source: METR.org, Apollo Research, lab blog posts, arXiv papersconf 55%
2028-01-29pendingWorld-class AI programmers will appear within 1-2 years (by 2027-2028).
2027-06-01 → 2028-12-31pendingFrontier model passes standardized senior+ engineering interview process
How: Public demonstration or hiring-leader testimony that a model passes the standardized senior+ ('staff', 'principal') engineering interview at a top-tier tech company (FAANG/MAANG-level) including system design, coding, and behavioral rounds
Source: Lab blogs, hiring leader posts on LinkedIn/Twitter, Levels.fyi, possibly leaked internal interview rubricsconf 50%
Notes: Resolution proxy — once frontier models pass real hiring bars, the prediction is essentially confirmed.
2028-01-01 → 2028-12-31pendingCascade: open-source model crosses world-class programmer threshold
How: Open-source / open-weights model achieves ≥85% on SWE-Bench Verified or comparable, demonstrating commoditization
Source: HuggingFace leaderboards, Llama/DeepSeek/Qwen model card releasesconf 55%
Notes: Cascade — affects scenarios where AI capability commoditization changes labor displacement timing.
2036-07-03pendingJob displacement will be issue 6-10 not top 5 in 10 years; AI discoveries will dominate

What if this resolves?

Clamp this prediction TRUE or FALSE and run a counterfactual Gibbs sample. Surfaces the predictions whose marginals shift most under that assumption.

(live posterior: 70%)

Click a button to clamp this prediction and run a Gibbs sample. Returns the predictions whose marginals shift most. ~30s per run; ideal for stress-testing "if X resolves, what else moves?"

Evidence chain

Every probability update with full Bayesian provenance — chronological, latest first

LBP2026-05-24T02:00:02Z69.7%-11.4pp

Network propagation: 81.1% → 69.7%

4-iter LBP, residual 0.01000 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 806b02f8

intake_event_update2026-05-21T23:15:16Z81.1%+22.4pp

intake:7afeeb9a-f217-4dd2-b910-24ff14bdfc39 bayesian_v2 inside=0.811 blend=0.811 LLR=1.106 κ=0.69 no_blend

Raw metadata

{
  "trf": 0.8712612462476746,
  "kappa": 0.6875,
  "base_rate": null,
  "predictor": "Eric Schmidt",
  "total_llr": 1.6094379124341,
  "bayesian_v2": true,
  "prior_logit": 0.3492657634762088,
  "bayes_factor": "3.0:1 favoring",
  "blend_reason": "no reference_class linked",
  "inside_prior": 0.5864395171566453,
  "kappa_source": "predictor_table",
  "blend_applied": false,
  "contributions": [
    {
      "llr": 1.6094379124341,
      "kappa": 0.6875,
      "label": "Capability acceleration curve (4.3-month doubling) puts 'world-class programmer' threshold within 2027.",
      "adjusted_llr": 1.106488564798444
    }
  ],
  "evidence_kind": "intake_event_update",
  "inside_source": "history_v2",
  "inside_weight": 1,
  "outside_weight": 0,
  "posterior_prob": 0.8108824513148017,
  "evidence_origin": "daily_intake",
  "llm_suggestions": [
    {
      "polarity": "corroborates",
      "status_change": "unchanged",
      "evidence_strength": "strong",
      "delta_prob_suggestion": 0.06
    }
  ],
  "posterior_logit": 1.455754328274653,
  "predictor_brier": 0.0064,
  "evidence_doc_ids": [],
  "inside_posterior": 0.8108824513148017,
  "blended_posterior": 0.8108824513148017,
  "reference_class_id": null,
  "total_adjusted_llr": 1.106488564798444,
  "predictor_n_resolved": 3
}

LBP2026-05-17T02:00:01Z58.6%-4.7pp

Network propagation: 63.3% → 58.6%

5-iter LBP, residual 0.00689 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e607fa96

LBP2026-05-10T02:00:02Z63.3%-8.1pp

Network propagation: 71.5% → 63.3%

6-iter LBP, residual 0.00584 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run e5c18d29

LBP2026-05-03T02:00:01Z71.5%-11.9pp

Network propagation: 83.3% → 71.5%

6-iter LBP, residual 0.00677 · damping 0.5, w_intrinsic 0.5 · method lbp_v3 · run 1a683ac9

legacy v12026-04-30T19:17:54Z83.3%+21.0pp

intake:99aa73db-75b1-4b1e-8470-a11f87b23937 bayesian_v2 inside=0.833 blend=0.833 LLR=1.106 κ=0.69 no_blend

LBP2026-04-30T16:39:51Z62.3%-4.1pp

Network propagation: 66.4% → 62.3%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v2 · run 0c8a4ea3

LBP2026-04-30T02:18:57Z66.4%-5.6pp

Network propagation: 72.0% → 66.4%

5-iter LBP, residual 0.00825 · damping 0.5, w_intrinsic 0.5 · method lbp_v1 · run 592311ef

Network propagation neighbors

Top edges sorted by latest LBP cross-impact

All propagation →

Top incoming (parents)

Edges that influence THIS node's belief

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
prereq	242_057 Artemis 2 launches April 1, 2026 - circumlunar mission — Peter Diamandis	56.2%	0.720	0.050	-0.280
prereq	247_058 Jury selection begins April 27, 2026 for Musk v OpenAI trial — Peter Diamandis	71.4%	0.720	0.050	-0.175
prereq	SEM_042 2025 will be the definitive year that agentic systems finall — Kevin Weil	73.8%	0.720	0.050	-0.159
prereq	235_038 David Sinclair begins partial epigenetic reprogramming trial — Peter Diamandis	74.0%	0.720	0.050	-0.156
prereq	SEM_012 Nvidia quadrupled chip production output while only doubling — Jensen Huang	75.0%	0.720	0.050	-0.150

Top outgoing (children)

Predictions THIS node influences

Kind	Node	Their prob	P(c\|s=T)	P(c\|s=F)	Δ implied
prereq	234_036 Job displacement will be issue 6-10 not top 5 in 10 years; A — Alex Wissner-Gross	28.8%	0.450	0.050	+0.010

Ticker exposure

32 ticker(s) linked

Beneficiaries (23)

ADUS COUR DOCN FROG GTLB INOD PL ROL SPIR SRFM UDMY TEAM NFLX PLTR RDDT UBER AMZN BABA SPOT GDDY GOOGL META MSFT

Adverse (6)

RHI BXP SLG MAN KFY TNET

Prerequisites (12)

Predictions that must hit first

Type	Pred	Title	Domain	Lag
prereq	235_038	David Sinclair begins partial epigenetic reprogramming trials with Life Biosciences in March 2026.	Biotech/Longevity	—
prereq	238_009	Recursive self-improvement is already happening now (no longer three years out)	AI	—
prereq	247_058	Jury selection begins April 27, 2026 for Musk v OpenAI trial	AI	—
prereq	SEM_008	Training runs costing $10 billion for a single model will commence sometime in 2025.	AI	—
prereq	SEM_012	Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) across engineering.	AI/Manufacturing	—
prereq	SEM_042	2025 will be the definitive year that agentic systems finally hit the mainstream.	AI/Agents	—
prereq	242_057	Artemis 2 launches April 1, 2026 - circumlunar mission	Space	—
correlate	S_AGI_MID_2029	AGI mid: Kurzweil 2029 path	agi_general_capability	—
correlate	S_ASI_MID_2034	ASI mid: Schmidt 'ASI in 6 years'	asi_recursive_self_improvement	—
correlate	S_AI_PAUSE_2026	Major-country AI pause beginning 2026	ai_regulatory_pause	—
killer	TK04	Macro Recession 2026-27 (Structural Deleveraging)	—	—
killer	TK07	Labor Political Backlash (UBI Mandate / AI Tax)	—	—

Dependents (1)

Predictions enabled by this

Type	Pred	Title	Domain	Lag
prereq	234_036	Job displacement will be issue 6-10 not top 5 in 10 years; AI discoveries will dominate	Labor/Jobs	—

Expected milestones (1)

From Sheet 17 Monitoring Triggers

Expected by	Description	Status
2027-09-30	[Capability 2027-09] [SEM_036] SWE-Bench / METR evals; Claude Code / Cursor enterprise adoption metrics [SEM_049] SWE-Bench + METR long-horizon eval results; enterprise code-gen market share	pending

Validations (1)

Resolution events

Observed at	Status	By	Notes
2026-04-29	partial	thesis_timeline_v1.0_import	Claude Opus 4.7 / Claude Code already at 42-54% global code-gen market share per workbook Thematic_Findings. SWE-Bench scores from frontier models already matching median professional programmers.

Linked documents (6)

Auto-generated by cosine similarity from Polymarket / Manifold / EDGAR / GDELT

Sim	Source	Title	Market prob	Polarity	Reviewed	Published
0.900	codex_research_pack	METR - Measuring AI Ability to Complete Long Tasks	—	corroborates	pending	2025-03-19
0.900	codex_research_pack	OECD - Exploring Possible AI Trajectories Through 2030	—	corroborates	pending	2026-04-26
0.630	manifold	When will the Jai programming language be public?	—	mentions	pending	2026-05-10
0.615	manifold	which will happen first? (Codeforces Rating)	—	mentions	pending	2026-05-24
0.596	manifold	Will MNX hire a backend developer before Manifest?	26%	mentions	pending	2026-04-30
0.567	manifold	2026 F1 Canada Qualification who qualifies better in each team	—	mentions	pending	2026-05-04

Raw metadata

From Thesis_Timeline_v1.0_FINAL workbook

{
  "nia": false,
  "mode": "FORECAST",
  "role": "Cited-Executive",
  "context": "Schmidt forecasts AI programmers matching world-class human level within 1-2y window.",
  "to_year": 2028,
  "conv_cues": "appearing within 1-2 years",
  "direction": "HAPPEN",
  "from_year": 2026,
  "timeframe": "1-2 years",
  "conv_level": "HIGH",
  "milestones": [
    {
      "kind": "prereq",
      "label": "David Sinclair begins partial epigenetic reprogramming trials with Life Biosciences in March 2026.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -10,
      "source_id": "235_038",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Nvidia quadrupled chip production output while only doubling human headcount — achieved by deploying AI coding tools (Cursor, Claude Code) a",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -9,
      "source_id": "SEM_012",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Training runs costing $10 billion for a single model will commence sometime in 2025.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -8,
      "source_id": "SEM_008",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "2025 will be the definitive year that agentic systems finally hit the mainstream.",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -7,
      "source_id": "SEM_042",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Jury selection begins April 27, 2026 for Musk v OpenAI trial",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -6,
      "source_id": "247_058",
      "expected_date": "2026-04-29",
      "observed_date": "2026-04-29"
    },
    {
      "kind": "prereq",
      "label": "Artemis 2 launches April 1, 2026 - circumlunar mission",
      "status": "hit",
      "weight": 0.5,
      "ordinal": -5,
      "source_id": "242_057",
      "expected_date": "2026-04-30",
      "observed_date": "2026-04-30"
    },
    {
      "kind": "llm_pre_event",
      "label": "SWE-Bench Verified reaches ≥95%",
      "source": "Anthropic blog, OpenAI evals, Papers With Code, METR",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -4,
      "source_id": null,
      "confidence": 0.65,
      "expected_date": "2026-11-30",
      "research_origin": "training",
      "expected_date_range": {
        "to": "2027-03-31",
        "from": "2026-08-01"
      },
      "measurement_criterion": "Top model score on SWE-Bench Verified ≥95% — within margin-of-error of 'world-class human engineer' baseline"
    },
    {
      "kind": "llm_pre_event",
      "label": "AI achieves Codeforces Grandmaster (2400+) or top-100 ICPC equivalent",
      "notes": "DeepMind AlphaCode 2 already at top-15% in 2023; 'Grandmaster' is a specific public rating threshold.",
      "source": "Codeforces.com, ICPC.global, lab blog posts (DeepMind AlphaCode follow-ups)",
      "status": "pending",
      "weight": 0.4,
      "ordinal": -3,
      "source_id": null,
      "confidence": 0.7,
      "expected_date": "2027-03-02",
      "research_origin": "training",
      "expected_date_range": {
        "to": "2027-08-31",
        "from": "2026-09-01"
      },
      "measurement_criterion": "AI lab demonstrates an AI system competing in Codeforces Round at Grandmaster division (rating ≥2400) OR scoring at top-100 ICPC World Finals level on equivalent problem set"
    },
    {
      "kind": "llm_pre_event",
      "label": "Frontier lab uses 'world-class programmer' framing in official communications",
      "notes": "Schmidt's claim (the prediction author) is itself a directional bet — labs will validate via marketing language well before consensus.",
      "source": "Lab blog posts, conference keynotes (Davo
... (truncated)