AI 2027 vs AI Futures Project Self-Grading
The AI Futures Project — the team behind AI 2027 — published a detailed self-grading of their 2025 predictions in early 2026. This is a unique comparison: the authors themselves assessing how their scenario held up against reality.
We compare their findings with our independent tracker assessments below.
The Big Picture
| Metric | AI Futures Self-Grading | Our Tracker |
|---|---|---|
| Overall pace | 65% of predicted pace | ~70% speed ratio |
| Quantitative predictions | 58–66% of pace | Mixed — some confirmed, some behind |
| Qualitative predictions | “Most on pace” | ~60% confirmed or on-track |
| Adjusted takeoff window | Mid-2028 to mid-2030 | Not yet scored (original: early 2027) |
Why the Numbers Differ Slightly
The AI Futures team uses a pace-of-progress multiplier, measuring what fraction of the predicted change actually happened. Our tracker uses status categories (confirmed, on-track, behind, etc.), which capture direction and timing but not the exact magnitude of deviation.
Their 65% pace multiplier and our ~70% speed ratio are broadly consistent. Both say: progress is real but slower than the scenario depicted.
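The translation from a pace multiplier to a calendar delay is simple division: at a constant pace, a milestone predicted T years out instead arrives after T / pace years. A minimal sketch of that arithmetic (our illustration, not the AI Futures Project's actual methodology; the 65% figure and the roughly two-year horizon from publication to the original early-2027 takeoff are taken from the article):

```python
def delayed_eta(years_out: float, pace: float) -> float:
    """Years until a milestone predicted `years_out` away,
    assuming progress runs at a constant `pace` multiplier."""
    return years_out / pace

# AI 2027 placed takeoff roughly 2 years after publication (early 2027).
predicted = 2.0

print(round(delayed_eta(predicted, 0.65), 1))  # 3.1 years -> roughly mid-2028
print(round(delayed_eta(predicted, 0.50), 1))  # 4.0 years -> closer to a 3-year delay
```

This is why a seemingly modest pace shortfall compounds into a multi-year shift: the slower the pace, the more the remaining distance stretches out.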
Prediction-by-Prediction Comparison
Coding & Benchmarks
| Prediction | AI Futures Assessment | Our Tracker | Agreement? |
|---|---|---|---|
| SWE-bench 85% by mid-2025 | Behind — best was 74.5% (Opus 4.1) | Behind (conf. 0.80) | Agree |
| Coding agents transform development | “Fairly accurate” — Claude Code at $500M+ run-rate | Confirmed (conf. 1.0) | Agree |
| METR time horizons | 0.66×–1.04× of predicted pace | → On Track (conf. 0.75) | Broadly agree |
| Agents unreliable | “Broadly accurate” — coding agents slightly more reliable than expected | Confirmed (conf. 0.85) | Agree |
Economic & Revenue
| Prediction | AI Futures Assessment | Our Tracker | Agreement? |
|---|---|---|---|
| OpenAI revenue ($18B by 2025) | Slightly ahead (~$20B annualized) | Emerging toward $35B 2026 target (conf. 0.55) | Agree |
| OpenAI valuation ($500B by Jun 2025) | Behind — hit $500B in Oct 2025 | Behind on $2.5T 2026 target (conf. 0.70) | Agree |
| Infrastructure investment | Not specifically graded | Confirmed (conf. 1.0) | — |
AI Capabilities & Research
| Prediction | AI Futures Assessment | Our Tracker | Agreement? |
|---|---|---|---|
| First glimpse of AI agents | Correct — ChatGPT agent (Jul 2025) | Confirmed (conf. 0.85) | Agree |
| AI for AI research focus | AIs helping with coding, less with other research | Confirmed (conf. 0.85) | Agree |
| AI R&D uplift | Behind pace — estimates revised downward | → On Track for 1.5× (conf. 0.70) | We’re slightly more optimistic |
| Models frequently updated | Correct — GPT-4o → GPT-5 → GPT-5.1 | → On Track (conf. 0.75) | Agree |
Safety & Security
| Prediction | AI Futures Assessment | Our Tracker | Agreement? |
|---|---|---|---|
| AI good at hacking/bioweapons | On track — Anthropic upgraded to ASL-3 | → On Track (conf. 0.70) | Agree |
| Sycophancy and deception | On track — MechaHitler partly user-prompted | → On Track (conf. 0.65) | Agree |
| Model spec / alignment training | “Already true at publication” | Confirmed (conf. 0.80) | Agree |
Where They Got It Wrong
| Prediction | AI Futures Assessment | Our Tracker |
|---|---|---|
| Others 3–9 months behind OpenAI | Race closer than predicted (0–2 month lead) | Behind — gap narrower than expected |
| Compute growth (largest training run) | Uncertain — no confirmed run much larger than GPT-4.5 | Behind on 10²⁸ FLOP target |
Key Divergences: Our Tracker vs Their Self-Grading
Where We’re More Optimistic
- AI R&D multiplier (1.5×): We rate this “on track” based on PostTrainBench results, Karpathy’s AutoResearch, and lab infrastructure. They say R&D uplift is “behind pace.” The difference may be definitional — they may be measuring a narrower uplift, while we look at the broader trend of AI contributing to AI research.
Where We Agree Most Strongly
- Coding transformation is real — both assessments give this full marks
- Agents exist but struggle with reliability — both confirmed
- Revenue growth is approximately on track — both agree
- SWE-bench progress is slower than predicted — strong agreement
Where Neither of Us Can Say Much Yet
- Neuralese/non-text reasoning — too early (our status: emerging, conf. 0.35)
- Self-replication capabilities — too early (our status: emerging, conf. 0.50)
- Superhuman coder — not yet testable (predicted March 2027)
Their Adjusted Timeline
The AI Futures team’s biggest update is on timing, not direction:
| Original AI 2027 Timeline | Updated Estimate (Feb 2026) |
|---|---|
| Takeoff: Early 2027 | Takeoff: Mid-2028 to mid-2030 |
| Full coding automation: ~2027 | Daniel: 2029 median / Eli: early 2030s |
| 65% pace → ~2 year delay | With compute slowdowns → ~3 year delay |
This is a significant shift. The scenario’s direction is largely intact — AI research acceleration, coding automation, agent deployment, infrastructure buildout — but the timing is 2–3 years later than originally depicted.
What They’ll Track in 2026
The AI Futures team identified four key metrics to watch:
- AI R&D uplift — AI 2027 predicted 1.9× by end 2026. This is the single most important metric for the takeoff timeline.
- Revenue & valuations — $55B revenue, $2.5T valuation predicted for 2026. Tests economic traction.
- Coding time horizons — ~3 work weeks by end 2026 on central trajectory (via METR measurements).
- Other benchmarks — SWE-bench, VideoGameBench, and emerging evaluations.
Our tracker monitors all of these plus governance, geopolitics, and security developments that the self-grading didn’t specifically cover.
Who Is More Aggressive?
| Area | More Aggressive Source | Notes |
|---|---|---|
| Overall timeline | AI 2027 original (early 2027 takeoff) | Authors themselves now say mid-2028 to mid-2030 |
| Coding progress | AI 2027 original | 85% SWE-bench target missed significantly |
| Revenue growth | Roughly equal | Both see ~on-pace revenue |
| AI R&D uplift | AI 2027 original | Authors downgraded their own estimate |
| Agent deployment | Roughly equal | Both confirmed agents arrived on schedule |
| Lab competition gap | AI 2027 original | Predicted 3–9 month gaps; actual is 0–2 months |
Bottom Line
The AI 2027 authors are their own best critics. Their February 2026 self-grading confirms what our tracker shows independently: the direction of AI progress broadly matches the scenario, but the pace is about 65–70% of what was predicted. This translates to roughly a 2–3 year delay in the most dramatic milestones.
The strongest signal: both our tracker and the authors themselves agree that the qualitative predictions are holding up better than the quantitative ones. AI agents exist, coding is being transformed, labs are racing, infrastructure is booming — it’s just happening somewhat more slowly than the scenario’s compressed timeline.
Related:
- Which AI 2027 predictions came true?
- AI 2027 vs Reality — the full picture
- AI 2027 vs Metaculus crowd forecasts
- Browse all 48 predictions
Data source: Grading AI 2027’s 2025 Predictions by Daniel Kokotajlo and Eli Lifland, AI Futures Project (~February 2026)