The AI Futures Project — the team behind AI 2027 — published a detailed self-grading of their 2025 predictions in early 2026. This is a unique comparison: the authors themselves assessing how their scenario held up against reality.

We compare their findings with our independent tracker assessments below.

The Big Picture

MetricAI Futures Self-GradingOur Tracker
Overall pace65% of predicted pace~70% speed ratio
Quantitative predictions58–66% of paceMixed — some confirmed, some behind
Qualitative predictions”Most on pace”~60% confirmed or on-track
Adjusted takeoff windowMid-2028 to mid-2030Not yet scored (original: early 2027)

Why the Numbers Differ Slightly

The AI Futures team uses a pace-of-progress multiplier — measuring what fraction of the predicted change actually happened. Our tracker uses status categories (confirmed, on-track, behind, etc.) which capture direction and timing but not the exact magnitude of deviation.

Their 65% pace multiplier and our ~70% speed ratio are broadly consistent. Both say: progress is real but slower than the scenario depicted.


Prediction-by-Prediction Comparison

Coding & Benchmarks

PredictionAI Futures AssessmentOur TrackerAgreement?
SWE-bench 85% by mid-2025Behind — best was 74.5% (Opus 4.1)Behind (conf. 0.80)Agree
Coding agents transform development”Fairly accurate” — Claude Code at $500M+ run-rateConfirmed (conf. 1.0)Agree
METR time horizons0.66×–1.04× of predicted pace→ On Track (conf. 0.75)Broadly agree
Agents unreliable”Broadly accurate” — coding agents slightly more reliable than expectedConfirmed (conf. 0.85)Agree

Economic & Revenue

PredictionAI Futures AssessmentOur TrackerAgreement?
OpenAI revenue ($18B by 2025)Slightly ahead (~$20B annualized)Emerging toward $35B 2026 target (conf. 0.55)Agree
OpenAI valuation ($500B by Jun 2025)Behind — hit $500B in Oct 2025Behind on $2.5T 2026 target (conf. 0.70)Agree
Infrastructure investmentNot specifically gradedConfirmed (conf. 1.0)

AI Capabilities & Research

PredictionAI Futures AssessmentOur TrackerAgreement?
First glimpse of AI agentsCorrect — ChatGPT agent (Jul 2025)Confirmed (conf. 0.85)Agree
AI for AI research focusAIs helping with coding, less with other researchConfirmed (conf. 0.85)Agree
AI R&D upliftBehind pace — estimates revised downward→ On Track for 1.5× (conf. 0.70)We’re slightly more optimistic
Models frequently updatedCorrect — GPT-4o → GPT-5 → GPT-5.1→ On Track (conf. 0.75)Agree

Safety & Security

PredictionAI Futures AssessmentOur TrackerAgreement?
AI good at hacking/bioweaponsOn track — Anthropic upgraded to ASL-3→ On Track (conf. 0.70)Agree
Sycophancy and deceptionOn track — MechaHitler partly user-prompted→ On Track (conf. 0.65)Agree
Model spec / alignment training”Already true at publication”Confirmed (conf. 0.80)Agree

Where They Got It Wrong

PredictionAI Futures AssessmentOur Tracker
Others 3–9 months behind OpenAIRace closer than predicted (0–2 month lead)Behind — gap narrower than expected
Compute growth (largest training run)Uncertain — no confirmed run much larger than GPT-4.5Behind on 10²⁸ FLOP target

Key Divergences: Our Tracker vs Their Self-Grading

Where We’re More Optimistic

  • AI R&D multiplier (1.5×): We rate this “on track” based on PostTrainBench results, Karpathy’s AutoResearch, and lab infrastructure. They say R&D uplift is “behind pace.” The difference may be definitional — they may be measuring a narrower uplift, while we look at the broader trend of AI contributing to AI research.

Where We Agree Most Strongly

  • Coding transformation is real — both assessments give this full marks
  • Agents exist but struggle with reliability — both confirmed
  • Revenue growth is approximately on track — both agree
  • SWE-bench progress is slower than predicted — strong agreement

Where Neither of Us Can Say Much Yet

  • Neuralese/non-text reasoning — too early (our: emerging, conf. 0.35)
  • Self-replication capabilities — too early (our: emerging, conf. 0.50)
  • Superhuman coder — not yet testable (predicted March 2027)

Their Adjusted Timeline

The AI Futures team’s biggest update is on timing, not direction:

Original AI 2027 TimelineUpdated Estimate (Feb 2026)
Takeoff: Early 2027Takeoff: Mid-2028 to mid-2030
Full coding automation: ~2027Daniel: 2029 median / Eli: early 2030s
65% pace → ~2 year delayWith compute slowdowns → ~3 year delay

This is a significant shift. The scenario’s direction is largely intact — AI research acceleration, coding automation, agent deployment, infrastructure buildout — but the timing is 2–3 years later than originally depicted.


What They’ll Track in 2026

The AI Futures team identified four key metrics to watch:

  1. AI R&D uplift — AI 2027 predicted 1.9× by end 2026. This is the single most important metric for the takeoff timeline.
  2. Revenue & valuations — $55B revenue, $2.5T valuation predicted for 2026. Tests economic traction.
  3. Coding time horizons — ~3 work weeks by end 2026 on central trajectory (via METR measurements).
  4. Other benchmarks — SWE-bench, VideoGameBench, and emerging evaluations.

Our tracker monitors all of these plus governance, geopolitics, and security developments that the self-grading didn’t specifically cover.


Who Is More Aggressive?

AreaMore Aggressive SourceNotes
Overall timelineAI 2027 original (early 2027 takeoff)Authors themselves now say mid-2028 to mid-2030
Coding progressAI 2027 original85% SWE-bench target missed significantly
Revenue growthRoughly equalBoth see ~on-pace revenue
AI R&D upliftAI 2027 originalAuthors downgraded their own estimate
Agent deploymentRoughly equalBoth confirmed agents arrived on schedule
Lab competition gapAI 2027 originalPredicted 3–9 month gaps; actual is 0–2 months

Bottom Line

The AI 2027 authors are their own best critics. Their February 2026 self-grading confirms what our tracker shows independently: the direction of AI progress broadly matches the scenario, but the pace is about 65–70% of what was predicted. This translates to roughly a 2–3 year delay in the most dramatic milestones.

The strongest signal: both our tracker and the authors themselves agree that the qualitative predictions are holding up better than the quantitative ones. AI agents exist, coding is being transformed, labs are racing, infrastructure is booming — it’s just happening somewhat more slowly than the scenario’s compressed timeline.


Related:

Data source: Grading AI 2027’s 2025 Predictions by Daniel Kokotajlo and Eli Lifland, AI Futures Project (~February 2026)