AI 2027 vs Metaculus Crowd Forecasts

Metaculus hosts an AI 2027 Tournament with 19 forecasting questions derived directly from the AI 2027 scenario. The tournament launched in June 2025 and tracks crowd predictions on technology, economics, politics, and societal impact through 2028 and beyond.

This page compares the AI 2027 scenario’s claims, the Metaculus crowd forecasts, and our independent tracker assessments.

How to Read This Comparison

Different measurement systems: AI 2027 makes narrative claims (“X will happen by Y”). Metaculus uses probability percentages or numerical estimates. Our tracker uses status labels (confirmed, on-track, behind, etc.). We bridge these below, but they’re fundamentally different — probabilities capture uncertainty; status labels capture current evidence.

Rough mapping:

Our Status	Approximate Probability Equivalent
Confirmed	>90% (already happened)
Ahead	>80% and faster than expected
→ On Track	60–80%
Emerging	30–60%
Behind	20–50% (direction right, timing wrong)
Not Yet Testable	No probability meaningful yet

Side-by-Side: Key Tournament Questions

Technology & Capabilities

Neuralese Recurrence

Source	Assessment
AI 2027	Non-text reasoning (“neuralese”) deployed in early 2027 — a key capability unlock
Metaculus crowd	Median: ~Jan 2028 for public release (Q38598)
Our tracker	Emerging (conf. 0.35) — predicted early 2027
Jan Wehner forecast	Oct 2026 (more aggressive than crowd)

Analysis: The crowd is more conservative than the scenario by about a year. Our tracker agrees this is still early — no frontier lab has shipped neuralese recurrence publicly. Meta published research in 2024, but production deployment remains uncertain.

VideoGameBench Performance

Source	Assessment
AI 2027	Implies rapid capability gains across diverse tasks
Metaculus crowd	Median: ~58% by Jan 2028 (range: 34%–83%) (Q38609)
Our tracker	Not directly tracked — closest: agent autonomy predictions
Jan Wehner forecast	80% (more aggressive than crowd)

Analysis: Starting from <1% in mid-2025, the crowd expects strong but incomplete progress. This aligns with the broader pattern: capabilities improve dramatically but don’t hit the most aggressive targets.

AI-Authored Scientific Papers

Source	Assessment
AI 2027	AI increasingly contributes to research, approaching researcher parity
Metaculus crowd	~78% chance of AI-authored paper at NeurIPS/ICML/ICLR before 2028 (Q estimated)
Our tracker	AI for AI research confirmed (conf. 0.85)
Jan Wehner forecast	84%

Analysis: Broad agreement. AI is already contributing meaningfully to research. The specific milestone of a published AI-authored paper is more about institutional acceptance than capability.

OpenAI Self-Improvement Risk Levels

Source	Assessment
AI 2027	AI models gain enough capability to contribute to AI R&D, creating recursive loops
Metaculus crowd	”High” risk level: median ~May 2027; “Critical”: much later (Q estimated)
Our tracker	→ On Track for 1.5× R&D multiplier (conf. 0.70)
Jan Wehner forecast	”High” by May 2027; “Critical” by July 2032

Analysis: This is a crucial question for the takeoff timeline. The crowd and our tracker broadly agree that meaningful AI self-improvement is approaching but not yet at crisis levels. The AI 2027 scenario is more aggressive on timing.

Economics & Compute

Total AI Compute Capacity (Jan 2028)

Source	Assessment
AI 2027	Massive compute scaling — trillions in infrastructure investment
Metaculus crowd	Actively forecasted (Q38419) — specific numerical estimate
Our tracker	Infrastructure investment confirmed (conf. 1.0); 10²⁸ FLOP training run behind (conf. 0.70)

Analysis: No disagreement that compute is scaling enormously. The debate is about how fast — and whether the largest training runs match the scenario’s timeline.

Compute Concentration

Source	Assessment
AI 2027	”OpenBrain” (based on OpenAI/Google) dominates; others 3–9 months behind
Metaculus crowd	Highest % of compute held by one org on Jan 2028 (Q38393)
Our tracker	Lab gap narrower than predicted (0–2 months, not 3–9)

Analysis: The scenario predicted more concentration than we’re seeing. Multiple labs remain competitive. The Metaculus question will help quantify how concentrated compute actually becomes.

Geopolitics & Governance

US-China AI Agreement

Source	Assessment
AI 2027	No meaningful cooperation — instead an accelerating race
Metaculus crowd	~4–5% chance of formal US-China agreement before 2029 (Q estimated)
Our tracker	Export controls confirmed impacting Chinese compute (conf. 0.80)
Jan Wehner forecast	4%

Analysis: Strong agreement across all sources. A formal US-China AI agreement is extremely unlikely. The competitive dynamic AI 2027 described is playing out.

AI as “Most Important Problem” (Gallup)

Source	Assessment
AI 2027	Growing public concern, anti-AI protests, political salience
Metaculus crowd	Median ~19% by Jan 2028 (Q38407)
Our tracker	Anti-AI protest emerging (conf. 0.50); Labor market impact ahead of predicted
Jan Wehner forecast	3% (dramatically lower than crowd)

Analysis: This is one of the biggest disagreements within the forecasting community. The Metaculus crowd seems very aggressive at 19% — for reference, “inflation” peaked at ~6% on Gallup. Jan Wehner’s 3% seems more calibrated to historical polling patterns. Our tracker sees growing public concern but hasn’t committed to a specific percentage.

Category Scorecard

Category	AI 2027 Scenario	Metaculus Crowd Lean	Our Tracker Assessment
Coding automation	Very aggressive (superhuman by Mar 2027)	Moderate — expects progress but slower	Mixed — agents confirmed, benchmarks behind
AI research acceleration	Central claim — recursive improvement by 2027	Moderate timeline (~2027–2028 for key milestones)	On track for 1.5× but timing uncertain
Compute scaling	Massive and rapid	Scaling confirmed, specifics debated	Confirmed but largest runs behind
Revenue/valuation	$55B revenue, $2.5T valuation by end 2026	Not directly tracked in tournament	Behind on valuation, emerging on revenue
Public concern	High — protests, political salience	High — 19% “most important problem”	Emerging signals, possibly ahead
Geopolitical race	Intense US-China competition, no cooperation	Strong agreement — 4–5% chance of agreement	Confirmed competition dynamic
Neuralese / novel architectures	Key unlock in early 2027	Conservative — median ~2028	Emerging, low confidence
Safety / alignment	Increasingly urgent, partially ignored	Moderate concern acknowledged	On track — ASL-3 upgrades, model specs

Who Is More Aggressive?

Question	Most Aggressive	Least Aggressive
Overall AI progress pace	AI 2027 scenario	Metaculus crowd
Public concern about AI	Metaculus crowd (19%)	Jan Wehner (3%)
Neuralese deployment	AI 2027 (early 2027)	Metaculus crowd (~2028)
US-China cooperation	All agree: very unlikely	—
Coding automation	AI 2027	Our tracker (benchmarks behind)
AI self-improvement risk	AI 2027	Metaculus crowd (more gradual)

Where All Three Sources Agree

These are the highest-confidence areas — when the scenario, crowd wisdom, and our evidence-based tracker all point the same direction:

AI coding tools are transforming software development — universally acknowledged
US-China AI cooperation is extremely unlikely — all sources below 5%
Compute infrastructure is scaling massively — no disagreement
AI agents exist but reliability remains a challenge — all confirm
AI is increasingly contributing to AI research — direction agreed, pace debated

Where They Disagree Most

Public salience of AI — 3% to 19% range on Gallup question
Neuralese timeline — 1+ year gap between scenario and crowd
Speed of coding automation — scenario’s SWE-bench targets missed
Self-improvement timeline — scenario expects earlier recursive loops than crowd

Methodology Note

Metaculus forecasts are live crowd predictions that update continuously. The values cited here were gathered in March 2026 from search results, the Jan Wehner analysis (August 2025), and Metaculus question descriptions. For the latest crowd forecasts, visit the Metaculus AI 2027 Tournament directly.

Our tracker assessments are evidence-based judgments updated as new information arrives. See our Methodology page for how we assign status labels and confidence scores.

Related:

The Metaculus tournament originally had 16 questions (as of August 2025) and has since expanded to 19. Not all questions map directly to our tracked predictions.