Computer-using agents marketed as 'personal assistants'

Author Johannes Haus
Last updated
Confirmed · Agent Autonomy · 85% confidence
Predicted: Mid 2025 · Updated: 2026-03-13 · Source: ai-2027.com, Mid 2025: Stumbling Agents
Advertisements for computer-using agents emphasize the term 'personal assistant': you can prompt them with tasks like 'order me a burrito on DoorDash' or 'open my budget spreadsheet and sum this month's expenses.'

At a glance

  • Assessment: Confirmed
  • Confidence: 85%
  • Predicted timing: Mid 2025
  • Primary source: ai-2027.com, Mid 2025: Stumbling Agents

What AI 2027 Predicted

The scenario describes mid-2025 as the era when computer-using agents are heavily marketed as “personal assistants.” The framing emphasizes consumer-friendly tasks: ordering food, managing spreadsheets, browsing on your behalf. However, the scenario also predicts that adoption would be limited — the agents are “impressive in cherry-picked examples, but in practice unreliable,” and most people would not find them worth the hassle for everyday tasks.

How We Track This

We monitor:

  • Launches of computer-use agents from major labs (OpenAI Operator, Anthropic Computer Use, Google Project Mariner)
  • Marketing language and positioning of these products
  • Consumer adoption metrics and usage data
  • User reviews and reliability reports
  • Computer-use benchmark scores (CUB, WebArena, OSWorld)

Current Evidence

This prediction has been confirmed. Multiple labs launched computer-using agents in 2025, marketed with exactly the personal-assistant framing AI 2027 described:

OpenAI Operator (January 2025): OpenAI launched Operator as a computer-using agent available to ChatGPT Pro subscribers. Marketing emphasized tasks like booking restaurants, ordering groceries, and filling out forms — classic personal assistant positioning.

Anthropic Computer Use: Claude’s computer use capability was released in beta in October 2024 and expanded through 2025, enabling the model to control a computer desktop to complete tasks.

Google Project Mariner: Google launched its computer-use agent through the Gemini ecosystem, with similar personal-assistant-style marketing.

Adoption remains limited: Consistent with AI 2027’s prediction, adoption has been tepid. Computer-use benchmark (CUB) scores remain low — the best agents achieved around 10% on comprehensive real-world task suites as of early 2026. User reports consistently describe agents as impressive in demos but frustrating in practice. The AI Futures Project graded the “agents impressive but unreliable” prediction as “broadly accurate.”

The marketing-reality gap: Agents are marketed for tasks like “order me a burrito on DoorDash” but frequently fail on multi-step real-world tasks involving authentication, dynamic web interfaces, or error recovery. This gap is consistent with the scenario’s depiction of the period, though it’s worth noting that a marketing-reality gap in new technology is a common pattern, not a uniquely prescient prediction.

Sources:

Counterevidence & Limitations

  • Coding agents (Claude Code, Cursor) have seen much stronger adoption than consumer personal-assistant agents — the scenario somewhat understates the developer-tool path
  • Some agents may be gaining traction in narrow enterprise use cases that don’t match the consumer framing
  • The “personal assistant” marketing may be shifting toward “coding assistant” or “work agent” as companies realize consumer adoption is limited
  • It’s hard to measure adoption precisely — lab revenue numbers conflate agent and chatbot usage

What Would Change Our Assessment

  • Already confirmed. Would strengthen with quantitative adoption data showing low consumer agent usage relative to marketing spend
  • Partial revision: If computer-use agents achieve high reliability (>50% CUB scores) and mainstream adoption, the “low adoption” part of the prediction would weaken, though the marketing framing would remain confirmed

Update History

DateUpdate
2026-03-13Marketing framing matches prediction precisely. Adoption remains low — CUB benchmark scores ~10% — but the product category is established as predicted.
2025-10OpenAI launches ChatGPT Atlas browser (October 21) as a direct Chrome competitor with integrated multi-step task capability. The race to own the primary human-computer interface is explicitly framed around AI assistants replacing browsers as the access layer. OpenAI launches Operator; Anthropic ships Claude Computer Use; Google releases Project Mariner. All positioned as personal assistants for computer tasks.
2025-09Google integrates Gemini directly into Chrome browser for U.S. desktop users (September 18), with Calendar, YouTube, and Maps integrations. “Personal assistant” framing is now the dominant positioning across all major AI products.
2025-08Google Search AI Mode goes global (August 21) with agentic capabilities including restaurant reservations via OpenTable and Resy. Search is being repositioned as an assistant that takes actions, not just returns links.
2025-05OpenAI Codex (May 15) launched with “works for you asynchronously” framing. Claude Code marketed around enterprise productivity. Google I/O (May 20) positions Gemini and Astra as personal and work assistants. The “personal assistant” framing is becoming universal.