Model Watch · Reviewed

Google Gemini 3

Announced Nov 18, 2025Reviewed Jun 23, 2026

What they claimed

Google positioned Gemini 3 as its most intelligent model and the new state of the art in reasoning, multimodal understanding, and agentic 'vibe' coding. It led with record benchmark figures, including a 1501 Elo score on the LMArena leaderboard, 37.5% on Humanity's Last Exam with no tools, and 91.9% on GPQA Diamond, plus a Deep Think mode claiming even higher. The framing was that Gemini 3 had taken the outright frontier lead rather than merely matching rivals.

What shipped

Gemini 3 Pro shipped on day one across the Gemini app, AI Mode in Search for paid subscribers, the Gemini API, AI Studio, Vertex AI, and a new agentic coding tool called Google Antigravity, with the higher-end Deep Think mode promised to Ultra subscribers in the following weeks.

The verdict

Gemini 3 looks like the launch that finally validated Google's frontier ambitions after the panic of Bard and the staged demo of Gemini 1.0: this time the strong benchmarks were paired with same-day, broad availability rather than a promise. Independent leaderboards and early reviews corroborated that it took the top position over prior leaders from OpenAI and Anthropic, at least at launch. The usual caveats apply and the picture is still settling: leaderboard rankings rotate quickly, vendor-run benchmark numbers should be read skeptically, and competitors have since responded. For executives, the takeaway is that the lead is real but provisional, the kind of advantage measured in months, not years.

Why it matters

It marks the point where Google credibly retook the AI frontier, a reminder that leadership in this market is contestable and that 'who is ahead' is a snapshot, not a standing.

Sources

All tracked models