Gemini 3.1 Pro
Top PickGoogle's reasoning-optimized flagship, released February 19, 2026, and currently the #1 ranked model on the Artificial Analysis Intelligence Index (score: 57 out of 114 models). Gemini 3.1 Pro is a direct upgrade to Gemini 3 Pro — same 1M token context window and same $2/$12 pricing — but with dramatically improved reasoning. Its ARC-AGI-2 abstract reasoning score more than doubled from 31.1% to 77.1%, and it nearly doubled its APEX-Agents agentic task score (18.4% → 33.5%). It leads on scientific knowledge (GPQA Diamond 94.3%), competitive coding (LiveCodeBench Pro Elo 2887), and multi-step agentic search (BrowseComp 85.9%). A dedicated custom-tools API endpoint is available for agentic pipeline use. Currently in preview — generally available soon.
Context window
1.0M tokens
API (blended)
$4.50/1M
Consumer access
Free (limited) / $20/mo
Multimodal
Yes
Strengths
- +#1 Artificial Analysis Intelligence Index score (57) as of February 2026 — leads 114 models
- +ARC-AGI-2: 77.1% — more than doubled Gemini 3 Pro (31.1%); leads all known published models
- +APEX-Agents: 33.5% — best long-horizon agentic task performance; nearly doubled predecessor
- +GPQA Diamond: 94.3% scientific knowledge — highest published score across all models
- +Same $2/$12 pricing as Gemini 3 Pro — major capability upgrade at no extra cost
- +Dedicated custom-tools API endpoint (gemini-3.1-pro-preview-customtools) for agentic workflows
Weaknesses
- -Preview only as of February 2026 — not yet generally available
- -Time to first token: 29.96s — high latency makes it unsuitable for interactive or streaming use
- -GDPval-AA Elo only 1317 — trails Claude Sonnet 4.6 (1633) by 316 points on enterprise expert tasks
- -Very verbose — generates far more tokens per task (cost impact at scale)
- -Prompts over 200K tokens billed at 2× — full 1M context at scale gets expensive quickly
Best for
Not ideal for
Pricing details
Subscription plans
API pricing
Prices verified February 2026. LLM pricing changes frequently — verify at the provider's site before budgeting.
Last updated: February 2026