Rankings
Best LLMs Overall
Ranked by composite quality score across 5 dimensions. Price is not included — this ranking answers “which model is most capable,” not “which is cheapest.” For price-aware picks, see best by price.
Score = Intelligence (35) + Tool Use (30) + Context (10) + Trust (15) + Speed (10) = 100 pts. Rating = score ÷ 10.
Tool Use (verified capability checklist): tool calling API (+10), MCP support (+10), parallel tool calls (+5).
Trust: US/EU (+7), privacy (+5), open source (+3).
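As a rough illustration of the arithmetic, the scoring formula can be sketched in Python. This is a minimal sketch under stated assumptions, not the site's actual implementation: the names `MAX_POINTS`, `checklist_points`, `composite_score`, and `rating` are hypothetical, and how raw Intelligence, Context, and Speed measurements are converted into points is not described here, so the sketch takes those point values as given inputs.

```python
# Maximum points per dimension, as listed in the scoring formula.
MAX_POINTS = {
    "intelligence": 35,
    "tool_use": 30,
    "context": 10,
    "trust": 15,
    "speed": 10,
}

# Checklist bonuses as listed. The itemized Tool Use bonuses add up to 25 of
# the 30 available points; the remaining 5 are not itemized in the text, so
# this sketch does not try to guess them.
TOOL_USE_BONUSES = {"tool_calling_api": 10, "mcp_support": 10, "parallel_tool_calls": 5}
TRUST_BONUSES = {"us_eu": 7, "privacy": 5, "open_source": 3}


def checklist_points(bonuses, satisfied):
    """Sum the bonus points for every checklist item the model satisfies."""
    return sum(points for name, points in bonuses.items() if name in satisfied)


def composite_score(points):
    """Add up per-dimension points, clamping each to its 0..max range.

    Dimensions missing from `points` contribute 0 (an assumption of this sketch).
    """
    total = 0.0
    for dimension, cap in MAX_POINTS.items():
        value = points.get(dimension, 0.0)
        total += min(max(value, 0.0), cap)
    return total


def rating(score):
    """Convert a 0..100 composite score into a 0..10 rating, one decimal place."""
    return round(score / 10, 1)
```

For example, `rating(86.6)` returns `8.7`, and a model that satisfies all three Trust items gets `checklist_points(TRUST_BONUSES, {"us_eu", "privacy", "open_source"}) == 15`, the full Trust allocation.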
Top pick overall
Gemini 3.1 Pro
Google · Quality score 8.7/10 · 86.6/100 pts
Pending Review
These models lack enough independently verified benchmark data for a reliable score. Categories showing 0 are zeroed until we have the data to back them up. They'll move into the main rankings once testing is complete.
Missing: AA Intelligence Index (estimated — not yet measured by Artificial Analysis) · Speed / TPS (estimated — not yet measured by Artificial Analysis) · AA-Omniscience (no data — scored as neutral 5/15)
Last updated February 2026. Intelligence scores from Artificial Analysis. Speed from AA speed leaderboard. See how we rate for full methodology.