Ranking desk
AI Leaderboard Desk
This page now separates two different questions. Frontier now answers what is current across the major labs. Evaluated composite answers which models are currently strongest inside the benchmark-backed scored set.
Important
The old “composite leaderboard” confusion came from treating one scored table as the answer to everything. The frontier lane keeps newer launches visible immediately, while the evaluated composite only ranks models once there is enough public evidence to score them with some confidence.
| Model | State | Price |
|---|---|---|
| GPT-5.4 OpenAI / 1.1M context | tracking tracking | $2.50 / $15.00 |
| Claude Opus 4.6 Anthropic / 1.0M context | scored active | $15.00 / $75.00 |
| Gemini 3.1 Pro Google / 1.0M context | partial tracking | $2.00 / $12.00 |
| Claude Sonnet 4.6 Anthropic / 1.0M context | scored active | $3.00 / $15.00 |
| Grok 4.20 xAI / 2.0M context | tracking tracking | $2.00 / $6.00 |
| Qwen 3.6 Plus Alibaba / 1.0M context | tracking tracking | $0.33 / $1.95 |
| MiniMax M2.7 MiniMax / 197K context | tracking tracking | $0.30 / $1.20 |
| GLM-5 Zhipu AI / 80K context | tracking tracking | $0.72 / $2.30 |
| Kimi K2.5 Moonshot AI / 131K context | tracking tracking | $0.57 / $2.30 |
| Gemma 4 Google / 262K context | tracking tracking Open | $0.13 / $0.38 |
| DeepSeek R1 DeepSeek / 64K context | scored active Open | $0.70 / $2.50 |
| DeepSeek V3.2 DeepSeek / 164K context | partial active Open | $0.20 / $0.77 |
| Kimi K2.6 Moonshot AI / 262K context | tracking tracking | $0.60 / $2.80 |
| Claude Opus 4.7 Anthropic / 1.0M context | tracking tracking | $5.00 / $25.00 |
| Claude Mythos Preview Anthropic / 0K context | tracking preview | Included |
| Claude Opus 4.6 (Fast) Anthropic / 1.0M context | tracking tracking | $30.00 / $150.00 |
| GLM 5.1 Zhipu AI / 203K context | tracking tracking | $0.70 / $4.40 |
| GLM 5V Turbo Zhipu AI / 203K context | tracking tracking | $1.20 / $4.00 |
The frontier lane is intentionally not a synthetic score. It keeps the current flagship and launch watchlist visible even when benchmark coverage is still catching up.
Evaluated composite now uses a weighted blend of normalized benchmark results, the existing quality layer, and a freshness signal. It also penalizes thin evidence, stale provider generations, and beta or compact variants so older benchmark saturation does not dominate the story.
Explore the field
Search and filter the full ranked dataset
The top tables are quick reads. This table is for actual decision-making when you want a specific lab, use case, release window, or pricing posture.
| Model | Composite | Coverage | Best for | Price |
|---|---|---|---|---|
| GPT-5.2 OpenAI / 400K context | 68.0 | 53% | Chat API | $1.75 / $14.00 |
| Claude Opus 4.6 Anthropic / 1.0M context | 65.3 | 45% | Chat API | $15.00 / $75.00 |
| Claude Sonnet 4.6 Anthropic / 1.0M context | 63.3 | 28% | Chat API | $3.00 / $15.00 |
| Llama 4 Maverick Meta / 1.0M context | 55.1 | 28% | Coding Open API | $0.15 / $0.60 |
| DeepSeek V3.2 DeepSeek / 164K context | 48.2 | 17% | Coding Open API | $0.20 / $0.77 |
| Gemini 3.1 Pro Google / 1.0M context | 44.7 | 15% | Chat API | $2.00 / $12.00 |
| O3 OpenAI / 200K context | 41.4 | 68% | General use API | $2.00 / $8.00 |
| GPT-5 OpenAI / 400K context | 41.1 | 23% | Chat API | $1.25 / $10.00 |
| Gemini 2.5 Pro Google / 1.0M context | 39.4 | 68% | General use API | $1.25 / $10.00 |
| Grok 4 xAI / 256K context | 39.0 | 46% | Chat API | $3.00 / $15.00 |
| DeepSeek R1 DeepSeek / 64K context | 38.0 | 68% | General use Open API | $0.70 / $2.50 |
| Claude Opus 4 Anthropic / 200K context | 37.4 | 39% | General use API | $15.00 / $75.00 |
| Claude Sonnet 4 Anthropic / 1.0M context | 37.1 | 43% | General use API | $3.00 / $15.00 |
| O3 Pro OpenAI / 200K context | 34.9 | 37% | Reasoning API | $20.00 / $80.00 |
| GPT-4.1 OpenAI / 1.0M context | 32.9 | 39% | General use API | $2.00 / $8.00 |
| DeepSeek V3 DeepSeek / 164K context | 31.0 | 31% | Coding Open API | $0.32 / $0.89 |
| GPT-4o (2024-05-13) OpenAI / 128K context | 30.4 | 35% | Coding API | $5.00 / $15.00 |
| Grok 3 Beta xAI / 131K context | 30.3 | 38% | Coding API | $3.00 / $15.00 |
| QwQ 32B Alibaba / 131K context | 29.8 | 26% | Coding Open API | $0.15 / $0.58 |
| Qwen3 235B A22B Alibaba / 131K context | 29.4 | 21% | Chat Open API | $0.46 / $1.82 |
| O4 Mini OpenAI / 200K context | 27.6 | 26% | Coding API | $1.10 / $4.40 |
| Qwen2.5 72B Instruct Alibaba / 33K context | 27.6 | 31% | Coding Open API | $0.12 / $0.39 |
| Gemini 2.5 Flash Google / 1.0M context | 27.3 | 31% | Coding API | $0.30 / $2.50 |
| Command A Cohere / 256K context | 25.6 | 8% | Chat Open API | $2.50 / $10.00 |
| Llama 3.3 70B Instruct Meta / 131K context | 24.4 | 8% | Chat Open API | $0.12 / $0.38 |
| Mistral Large Mistral / 128K context | 21.8 | 8% | Chat Open API | $2.00 / $6.00 |
| Mistral Small 3.1 24B Mistral / 128K context | 18.6 | 8% | Chat Open API | $0.35 / $0.56 |
| Gemini 2.0 Flash Google / 1.0M context | 16.3 | 8% | Chat API | $0.10 / $0.40 |
| Claude 3.5 Haiku Anthropic / 200K context | 15.2 | 8% | Chat API | $0.80 / $4.00 |
| GPT-4o-mini (2024-07-18) OpenAI / 128K context | 14.7 | 8% | Chat API | $0.15 / $0.60 |
| Command A Reasoning Cohere / 256K context | 12.9 | 0% | Frontier tracking Open API | $2.50 / $10.00 |
| GPT-5.2 Pro OpenAI / 400K context | 12.8 | 0% | Frontier tracking API | $21.00 / $168.00 |
| Nova 2 Lite Amazon / 1.0M context | 11.4 | 0% | Frontier tracking API | $0.30 / $2.50 |
| Claude Opus 4.5 Anthropic / 200K context | 10.5 | 0% | Frontier tracking API | $5.00 / $25.00 |
| Gemini 3 Pro Google / 1.0M context | 10.5 | 0% | Frontier tracking API | $2.00 / $12.00 |
| Llama 4 Scout Meta / 328K context | 10.2 | 0% | Frontier tracking Open API | $0.08 / $0.30 |
| Grok 4.1 Fast xAI / 2.0M context | 10.0 | 0% | Frontier tracking API | $0.20 / $0.50 |
| GPT-5 Pro OpenAI / 400K context | 9.7 | 0% | Frontier tracking API | $15.00 / $120.00 |
| Claude Sonnet 4.5 Anthropic / 1.0M context | 9.6 | 0% | Frontier tracking API | $3.00 / $15.00 |
| Qwen3 Max Alibaba / 262K context | 9.4 | 0% | Frontier tracking Open API | $0.78 / $3.90 |
| Qwen3 Coder 480B A35B Alibaba / 262K context | 9.3 | 0% | Frontier tracking Open API | $0.22 / $1.00 |
| Grok 4 Fast xAI / 2.0M context | 9.1 | 0% | Frontier tracking API | $0.20 / $0.50 |
| Gemini 3 Flash Preview Google / 1.0M context | 8.7 | 0% | Frontier tracking API | $0.50 / $3.00 |
| R1 0528 DeepSeek / 164K context | 8.7 | 0% | Frontier tracking Open API | $0.50 / $2.15 |
| Claude Haiku 4.5 Anthropic / 200K context | 7.6 | 0% | Frontier tracking API | $1.00 / $5.00 |
| GPT-5 Nano OpenAI / 400K context | 7.3 | 0% | Frontier tracking API | $0.05 / $0.40 |
| Mistral Medium 3 Mistral / 131K context | 6.4 | 0% | Frontier tracking Open API | $0.40 / $2.00 |
| Pixtral Large 2411 Mistral / 131K context | 5.7 | 0% | Frontier tracking API | $2.00 / $6.00 |
| Nova Pro 1.0 Amazon / 300K context | 5.7 | 0% | Frontier tracking API | $0.80 / $3.20 |
| Codestral 2508 Mistral / 256K context | 5.6 | 0% | Frontier tracking API | $0.30 / $0.90 |
| MiniMax-01 MiniMax / 1.0M context | 5.6 | 0% | Frontier tracking API | $0.20 / $1.10 |
| Nova Premier 1.0 Amazon / 1.0M context | 5.6 | 0% | Frontier tracking API | $2.50 / $12.50 |
| Sonar Pro Perplexity / 200K context | 5.6 | 0% | Frontier tracking API | $3.00 / $15.00 |
| Sonar Perplexity / 127K context | 5.5 | 0% | Frontier tracking API | $1.00 / $1.00 |
| Gemini 2.5 Flash Lite Google / 1.0M context | 5.2 | 0% | Frontier tracking API | $0.10 / $0.40 |
| Nova Micro 1.0 Amazon / 128K context | 5.2 | 0% | Frontier tracking API | $0.04 / $0.14 |
| Reka Flash 3 Reka / 66K context | 5.1 | 0% | Frontier tracking API | $0.10 / $0.20 |
| Llama 3.1 405B Meta / 131K context | 5.0 | 0% | Frontier tracking Open API | $3.00 / $3.00 |
| Command R+ (08-2024) Cohere / 128K context | 5.0 | 0% | Frontier tracking Open API | $2.50 / $10.00 |
| o3 Mini OpenAI / 200K context | 4.9 | 0% | Frontier tracking API | $1.10 / $4.40 |
| Reka Core Reka / 128K context | 4.8 | 0% | Frontier tracking API | $3.00 / $15.00 |
| GPT-4.1 Mini OpenAI / 1.0M context | 4.7 | 0% | Frontier tracking API | $0.40 / $1.60 |
| Jamba 1.5 Large AI21 Labs / 256K context | 4.7 | 0% | Frontier tracking API | $2.00 / $8.00 |
| Command R (08-2024) Cohere / 128K context | 4.7 | 0% | Frontier tracking Open API | $0.15 / $0.60 |
| Gemini 2.0 Flash Lite Google / 1.0M context | 4.6 | 0% | Frontier tracking API | $0.07 / $0.30 |
| Mistral Nemo Mistral / 131K context | 4.6 | 0% | Frontier tracking Open API | $0.02 / $0.04 |
| GPT-4.1 Nano OpenAI / 1.0M context | 4.5 | 0% | Frontier tracking API | $0.10 / $0.40 |
| Nova Lite 1.0 Amazon / 300K context | 4.4 | 0% | Frontier tracking API | $0.06 / $0.24 |
| Command R7B (12-2024) Cohere / 128K context | 4.3 | 0% | Frontier tracking Open API | $0.04 / $0.15 |
| Grok 3 Mini Beta xAI / 131K context | 4.2 | 0% | Frontier tracking API | $0.30 / $0.50 |
| Jamba 1.5 Mini AI21 Labs / 256K context | 3.6 | 0% | Frontier tracking API | $0.20 / $0.40 |