AI Leaderboard Desk

This page separates two questions. The frontier lane answers what is current across the major labs. The evaluated composite answers which models are currently strongest within the benchmark-backed scored set.

Important

The old “composite leaderboard” confusion came from treating one scored table as the answer to everything. The frontier lane keeps newer launches visible immediately, while the evaluated composite only ranks models once there is enough public evidence to score them with some confidence.

| Model | Lab | Context | State | Price |
| --- | --- | --- | --- | --- |
| GPT-5.4 | OpenAI | 1.1M | tracking tracking | $2.50 / $15.00 |
| Claude Opus 4.6 | Anthropic | 1.0M | scored active | $15.00 / $75.00 |
| Gemini 3.1 Pro | Google | 1.0M | partial tracking | $2.00 / $12.00 |
| Claude Sonnet 4.6 | Anthropic | 1.0M | scored active | $3.00 / $15.00 |
| Grok 4.20 | xAI | 2.0M | tracking tracking | $2.00 / $6.00 |
| Qwen 3.6 Plus | Alibaba | 1.0M | tracking tracking | $0.33 / $1.95 |
| MiniMax M2.7 | MiniMax | 197K | tracking tracking | $0.30 / $1.20 |
| GLM-5 | Zhipu AI | 80K | tracking tracking | $0.72 / $2.30 |
| Kimi K2.5 | Moonshot AI | 131K | tracking tracking | $0.57 / $2.30 |
| Gemma 4 | Google | 262K | tracking tracking Open | $0.13 / $0.38 |
| DeepSeek R1 | DeepSeek | 64K | scored active Open | $0.70 / $2.50 |
| DeepSeek V3.2 | DeepSeek | 164K | partial active Open | $0.20 / $0.77 |
| Kimi K2.6 | Moonshot AI | 262K | tracking tracking | $0.60 / $2.80 |
| Claude Opus 4.7 | Anthropic | 1.0M | tracking tracking | $5.00 / $25.00 |
| Claude Mythos Preview | Anthropic | 0K | tracking preview | Included |
| Claude Opus 4.6 (Fast) | Anthropic | 1.0M | tracking tracking | $30.00 / $150.00 |
| GLM 5.1 | Zhipu AI | 203K | tracking tracking | $0.70 / $4.40 |
| GLM 5V Turbo | Zhipu AI | 203K | tracking tracking | $1.20 / $4.00 |

The frontier lane is intentionally not a synthetic score. It keeps the current flagship and launch watchlist visible even when benchmark coverage is still catching up.
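As a rough illustration of how the first word in the State column (tracking, partial, scored) could follow from benchmark coverage, here is a minimal Python sketch. The page does not publish its rule: the `SCORED_MIN_COVERAGE` cutoff, the `Model` class, and the `evidence_state` function are all hypothetical, and reading coverage as a fraction of benchmarks with public results is an assumption. The listed rows only suggest that any cutoff sits somewhere between 17% (shown as partial) and 28% (shown as scored).

```python
from dataclasses import dataclass

# Hypothetical cutoff: the page does not publish one. The listed rows only
# suggest it sits somewhere between 17% (partial) and 28% (scored).
SCORED_MIN_COVERAGE = 0.25


@dataclass
class Model:
    name: str
    coverage: float  # assumed: share of benchmarks with public results, 0.0 to 1.0


def evidence_state(model: Model) -> str:
    """Return 'tracking', 'partial', or 'scored' based on coverage alone."""
    if model.coverage <= 0.0:
        return "tracking"  # visible in the frontier lane, nothing to score yet
    if model.coverage < SCORED_MIN_COVERAGE:
        return "partial"   # some evidence, below the assumed cutoff
    return "scored"        # enough evidence to rank in the evaluated composite


if __name__ == "__main__":
    # Coverage values taken from the ranked dataset on this page.
    for m in (
        Model("Claude Opus 4.6", 0.45),
        Model("Gemini 3.1 Pro", 0.15),
        Model("Claude Opus 4.5", 0.00),
    ):
        print(f"{m.name}: {evidence_state(m)}")
```

The point of the sketch is only that a model can stay visible with zero coverage; it does not need a score to appear in the frontier lane.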

Evaluated composite now uses a weighted blend of normalized benchmark results, the existing quality layer, and a freshness signal. It also penalizes thin evidence, stale provider generations, and beta or compact variants so older benchmark saturation does not dominate the story.
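The blend described above can be sketched in Python as follows. This is not the page's actual formula: every weight, penalty factor, field name, and the shape of the thin-evidence adjustment is an illustrative assumption, chosen only to show how a weighted blend and multiplicative penalties fit together.

```python
from dataclasses import dataclass

# All weights and penalty factors are illustrative assumptions;
# the page does not publish its actual values.
WEIGHTS = {"benchmarks": 0.60, "quality": 0.25, "freshness": 0.15}
STALE_GENERATION_PENALTY = 0.8  # older provider generation
VARIANT_PENALTY = 0.9           # beta or compact variant


@dataclass
class Signals:
    benchmarks: float  # mean of normalized benchmark results, 0.0 to 1.0
    quality: float     # existing quality layer, 0.0 to 1.0
    freshness: float   # 1.0 for a new release, lower for older ones
    coverage: float    # share of benchmarks with results, 0.0 to 1.0
    stale_generation: bool = False
    beta_or_compact: bool = False


def composite(s: Signals) -> float:
    """Weighted blend of the three signals, then multiplicative penalties."""
    blend = (
        WEIGHTS["benchmarks"] * s.benchmarks
        + WEIGHTS["quality"] * s.quality
        + WEIGHTS["freshness"] * s.freshness
    )
    # Thin evidence: scale the blend down as coverage drops (assumed form).
    score = blend * (0.5 + 0.5 * s.coverage)
    if s.stale_generation:
        score *= STALE_GENERATION_PENALTY
    if s.beta_or_compact:
        score *= VARIANT_PENALTY
    return round(100 * score, 1)


if __name__ == "__main__":
    current = Signals(benchmarks=0.8, quality=0.7, freshness=0.9, coverage=0.5)
    older = Signals(benchmarks=0.8, quality=0.7, freshness=0.3, coverage=0.5,
                    stale_generation=True)
    print(composite(current), composite(older))
```

With identical benchmark results, the older model scores lower because both its freshness signal and its generation penalty pull it down, which is the behavior the paragraph describes for saturated older benchmarks.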

Explore the field

Search and filter the full ranked dataset

The top tables are quick reads. This table is for actual decision-making when you want a specific lab, use case, release window, or pricing posture.

| Model | Lab | Context | Composite | Coverage | Best for | Access | Price |
| --- | --- | --- | --- | --- | --- | --- | --- |
| GPT-5.2 | OpenAI | 400K | 68.0 | 53% | Chat | API | $1.75 / $14.00 |
| Claude Opus 4.6 | Anthropic | 1.0M | 65.3 | 45% | Chat | API | $15.00 / $75.00 |
| Claude Sonnet 4.6 | Anthropic | 1.0M | 63.3 | 28% | Chat | API | $3.00 / $15.00 |
| Llama 4 Maverick | Meta | 1.0M | 55.1 | 28% | Coding | Open API | $0.15 / $0.60 |
| DeepSeek V3.2 | DeepSeek | 164K | 48.2 | 17% | Coding | Open API | $0.20 / $0.77 |
| Gemini 3.1 Pro | Google | 1.0M | 44.7 | 15% | Chat | API | $2.00 / $12.00 |
| O3 | OpenAI | 200K | 41.4 | 68% | General use | API | $2.00 / $8.00 |
| GPT-5 | OpenAI | 400K | 41.1 | 23% | Chat | API | $1.25 / $10.00 |
| Gemini 2.5 Pro | Google | 1.0M | 39.4 | 68% | General use | API | $1.25 / $10.00 |
| Grok 4 | xAI | 256K | 39.0 | 46% | Chat | API | $3.00 / $15.00 |
| DeepSeek R1 | DeepSeek | 64K | 38.0 | 68% | General use | Open API | $0.70 / $2.50 |
| Claude Opus 4 | Anthropic | 200K | 37.4 | 39% | General use | API | $15.00 / $75.00 |
| Claude Sonnet 4 | Anthropic | 1.0M | 37.1 | 43% | General use | API | $3.00 / $15.00 |
| O3 Pro | OpenAI | 200K | 34.9 | 37% | Reasoning | API | $20.00 / $80.00 |
| GPT-4.1 | OpenAI | 1.0M | 32.9 | 39% | General use | API | $2.00 / $8.00 |
| DeepSeek V3 | DeepSeek | 164K | 31.0 | 31% | Coding | Open API | $0.32 / $0.89 |
| GPT-4o (2024-05-13) | OpenAI | 128K | 30.4 | 35% | Coding | API | $5.00 / $15.00 |
| Grok 3 Beta | xAI | 131K | 30.3 | 38% | Coding | API | $3.00 / $15.00 |
| QwQ 32B | Alibaba | 131K | 29.8 | 26% | Coding | Open API | $0.15 / $0.58 |
| Qwen3 235B A22B | Alibaba | 131K | 29.4 | 21% | Chat | Open API | $0.46 / $1.82 |
| O4 Mini | OpenAI | 200K | 27.6 | 26% | Coding | API | $1.10 / $4.40 |
| Qwen2.5 72B Instruct | Alibaba | 33K | 27.6 | 31% | Coding | Open API | $0.12 / $0.39 |
| Gemini 2.5 Flash | Google | 1.0M | 27.3 | 31% | Coding | API | $0.30 / $2.50 |
| Command A | Cohere | 256K | 25.6 | 8% | Chat | Open API | $2.50 / $10.00 |
| Llama 3.3 70B Instruct | Meta | 131K | 24.4 | 8% | Chat | Open API | $0.12 / $0.38 |
| Mistral Large | Mistral | 128K | 21.8 | 8% | Chat | Open API | $2.00 / $6.00 |
| Mistral Small 3.1 24B | Mistral | 128K | 18.6 | 8% | Chat | Open API | $0.35 / $0.56 |
| Gemini 2.0 Flash | Google | 1.0M | 16.3 | 8% | Chat | API | $0.10 / $0.40 |
| Claude 3.5 Haiku | Anthropic | 200K | 15.2 | 8% | Chat | API | $0.80 / $4.00 |
| GPT-4o-mini (2024-07-18) | OpenAI | 128K | 14.7 | 8% | Chat | API | $0.15 / $0.60 |
| Command A Reasoning | Cohere | 256K | 12.9 | 0% | Frontier tracking | Open API | $2.50 / $10.00 |
| GPT-5.2 Pro | OpenAI | 400K | 12.8 | 0% | Frontier tracking | API | $21.00 / $168.00 |
| Nova 2 Lite | Amazon | 1.0M | 11.4 | 0% | Frontier tracking | API | $0.30 / $2.50 |
| Claude Opus 4.5 | Anthropic | 200K | 10.5 | 0% | Frontier tracking | API | $5.00 / $25.00 |
| Gemini 3 Pro | Google | 1.0M | 10.5 | 0% | Frontier tracking | API | $2.00 / $12.00 |
| Llama 4 Scout | Meta | 328K | 10.2 | 0% | Frontier tracking | Open API | $0.08 / $0.30 |
| Grok 4.1 Fast | xAI | 2.0M | 10.0 | 0% | Frontier tracking | API | $0.20 / $0.50 |
| GPT-5 Pro | OpenAI | 400K | 9.7 | 0% | Frontier tracking | API | $15.00 / $120.00 |
| Claude Sonnet 4.5 | Anthropic | 1.0M | 9.6 | 0% | Frontier tracking | API | $3.00 / $15.00 |
| Qwen3 Max | Alibaba | 262K | 9.4 | 0% | Frontier tracking | Open API | $0.78 / $3.90 |
| Qwen3 Coder 480B A35B | Alibaba | 262K | 9.3 | 0% | Frontier tracking | Open API | $0.22 / $1.00 |
| Grok 4 Fast | xAI | 2.0M | 9.1 | 0% | Frontier tracking | API | $0.20 / $0.50 |
| Gemini 3 Flash Preview | Google | 1.0M | 8.7 | 0% | Frontier tracking | API | $0.50 / $3.00 |
| R1 0528 | DeepSeek | 164K | 8.7 | 0% | Frontier tracking | Open API | $0.50 / $2.15 |
| Claude Haiku 4.5 | Anthropic | 200K | 7.6 | 0% | Frontier tracking | API | $1.00 / $5.00 |
| GPT-5 Nano | OpenAI | 400K | 7.3 | 0% | Frontier tracking | API | $0.05 / $0.40 |
| Mistral Medium 3 | Mistral | 131K | 6.4 | 0% | Frontier tracking | Open API | $0.40 / $2.00 |
| Pixtral Large 2411 | Mistral | 131K | 5.7 | 0% | Frontier tracking | API | $2.00 / $6.00 |
| Nova Pro 1.0 | Amazon | 300K | 5.7 | 0% | Frontier tracking | API | $0.80 / $3.20 |
| Codestral 2508 | Mistral | 256K | 5.6 | 0% | Frontier tracking | API | $0.30 / $0.90 |
| MiniMax-01 | MiniMax | 1.0M | 5.6 | 0% | Frontier tracking | API | $0.20 / $1.10 |
| Nova Premier 1.0 | Amazon | 1.0M | 5.6 | 0% | Frontier tracking | API | $2.50 / $12.50 |
| Sonar Pro | Perplexity | 200K | 5.6 | 0% | Frontier tracking | API | $3.00 / $15.00 |
| Sonar | Perplexity | 127K | 5.5 | 0% | Frontier tracking | API | $1.00 / $1.00 |
| Gemini 2.5 Flash Lite | Google | 1.0M | 5.2 | 0% | Frontier tracking | API | $0.10 / $0.40 |
| Nova Micro 1.0 | Amazon | 128K | 5.2 | 0% | Frontier tracking | API | $0.04 / $0.14 |
| Reka Flash 3 | Reka | 66K | 5.1 | 0% | Frontier tracking | API | $0.10 / $0.20 |
| Llama 3.1 405B | Meta | 131K | 5.0 | 0% | Frontier tracking | Open API | $3.00 / $3.00 |
| Command R+ (08-2024) | Cohere | 128K | 5.0 | 0% | Frontier tracking | Open API | $2.50 / $10.00 |
| o3 Mini | OpenAI | 200K | 4.9 | 0% | Frontier tracking | API | $1.10 / $4.40 |
| Reka Core | Reka | 128K | 4.8 | 0% | Frontier tracking | API | $3.00 / $15.00 |
| GPT-4.1 Mini | OpenAI | 1.0M | 4.7 | 0% | Frontier tracking | API | $0.40 / $1.60 |
| Jamba 1.5 Large | AI21 Labs | 256K | 4.7 | 0% | Frontier tracking | API | $2.00 / $8.00 |
| Command R (08-2024) | Cohere | 128K | 4.7 | 0% | Frontier tracking | Open API | $0.15 / $0.60 |
| Gemini 2.0 Flash Lite | Google | 1.0M | 4.6 | 0% | Frontier tracking | API | $0.07 / $0.30 |
| Mistral Nemo | Mistral | 131K | 4.6 | 0% | Frontier tracking | Open API | $0.02 / $0.04 |
| GPT-4.1 Nano | OpenAI | 1.0M | 4.5 | 0% | Frontier tracking | API | $0.10 / $0.40 |
| Nova Lite 1.0 | Amazon | 300K | 4.4 | 0% | Frontier tracking | API | $0.06 / $0.24 |
| Command R7B (12-2024) | Cohere | 128K | 4.3 | 0% | Frontier tracking | Open API | $0.04 / $0.15 |
| Grok 3 Mini Beta | xAI | 131K | 4.2 | 0% | Frontier tracking | API | $0.30 / $0.50 |
| Jamba 1.5 Mini | AI21 Labs | 256K | 3.6 | 0% | Frontier tracking | API | $0.20 / $0.40 |
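
For readers who copy the ranked dataset out of the page, the kind of search and filter described above could look like the following Python sketch. The `Row` fields and the `search` function are hypothetical names, not an API offered by this page. The sketch also assumes that the "Open" badge marks open-weights models and that the two price figures are separate input and output rates; the page does not state either explicitly. The sample rows are copied from the table.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Row:
    model: str
    lab: str
    composite: float
    coverage: int        # percent, as shown in the Coverage column
    best_for: str
    open_weights: bool   # assumed meaning of the "Open" badge
    price_in: float      # first price figure (assumed input rate, USD)
    price_out: float     # second price figure (assumed output rate, USD)


# A handful of rows copied from the table above.
ROWS = [
    Row("GPT-5.2", "OpenAI", 68.0, 53, "Chat", False, 1.75, 14.00),
    Row("Claude Sonnet 4.6", "Anthropic", 63.3, 28, "Chat", False, 3.00, 15.00),
    Row("Llama 4 Maverick", "Meta", 55.1, 28, "Coding", True, 0.15, 0.60),
    Row("DeepSeek V3.2", "DeepSeek", 48.2, 17, "Coding", True, 0.20, 0.77),
    Row("Qwen3 Coder 480B A35B", "Alibaba", 9.3, 0, "Frontier tracking", True, 0.22, 1.00),
]


def search(
    rows: List[Row],
    best_for: Optional[str] = None,
    max_price_out: Optional[float] = None,
    min_coverage: int = 0,
    open_only: bool = False,
) -> List[Row]:
    """Keep rows matching every given filter, highest composite first."""
    hits = [
        r for r in rows
        if (best_for is None or r.best_for == best_for)
        and (max_price_out is None or r.price_out <= max_price_out)
        and r.coverage >= min_coverage
        and (not open_only or r.open_weights)
    ]
    return sorted(hits, key=lambda r: r.composite, reverse=True)


if __name__ == "__main__":
    picks = search(ROWS, best_for="Coding", max_price_out=1.00, min_coverage=10)
    print([r.model for r in picks])  # ['Llama 4 Maverick', 'DeepSeek V3.2']
```

Setting a minimum coverage is the useful part of the filter: it excludes the 0% coverage rows, whose composite values rest on no benchmark evidence.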