nowJobs market snapshot refreshed nowRecomputed benchmark-weighted quality scores nowSynced Chatbot Arena benchmark track nowUpdated speed measurements nowPulled latest OpenRouter price index nowValidated official pricing snapshots 25 MayOpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform 25 MayPublished the 2026-05-25 daily digest 25 MayWorkbench Launches Open Source BullMQ Dashboard For Node Backends 24 MaySpecBench Tests Reward Hacking In Long Horizon Coding Agents nowJobs market snapshot refreshed nowRecomputed benchmark-weighted quality scores nowSynced Chatbot Arena benchmark track nowUpdated speed measurements nowPulled latest OpenRouter price index nowValidated official pricing snapshots 25 MayOpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform 25 MayPublished the 2026-05-25 daily digest 25 MayWorkbench Launches Open Source BullMQ Dashboard For Node Backends 24 MaySpecBench Tests Reward Hacking In Long Horizon Coding Agents

Best AI Models for Legal

9 models ranked by legal benchmark performance on LegalBench — covering issue-spotting, rule-recall, legal interpretation, and rhetorical analysis.

Best Overall

GPT-5.2

OpenAI · Avg: 88.0

Best Value

Llama 4 Maverick

Meta · $0.15/M in

Best Open Source

DeepSeek R1

DeepSeek · Avg: 76.0

#	Model	Legal Avg	LegalBench	Quality	Price
1	GPT-5.2 OpenAI	88.0	88.0	90.0	$1.75
2	Claude Opus 4.6 Anthropic	86.0	86.0	89.0	$15.00
3	O3 OpenAI	85.0	85.0	88.0	$2.00
4	Gemini 2.5 Pro Google	84.0	84.0	83.0	$1.25
5	Claude Opus 4 Anthropic	82.0	82.0	84.0	$15.00
6	Claude Sonnet 4 Anthropic	79.0	79.0	79.0	$3.00
7	GPT-4o OpenAI	78.0	78.0	75.0	$2.50
8	DeepSeek R1 OSS DeepSeek	76.0	76.0	85.0	$0.70
9	Llama 4 Maverick OSS Meta	72.0	72.0	76.0	$0.15

About Legal AI Benchmarks

LegalBench evaluates AI across 162 legal reasoning tasks spanning six categories: issue-spotting, rule-recall, rule-application, rule-conclusion, interpretation, and rhetorical understanding. Scores above 85% indicate strong legal reasoning capability.

Note: AI models should not be used as a substitute for qualified legal counsel. Benchmark scores measure reasoning ability, not legal advice quality.

Other Notable Models

These models don't have published legal benchmark scores yet.

GPT-5.2 Pro

OpenAI · Quality: 93

GPT-5 Pro

OpenAI · Quality: 90

O4 Mini

OpenAI · Quality: 90

O3 Pro

OpenAI · Quality: 88

GPT-5

OpenAI · Quality: 87

Qwen3 235B A22B

Alibaba · Quality: 87

Claude Opus 4.5

Anthropic · Quality: 86

Claude Sonnet 4.6

Anthropic · Quality: 86

Qwen3 Max

Alibaba · Quality: 85

OpenAI · Quality: 84

View full leaderboard → Healthcare AI → Finance AI →