Last updated: April 5, 2026 · Reviewed by Daniel Ashford

🤖 Claude Sonnet 4 — Review & Scores

by Anthropic · Rank #4 · Performance

Best price-to-performance ratio. Nearly Opus-level quality at 80% lower cost. The production workhorse.

LLM JUDGE INDEX™
93.2
+1.8/week

Evaluation Scores

🎯 Accuracy93
🧠 Reasoning92
🛡️ Safety96
💻 Coding91
Creativity93
📋 Instruction94

Specifications & Pricing

Input
$3/M
Output
$15/M
Context
200K
Latency
1.2s
Arena
#5
Index
93.2
DA
Daniel Ashford
Founder & Lead Evaluator · 200+ models evaluated
Try on Anthropic →
Best For
Production apps
Content creation
Analysis
Compare
vs Claude Opus 4
vs GPT-5.3 Codex
vs Gemini 2.5 Ultra
vs GPT-4o
SPONSORED

Evaluate Claude Sonnet 4 on your production data.

Try Evidently Free →