Anthropic
Comparing 10 models from Anthropic.
| Model | Total | Released | Context | Size | Creative writing | Rule following | Utility | Mathematics | Tooling | Language | Logic |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Claude Opus 4.6 | 88.14% | Feb 4, 26 | 1m | – | 71.27% | 84.05% | 92.83% | 100.00% | 95.38% | 91.00% | 80.63% |
| Claude Opus 4.5 | 84.41% | Nov 24, 25 | 200k | – | 68.57% | 84.22% | 84.50% | 100.00% | 87.31% | 93.75% | 81.25% |
| Claude Sonnet 4 | 83.20% | May 22, 25 | 200k | – | 43.05% | 68.89% | 95.00% | 100.00% | 92.31% | 87.30% | 93.75% |
| Claude 3.5 Sonnet (new) | 80.29% | Oct 22, 24 | 200k | – | 38.88% | 66.81% | 91.83% | 100.00% | 92.69% | 84.14% | 85.00% |
| Claude Opus 4 | 80.23% | May 22, 25 | 200k | – | 54.27% | 74.82% | 85.67% | 100.00% | 80.00% | 87.81% | 87.50% |
| Claude Sonnet 4.5 | 78.78% | Sep 29, 25 | 1m | – | 45.47% | 73.70% | 85.00% | 100.00% | 83.08% | 86.13% | 81.25% |
| Claude 3.5 Haiku | 73.73% | Oct 22, 24 | 200k | – | 38.63% | 60.94% | 82.33% | 100.00% | 82.31% | 77.49% | 87.50% |
| Claude Haiku 4.5 | 72.19% | Oct 15, 25 | 200k | – | 49.44% | 59.16% | 80.00% | 100.00% | 72.31% | 85.84% | 81.25% |
| Claude 3.7 Sonnet | 71.47% | Feb 19, 25 | 200k | – | 45.49% | 58.49% | 80.67% | 100.00% | 66.92% | 88.92% | 81.25% |
| Claude 3 Haiku | 59.75% | Mar 13, 24 | 200k | – | 36.00% | 50.47% | 64.67% | 95.00% | 53.46% | 72.18% | 80.63% |
Model Performance
Cost vs Performance
Compares total benchmark cost against overall score for Anthropic models. Quadrant lines are drawn at the median cost and the median score.
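
The median-based quadrant split described above can be sketched as follows. This is a minimal illustration, not the site's actual charting code: the function name `quadrants`, the model names, and every cost and score value in `sample` are hypothetical placeholders, since the page lists no cost figures. Treating a point that sits exactly on a median line as "low cost" or "high score" is also an assumption.

```python
from statistics import median


def quadrants(points):
    """Classify (name, cost, score) tuples into quadrants split at the medians.

    Returns a dict mapping each name to a label such as "low cost, high score".
    """
    cost_mid = median(cost for _, cost, _ in points)
    score_mid = median(score for _, _, score in points)

    labels = {}
    for name, cost, score in points:
        # Tie-breaking on the median lines is an assumption of this sketch.
        cost_side = "low cost" if cost <= cost_mid else "high cost"
        score_side = "high score" if score >= score_mid else "low score"
        labels[name] = f"{cost_side}, {score_side}"
    return labels


# Hypothetical placeholder data: names, costs, and scores are made up.
sample = [
    ("model-a", 10.0, 90.0),
    ("model-b", 4.0, 80.0),
    ("model-c", 1.0, 70.0),
    ("model-d", 0.5, 60.0),
]

result = quadrants(sample)
for name, label in result.items():
    print(f"{name}: {label}")
```

With an even number of points, `statistics.median` averages the two middle values, so the dividing lines here fall at a cost of 2.5 and a score of 75.0.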