Sao10K
Comparing 3 models from Sao10K.
| Model | Total ▼ | Released | Context | Size | Creative writing | Rule following | Utility | Mathematics | Tooling | Language | Logic |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sao10K L3.1 70B Hanami x1 | 60.77% | Sep 9, 24 | 16k | 70B | 27.79% | 53.98% | 68.67% | 100.00% | 53.08% | 72.63% | 73.75% |
| Llama 3.1 Euryale 70B v2.2 | 55.08% | Aug 26, 24 | 8k | 70B | 22.95% | 48.04% | 62.89% | 75.00% | 49.36% | 63.68% | 71.25% |
| Llama 3 Euryale 70B v2.1 | 51.22% | Jun 14, 24 | 16k | 70B | 17.87% | 40.19% | 59.17% | 80.00% | 44.62% | 60.17% | 79.38% |
Model Performance
Cost vs Performance
Compares total benchmark cost against overall score for Sao10K models. Quadrant lines are drawn at the median values.