Microsoft
Comparing 4 models from Microsoft.
| Model | Total | Released | Context (tokens) | Size (parameters) | Creative writing | Rule following | Utility | Mathematics | Tooling | Language | Logic |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Phi-3.5 Mini 128k | 55.49% | Aug 17, 24 | 128k | 3.8B | 18.74% | 44.81% | 61.44% | 80.00% | 55.64% | 59.00% | 88.75% |
| WizardLM 2 8x22b | 52.87% | Apr 15, 24 | 65k | 8x22B | 8.45% | 39.41% | 62.06% | 50.00% | 59.36% | 73.07% | 63.13% |
| Phi-3 Mini 128k | 49.87% | Jul 1, 24 | 128k | 3.8B | 21.40% | 35.96% | 54.89% | 85.00% | 42.44% | 58.12% | 94.38% |
| Phi-3 Medium 128k | 43.98% | Apr 21, 24 | 128k | 14B | 17.53% | 37.23% | 46.94% | 50.00% | 30.26% | 62.92% | 75.63% |
Model Performance
Cost vs Performance
Plots each Microsoft model's total benchmark cost against its overall score. Quadrant lines are drawn at the median cost and the median score.
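As a rough illustration of how median quadrant lines split such a chart, the sketch below computes the median of the Total column from the table and classifies a point relative to a pair of medians. The helper names `median_lines` and `quadrant` are hypothetical, the tie-handling rule (points on a line count as lower cost or higher score) is an assumption, and per-model cost figures are not listed on this page, so they must be supplied by the caller.

```python
from statistics import median

# Overall scores (Total column) from the table, in percent.
TOTAL_SCORES = {
    "Phi-3.5 Mini 128k": 55.49,
    "WizardLM 2 8x22b": 52.87,
    "Phi-3 Mini 128k": 49.87,
    "Phi-3 Medium 128k": 43.98,
}


def median_lines(costs, scores):
    """Return (cost_median, score_median), the positions of the two quadrant lines."""
    return median(costs), median(scores)


def quadrant(cost, score, cost_median, score_median):
    """Label the quadrant a (cost, score) point falls in.

    Assumption: a point lying exactly on a line is treated as
    lower cost or higher score; the page does not say how ties are drawn.
    """
    cost_side = "lower cost" if cost <= cost_median else "higher cost"
    score_side = "higher score" if score >= score_median else "lower score"
    return f"{cost_side}, {score_side}"


# Horizontal quadrant line for the four models listed in the table.
score_median = median(TOTAL_SCORES.values())
```

With four models the median is the mean of the two middle scores, so the score line sits at about 51.37%, between WizardLM 2 8x22b and Phi-3 Mini 128k. The cost line would be found the same way once cost values are available.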