Vendors

Model creators/vendors and how their models compare across the benchmark.

Vendor Models Avg Score Best Score ▼Best Model
Anthropic1690.10%95.06%Claude Opus 4.6 (Reasoning)
Qwen1686.77%94.55%Qwen3.7 Max
Google2184.62%94.08%Gemini 3.1 Pro (Preview)
OpenAI2784.73%93.85%GPT-5.4 (Reasoning)
Z.AI988.14%93.74%Z.AI GLM 5.1
MoonshotAI291.71%92.57%MoonshotAI: Kimi K2.6
xAI485.27%90.99%Grok 4.3 (Reasoning)
minimax387.80%90.45%MiniMax M3
bytedance-seed482.62%89.59%ByteDance Seed 1.6
DeepSeek983.63%89.28%DeepSeek V4 Pro (Reasoning)
aion-labs186.66%86.66%Aion 2.0
xiaomi285.00%86.05%Xiaomi MIMO v2.5 Pro
Mistral AI1272.18%84.29%Mistral Large 3
inception181.99%81.99%Inception Mercury 2
NVIDIA376.97%81.69%Nemotron 3 Super
Nous Research275.27%80.80%Hermes 3 405B
Writer178.11%78.11%Writer: Palmyra X5
Meta177.41%77.41%Llama 3.1 70B
TheDrummer172.68%72.68%Cydonia 24B V4.1
Microsoft171.45%71.45%WizardLM 2 8x22b
arcee-ai167.68%67.68%Arcee AI: Trinity Mini
Cohere167.04%67.04%Cohere Command R+ (Aug. 2024)