Friend got new kittens (German)

Language Comprehension

Does the model understand more than just English?

Performance Score Distribution (Top 20)

Click a model name to view its detail page.

	Score
Claude Opus 4.6 (Reasoning)	100%
Qwen3.7 Max	100%
Qwen3.6 Max Preview	100%
GPT-5.5 (Reasoning)	100%
Claude Sonnet 4.6 (Reasoning)	100%
Z.AI GLM 5.2 (Reasoning, High)	100%
Z.AI GLM 5 Turbo	100%
MoonshotAI: Kimi K2.6	100%
Claude Opus 4.7 (Reasoning)	100%
GPT-5.5 (Reasoning, Low)	100%
Claude Opus 4.6	100%
GPT-5 Mini	100%
Qwen 3.5 397B A17B	100%
Grok 4.20 (Beta, Reasoning)	100%
MoonshotAI: Kimi K2.5	100%
Claude Sonnet 4.6	100%
MiniMax M3	100%
Qwen 3.5 122B	100%
Qwen 3.5 27B	100%
GPT-5.4 Mini (Reasoning)	100%

	Score	Cost	Time
Inception Mercury	100%	$0.0000	544ms
Ministral 8B	80%	$0.0000	733ms
Ministral 3 3B	100%	$0.0000	995ms
Stealth: Aurora Alpha	100%	—	914ms
Ministral 3 8B	100%	$0.0000	1.0s
Llama 3.1 8B	80%	$0.0000	1.1s
Mistral NeMO	100%	$0.0000	1.2s
Ministral 3 14B	100%	$0.0000	1.6s
Mistral Small 4	100%	$0.0001	1.2s
Gemini 3.1 Flash Lite (Reasoning)	80%	$0.0001	827ms
Gemini 3.1 Flash Lite	60%	$0.0001	725ms
Mistral Small Creative	100%	$0.0001	1.2s
Gemini 3.1 Flash Lite (Preview)	80%	$0.0001	841ms
Arcee AI: Trinity Large (Preview)	100%	$0.0000	3.0s
GPT-5.4 Nano	100%	$0.0001	989ms
Mistral Small 3.2 24B	100%	$0.0001	2.3s
Cydonia 24B V4.1	100%	$0.0001	2.0s
GPT-4.1 Nano	100%	$0.0000	2.4s
Stealth: Healer Alpha	100%	$0.0000	3.8s
LFM2 24B	100%	$0.0000	3.9s

	Score	Cost	Speed	Stability
Inception Mercury	100%	$0.0000	544ms	100%
Stealth: Aurora Alpha	100%	—	914ms	100%
Ministral 3 3B	100%	$0.0000	995ms	100%
Ministral 3 8B	100%	$0.0000	1.0s	100%
Mistral NeMO	100%	$0.0000	1.2s	100%
GPT-5.4 Nano	100%	$0.0001	989ms	100%
Mistral Small 4	100%	$0.0001	1.2s	100%
Mistral Small Creative	100%	$0.0001	1.2s	100%
Ministral 3 14B	100%	$0.0000	1.6s	100%
Cydonia 24B V4.1	100%	$0.0001	2.0s	100%
Mistral Small 3.2 24B	100%	$0.0001	2.3s	100%
GPT-4.1 Nano	100%	$0.0000	2.4s	100%
GPT-4.1 Mini	100%	$0.0001	2.0s	100%
Mistral Medium 3.1	100%	$0.0003	1.6s	100%
Arcee AI: Trinity Large (Preview)	100%	$0.0000	3.0s	100%
Skyfall 36B V2	100%	$0.0002	2.4s	100%
Hermes 3 70B	100%	$0.0001	3.1s	100%
Stealth: Healer Alpha	100%	$0.0000	3.8s	100%
LFM2 24B	100%	$0.0000	3.9s	100%
Mistral Large 3	100%	$0.0002	2.7s	100%

Language Comprehension

Friend got new kittens (German)

Performance Score Distribution (Top 20)

Price-Performance Score Distribution (Top 20)

Most Stable Models (Top 20)

Top Overall Models (Top 20)

Rank	Model	Avg. Cost	Avg. Time	Stability	# 1	# 2	# 3	# 4	# 5	Total
70	Claude Opus 4.6 (Reasoning)	$0.0072	6.8s	100%	100	100	100	100	100	100%
88	Qwen3.7 Max	$0.011	26.4s	100%	100	100	100	100	100	100%
80	Qwen3.6 Max Preview	$0.0088	37.2s	100%	100	100	100	100	100	100%
54	GPT-5.5 (Reasoning)	$0.0036	5.2s	100%	100	100	100	100	100	100%
69	Claude Sonnet 4.6 (Reasoning)	$0.0062	9.7s	100%	100	100	100	100	100	100%
62	Z.AI GLM 5.2 (Reasoning, High)	$0.0033	15.0s	100%	100	100	100	100	100	100%
63	Z.AI GLM 5 Turbo	$0.0040	12.1s	100%	100	100	100	100	100	100%
103	MoonshotAI: Kimi K2.6	$0.0057	1.2m	100%	100	100	100	100	100	100%
64	Claude Opus 4.7 (Reasoning)	$0.0057	3.7s	100%	100	100	100	100	100	100%
52	GPT-5.5 (Reasoning, Low)	$0.0033	4.1s	100%	100	100	100	100	100	100%
47	Claude Opus 4.6	$0.0025	3.9s	100%	100	100	100	100	100	100%
49	GPT-5 Mini	$0.0015	10.9s	100%	100	100	100	100	100	100%
123	Qwen 3.5 397B A17B	$0.0099	1.3m	100%	100	100	100	100	100	100%
68	Grok 4.20 (Beta, Reasoning)	$0.0067	4.9s	100%	100	100	100	100	100	100%
56	MoonshotAI: Kimi K2.5	$0.0021	14.3s	100%	100	100	100	100	100	100%
42	Claude Sonnet 4.6	$0.0017	3.0s	100%	100	100	100	100	100	100%
58	MiniMax M3	$0.0009	22.8s	100%	100	100	100	100	100	100%
65	Qwen 3.5 122B	$0.0051	12.3s	100%	100	100	100	100	100	100%
72	Qwen 3.5 27B	$0.0048	23.1s	100%	100	100	100	100	100	100%
26	GPT-5.4 Mini (Reasoning)	$0.0006	2.6s	100%	100	100	100	100	100	100%
61	Qwen 3.5 Plus (2026-04-20)	$0.0026	17.2s	100%	100	100	100	100	100	100%
50	Claude Opus 4.5	$0.0030	3.9s	100%	100	100	100	100	100	100%
59	ByteDance Seed 1.6	$0.0017	21.1s	100%	100	100	100	100	100	100%
35	Grok 4.1 Fast	$0.0003	6.0s	100%	100	100	100	100	100	100%
46	Qwen 3.6 Flash	$0.0018	7.5s	100%	100	100	100	100	100	100%
75	DeepSeek V4 Pro (Reasoning)	$0.0036	54.9s	100%	100	100	100	100	100	100%
91	Grok 4	$0.013	21.3s	100%	100	100	100	100	100	100%
66	DeepSeek V4 Flash (Reasoning)	$0.0002	39.1s	100%	100	100	100	100	100	100%
73	Z.AI GLM 4.6	$0.0026	50.1s	100%	100	100	100	100	100	100%
51	Stealth: Hunter Alpha	$0.0000	20.5s	100%	100	100	100	100	100	100%
74	Claude Opus 4	$0.012	11.1s	100%	100	100	100	100	100	100%
71	Qwen 3.5 35B	$0.0052	18.7s	100%	100	100	100	100	100	100%
44	MiniMax M2.5	$0.0008	10.7s	100%	100	100	100	100	100	100%
53	Aion 2.0	$0.0012	17.4s	100%	100	100	100	100	100	100%
48	MiniMax M2.7	$0.0009	13.2s	100%	100	100	100	100	100	100%
27	Qwen 3.5 Plus (2026-02-15)	$0.0003	4.4s	100%	100	100	100	100	100	100%
25	Grok 4 Fast	$0.0003	3.5s	100%	100	100	100	100	100	100%
18	Stealth: Healer Alpha	$0.0000	3.8s	100%	100	100	100	100	100	100%
55	Qwen 3.5 Flash	$0.0010	19.5s	100%	100	100	100	100	100	100%
57	Z.AI GLM 4.5	$0.0015	18.3s	100%	100	100	100	100	100	100%
39	GPT-OSS 120B	$0.0002	9.6s	100%	100	100	100	100	100	100%
37	GPT-4o, May 13th (temp=0)	$0.0014	2.1s	100%	100	100	100	100	100	100%
20	Mistral Large 3	$0.0002	2.7s	100%	100	100	100	100	100	100%
67	ByteDance Seed 2.0 Lite	$0.0023	28.3s	100%	100	100	100	100	100	100%
31	DeepSeek-V2 Chat	$0.0000	6.5s	100%	100	100	100	100	100	100%
2	Stealth: Aurora Alpha	—	914ms	100%	100	100	100	100	100	100%
45	Claude 3.7 Sonnet	$0.0022	3.4s	100%	100	100	100	100	100	100%
24	Claude Haiku 4.5	$0.0005	1.9s	100%	100	100	100	100	100	100%
41	GPT-4o, May 13th (temp=1)	$0.0016	2.5s	100%	100	100	100	100	100	100%
32	DeepSeek V3 (2024-12-26)	$0.0002	5.5s	100%	100	100	100	100	100	100%
23	GPT-4o, Aug. 6th (temp=0)	$0.0006	1.7s	100%	100	100	100	100	100	100%
60	Nemotron 3 Super	$0.0000	30.8s	100%	100	100	100	100	100	100%
34	Mistral Large 2	$0.0007	3.3s	100%	100	100	100	100	100	100%
13	GPT-4.1 Mini	$0.0001	2.0s	100%	100	100	100	100	100	100%
22	GPT-4o, Aug. 6th (temp=1)	$0.0006	1.3s	100%	100	100	100	100	100	100%
43	Hermes 3 405B	$0.0000	13.4s	100%	100	100	100	100	100	100%
30	DeepSeek V3 (2025-03-24)	$0.0002	5.6s	100%	100	100	100	100	100	100%
33	Mistral Large	$0.0008	2.5s	100%	100	100	100	100	100	100%
1	Inception Mercury	$0.0000	544ms	100%	100	100	100	100	100	100%
38	Qwen 3 32B	$0.0001	9.5s	100%	100	100	100	100	100	100%
36	Writer: Palmyra X5	$0.0009	3.6s	100%	100	100	100	100	100	100%
11	Mistral Small 3.2 24B	$0.0001	2.3s	100%	100	100	100	100	100	100%
14	Mistral Medium 3.1	$0.0003	1.6s	100%	100	100	100	100	100	100%
28	Gemma 3 27B	$0.0001	6.0s	100%	100	100	100	100	100	100%
7	Mistral Small 4	$0.0001	1.2s	100%	100	100	100	100	100	100%
15	Arcee AI: Trinity Large (Preview)	$0.0000	3.0s	100%	100	100	100	100	100	100%
8	Mistral Small Creative	$0.0001	1.2s	100%	100	100	100	100	100	100%
21	Qwen 2.5 72B	$0.0001	3.9s	100%	100	100	100	100	100	100%
10	Cydonia 24B V4.1	$0.0001	2.0s	100%	100	100	100	100	100	100%
6	GPT-5.4 Nano	$0.0001	989ms	100%	100	100	100	100	100	100%
40	WizardLM 2 8x22b	$0.0004	8.9s	100%	100	100	100	100	100	100%
9	Ministral 3 14B	$0.0000	1.6s	100%	100	100	100	100	100	100%
4	Ministral 3 8B	$0.0000	1.0s	100%	100	100	100	100	100	100%
12	GPT-4.1 Nano	$0.0000	2.4s	100%	100	100	100	100	100	100%
17	Hermes 3 70B	$0.0001	3.1s	100%	100	100	100	100	100	100%
3	Ministral 3 3B	$0.0000	995ms	100%	100	100	100	100	100	100%
5	Mistral NeMO	$0.0000	1.2s	100%	100	100	100	100	100	100%
16	Skyfall 36B V2	$0.0002	2.4s	100%	100	100	100	100	100	100%
19	LFM2 24B	$0.0000	3.9s	100%	100	100	100	100	100	100%
29	Rocinante 12B	$0.0001	6.0s	100%	100	100	100	100	100	100%
154	Gemini 3.1 Pro (Preview)	$0.015	16.6s	20%	100	100	100	100	0	80%
102	GPT-5.4 (Reasoning)	$0.0023	4.4s	20%	100	100	100	100	0	80%
125	Gemini 3.5 Flash (Reasoning)	$0.0087	5.1s	20%	100	100	100	100	0	80%
128	Claude Opus 4.8 (Reasoning)	$0.0094	6.5s	20%	100	100	100	100	0	80%
117	Claude Opus 4.8 (Reasoning, Low)	$0.0074	5.2s	20%	100	100	100	100	0	80%
121	Grok 4.3 (Reasoning)	$0.0035	28.5s	20%	100	100	100	100	0	80%
110	Grok 4.20 (Reasoning)	$0.0036	15.8s	20%	100	100	100	100	0	80%
97	GPT-5.1	$0.0013	4.5s	20%	100	100	100	100	0	80%
99	Gemini 3 Flash (Preview, Reasoning)	$0.0018	4.1s	20%	100	100	100	100	0	80%
119	Z.AI GLM 5	$0.0037	25.9s	20%	100	100	100	100	0	80%
105	Gemma 4 26B (Reasoning)	$0.0002	22.6s	20%	100	100	100	100	0	80%
150	Gemini 3 Pro (Preview)	$0.015	11.3s	20%	100	100	100	100	0	80%
100	Claude Sonnet 4.5	$0.0019	3.7s	20%	100	100	100	100	0	80%
90	GPT-4.1	$0.0007	1.8s	20%	100	100	100	100	0	80%
96	Xiaomi MIMO v2.5 Pro	$0.0009	6.0s	20%	100	100	100	100	0	80%
78	Gemini 3.1 Flash Lite (Reasoning)	$0.0001	827ms	20%	100	100	100	100	0	80%
101	Gemini 3.5 Flash (Reasoning, Minimal)	$0.0022	2.0s	20%	100	100	100	100	0	80%
92	Gemini 3 Flash (Preview)	$0.0007	2.2s	20%	100	100	100	100	0	80%
79	Gemini 3.1 Flash Lite (Preview)	$0.0001	841ms	20%	100	100	100	100	0	80%
106	Claude 3.5 Sonnet	$0.0033	7.9s	20%	100	100	100	100	0	80%
139	Qwen 3.5 9B	$0.0009	1.2m	20%	100	100	100	100	0	80%
87	GPT-5.4 Mini (Reasoning, Low)	$0.0004	2.5s	20%	100	100	100	100	0	80%
85	Grok 4.20 (Beta)	$0.0006	868ms	20%	100	100	100	100	0	80%
98	DeepSeek V3.1	$0.0001	10.8s	20%	100	100	100	100	0	80%
104	Z.AI GLM 4.7 Flash	$0.0005	20.5s	20%	100	100	100	100	0	80%
93	DeepSeek V4 Pro	$0.0002	5.8s	20%	100	100	100	100	0	80%
118	DeepSeek V4 Flash	$0.0001	45.1s	20%	100	100	100	100	0	80%
82	Inception Mercury 2	$0.0002	712ms	20%	100	100	100	100	0	80%
86	Grok 4.20	$0.0004	1.8s	20%	100	100	100	100	0	80%
107	Z.AI GLM 4.5 Air	$0.0007	23.7s	20%	100	100	100	100	0	80%
81	GPT-5.4 Mini	$0.0002	896ms	20%	100	100	100	100	0	80%
94	Qwen3 235B A22B Instruct 2507	$0.0001	6.6s	20%	100	100	100	100	0	80%
89	Grok 4.3	$0.0004	2.9s	20%	100	100	100	100	0	80%
83	Llama 3.1 70B	$0.0001	2.4s	20%	100	100	100	100	0	80%
84	Gemma 3 12B	$0.0000	3.6s	20%	100	100	100	100	0	80%
95	Llama 3.1 Nemotron 70B	$0.0001	9.0s	20%	100	100	100	100	0	80%
76	Ministral 8B	$0.0000	733ms	20%	100	100	100	100	0	80%
77	Llama 3.1 8B	$0.0000	1.1s	20%	100	100	100	100	0	80%
140	Z.AI GLM 5.1	$0.0045	24.3s	2%	100	100	100	0	0	60%
132	GPT-5	$0.0044	9.6s	2%	100	100	100	0	0	60%
113	GPT-5.2	$0.0012	3.4s	2%	100	100	100	0	0	60%
116	GPT-5.5	$0.0020	2.5s	2%	100	100	100	0	0	60%
141	Gemini 2.5 Pro	$0.0081	8.0s	2%	100	100	100	0	0	60%
131	Qwen 3.6 27B	$0.0031	16.1s	2%	100	100	100	0	0	60%
126	Qwen 3.6 35B	$0.0018	11.4s	2%	100	100	100	0	0	60%
115	Claude Sonnet 4	$0.0018	2.8s	2%	100	100	100	0	0	60%
156	ByteDance Seed 2.0 Mini	$0.0013	1.4m	2%	100	100	100	0	0	60%
108	Gemini 3.1 Flash Lite	$0.0001	725ms	2%	100	100	100	0	0	60%
112	Gemma 4 26B	$0.0001	6.1s	2%	100	100	100	0	0	60%
114	GPT-5.4 Nano (Reasoning)	$0.0002	11.2s	2%	100	100	100	0	0	60%
109	GPT-5.4 Nano (Reasoning, Low)	$0.0001	2.3s	2%	100	100	100	0	0	60%
120	Nemotron 3 Nano	$0.0002	14.0s	2%	100	100	100	0	0	60%
111	Arcee AI: Trinity Mini	$0.0001	3.2s	2%	100	100	100	0	0	60%
130	GPT-5.4 (Reasoning, Low)	$0.0013	3.2s	0%	100	100	0	0	0	40%
145	Claude Opus 4.7	$0.0065	4.3s	0%	100	100	0	0	0	40%
151	Z.AI GLM 4.7	$0.0023	31.3s	0%	100	100	0	0	0	40%
133	Gemini 2.5 Flash (Reasoning)	$0.0023	4.8s	0%	100	100	0	0	0	40%
129	Xiaomi MIMO v2.5	$0.0007	4.8s	0%	100	100	0	0	0	40%
124	Gemini 2.5 Flash	$0.0002	773ms	0%	100	100	0	0	0	40%
122	Gemini 2.5 Flash Lite	$0.0000	631ms	0%	100	100	0	0	0	40%
127	Mistral Small 4 (Reasoning)	$0.0001	2.3s	0%	100	100	0	0	0	40%
143	Gemma 4 31B (Reasoning)	$0.0002	18.4s	0%	100	0	0	0	0	20%
138	DeepSeek V3.2	$0.0001	3.0s	0%	100	0	0	0	0	20%
135	GPT-4o Mini (temp=1)	$0.0000	1.2s	0%	100	0	0	0	0	20%
137	Claude 3 Haiku	$0.0002	1.5s	0%	100	0	0	0	0	20%
136	Gemma 3 4B	$0.0000	1.4s	0%	100	0	0	0	0	20%
134	Ministral 3B	$0.0000	538ms	0%	100	0	0	0	0	20%
153	o4 Mini High	$0.0020	5.4s	0%	0	0	0	0	0	0%
152	o4 Mini	$0.0011	5.1s	0%	0	0	0	0	0	0%
144	Gemma 4 31B	$0.0000	2.6s	0%	0	0	0	0	0	0%
146	GPT-5.4	$0.0009	1.2s	0%	0	0	0	0	0	0%
147	Gemini 2.5 Flash Lite (Reasoning)	$0.0005	4.6s	0%	0	0	0	0	0	0%
155	GPT-5 Nano	$0.0008	21.9s	0%	0	0	0	0	0	0%
142	GPT-4o Mini (temp=0)	$0.0000	1.2s	0%	0	0	0	0	0	0%
149	ByteDance Seed 1.6 Flash	$0.0003	6.9s	0%	0	0	0	0	0	0%
148	Cohere Command R+ (Aug. 2024)	$0.0012	2.2s	0%	0	0	0	0	0	0%
79.36%