Language Writing

Can the model generate text in different languages?

Price-Performance Score Distribution (Top 20)

Click a model name to view its detail page.

ScoreCostTime
Stealth: Aurora Alpha100%—2.0s
Inception Mercury96%$0.00021.5s
GPT-4.1 Nano93%$0.00014.0s
Inception Mercury 2100%$0.00061.4s
GPT-4.1 Mini99%$0.00043.4s
GPT-4o Mini (temp=0)100%$0.00034.8s
Mistral NeMO67%$0.00014.3s
GPT-4o Mini (temp=1)100%$0.00035.6s
Claude 3 Haiku81%$0.00073.8s
Arcee AI: Trinity Mini81%$0.000215.9s
Gemini 3.1 Flash Lite (Preview)95%$0.00113.7s
Nemotron 3 Nano95%$0.000210.8s
GPT-5.4 Mini97%$0.00202.5s
Nemotron 3 Super98%$0.000021.7s
Mistral Small 3.2 24B71%$0.000311.0s
DeepSeek-V2 Chat100%$0.000116.1s
GPT-5.4 Mini (Reasoning, Low)100%$0.00223.5s
GPT-5.4 Nano (Reasoning)98%$0.00166.1s
Gemini 3 Flash (Preview)100%$0.00205.6s
GPT-5.4 Nano92%$0.00176.6s
0.500.600.700.800.901.00

Cost vs Performance

Compares total cost for this test against the test score. Quadrant lines are drawn at the median values. Only models with available cost data are shown.

11 low-scoring outliers hidden: Ministral 8B (52.8%), Rocinante 12B (51.9%), Mistral Medium 3.1 (49.0%), Mistral Small 4 (48.9%), Ministral 3 8B (47.9%), Qwen3 235B A22B Instruct 2507 (46.7%), Writer: Palmyra X5 (43.2%), Ministral 3 3B (36.2%), Mistral Small Creative (33.7%), Llama 3.1 Nemotron 70B (33.6%), Ministral 3 14B (10.0%).

Most Stable Models (Top 20)

Ranked by stability (median × consistency). Click a model name to view its detail page.

ScoreConsistencyStability
Claude Sonnet 4.6100%100%100%
o4 Mini100%100%100%
Gemini 3 Flash (Preview)100%100%100%
DeepSeek-V2 Chat100%100%100%
Stealth: Aurora Alpha100%100%100%
GPT-4o, Aug. 6th (temp=0)100%100%100%
GPT-4o Mini (temp=1)100%100%100%
GPT-4o Mini (temp=0)100%100%100%
GPT-5.4 Mini (Reasoning, Low)100%99%99%
Z.AI GLM 5 Turbo100%98%98%
Z.AI GLM 4.5100%97%97%
o4 Mini High100%97%97%
Inception Mercury 2100%96%96%
Claude Opus 4.599%96%96%
GPT-5 Nano99%96%96%
GPT-4o, Aug. 6th (temp=1)99%94%94%
Hermes 3 405B99%94%94%
GPT-4.1 Mini99%93%93%
Nemotron 3 Super98%91%91%
GPT-4o, May 13th (temp=0)97%89%89%
80%90%100%

Top Overall Models (Top 20)

Ranked by composite score (performance, cost, speed & stability). Click a model name to view its detail page.

ScoreCostSpeedStability
Stealth: Aurora Alpha100%—2.0s100%
GPT-4o Mini (temp=0)100%$0.00034.8s100%
GPT-4o Mini (temp=1)100%$0.00035.6s100%
Inception Mercury 2100%$0.00061.4s96%
Gemini 3 Flash (Preview)100%$0.00205.6s100%
GPT-5.4 Mini (Reasoning, Low)100%$0.00223.5s99%
GPT-4.1 Mini99%$0.00043.4s93%
DeepSeek-V2 Chat100%$0.000116.1s100%
GPT-4o, Aug. 6th (temp=0)100%$0.00526.1s100%
Z.AI GLM 4.5100%$0.001314.5s97%
Z.AI GLM 5 Turbo100%$0.003714.7s98%
Hermes 3 405B99%$0.000021.0s94%
GPT-5.4 Mini97%$0.00202.5s87%
GPT-4o, Aug. 6th (temp=1)99%$0.00566.5s94%
o4 Mini100%$0.007116.7s100%
Nemotron 3 Super98%$0.000021.7s91%
GPT-5.4 Nano (Reasoning)98%$0.00166.1s83%
Grok 4.20 (Beta)97%$0.00343.3s84%
Gemini 2.5 Flash97%$0.00266.0s83%
Claude Sonnet 4.6100%$0.01013.7s100%
60%70%80%90%100%
Model Total â–¼Character dialogue (Spanish) in a storyCharacter dialogue (French) in a storyCharacter dialogue (German) in a storyCharacter dialogue (Italian) in a storyCharacter dialogue (Hindi) in a story
Claude Sonnet 4.6100%100%100%100%100%100%
o4 Mini100%100%100%100%100%100%
Gemini 3 Flash (Preview)100%100%100%100%100%100%
DeepSeek-V2 Chat100%100%100%100%100%100%
Stealth: Aurora Alpha100%100%100%100%100%100%
GPT-4o, Aug. 6th (temp=0)100%100%100%100%100%100%
GPT-4o Mini (temp=1)100%100%100%100%100%100%
GPT-4o Mini (temp=0)100%100%100%100%100%100%
GPT-5.4 Mini (Reasoning, Low)100%100%100%99%100%100%
Z.AI GLM 5 Turbo100%100%100%100%99%100%
Z.AI GLM 4.5100%100%100%100%98%100%
Inception Mercury 2100%100%100%100%98%100%
o4 Mini High100%99%100%99%100%100%
GPT-4o, Aug. 6th (temp=1)99%100%100%100%100%97%
GPT-5 Nano99%98%99%100%100%100%
1–15 of 118
Page 1 / 8

Character dialogue (Spanish) in a story

Character dialogue (French) in a story

Character dialogue (German) in a story

Character dialogue (Italian) in a story

Character dialogue (Hindi) in a story