Language Writing
Can the model generate text in different languages?
| Model | Total â–¼ | Character dialogue (Spanish) in a story | Character dialogue (French) in a story | Character dialogue (German) in a story | Character dialogue (Italian) in a story | Character dialogue (Hindi) in a story |
|---|---|---|---|---|---|---|
| Gemini 3 Flash (Preview) | 99% | 100% | 96% | 98% | 100% | 100% |
| Hermes 3 405B | 97% | 96% | 96% | 94% | 100% | 98% |
| Z.AI GLM 4.5 | 96% | 100% | 94% | 91% | 96% | 100% |
| o4 Mini | 95% | 95% | 94% | 93% | 91% | 100% |
| GPT-4o Mini (temp=1) | 95% | 96% | 90% | 92% | 95% | 100% |
| DeepSeek-V2 Chat | 93% | 93% | 96% | 84% | 94% | 100% |
| GPT-4.1 Mini | 93% | 96% | 86% | 88% | 95% | 100% |
| GPT-4o Mini (temp=0) | 92% | 95% | 87% | 90% | 90% | 100% |
| GPT-4o, Aug. 6th (temp=0) | 92% | 81% | 90% | 94% | 96% | 100% |
| Claude 2.0 | 91% | 93% | 94% | 100% | 97% | 72% |
| o4 Mini High | 91% | 85% | 95% | 89% | 89% | 98% |
| Gemini 2.5 Flash | 90% | 90% | 90% | 94% | 91% | 88% |
| GPT-4.1 | 89% | 90% | 87% | 92% | 89% | 89% |
| Llama 3.1 405B | 89% | 89% | 89% | 84% | 97% | 86% |
| MoonshotAI: Kimi K2.5 | 89% | 93% | 91% | 96% | 93% | 71% |
1–15 of 93
Page 1 / 7