Bad Writing Habits
Detects common prose-quality anti-patterns in AI-generated creative writing, including passive voice, overuse of the past progressive, weak dialogue tags, filter words, purple prose, clichés, AI-typical words, adverbs, and character names, and more.
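As a rough illustration of the kind of checks described above, here is a minimal sketch of a pattern-based detector. It is not the benchmark's actual implementation: the word lists, the regular expressions, and the function name `find_bad_habits` are all hypothetical, and the patterns are deliberately naive (for example, the passive-voice check only looks for a form of "to be" followed by a word ending in "-ed").

```python
import re

# Hypothetical, deliberately tiny word lists (assumptions for illustration only;
# the lists the benchmark really uses are not given in this page).
FILTER_WORDS = {"saw", "heard", "felt", "noticed", "realized", "seemed"}
CLICHES = ("heart pounded", "shiver down", "deafening silence")

# Naive passive-voice pattern: a form of "to be" followed by a word ending in -ed.
PASSIVE_RE = re.compile(
    r"\b(?:was|were|is|are|been|being|be)\s+\w+ed\b", re.IGNORECASE
)
# Naive past-progressive pattern: "was"/"were" followed by a word ending in -ing.
PAST_PROGRESSIVE_RE = re.compile(r"\b(?:was|were)\s+\w+ing\b", re.IGNORECASE)


def find_bad_habits(text: str) -> dict:
    """Count occurrences of a few anti-patterns in a passage of prose."""
    lowered = text.lower()
    words = re.findall(r"[a-z']+", lowered)
    return {
        "passive_voice": len(PASSIVE_RE.findall(text)),
        "past_progressive": len(PAST_PROGRESSIVE_RE.findall(text)),
        "filter_words": sum(1 for w in words if w in FILTER_WORDS),
        "cliches": sum(lowered.count(c) for c in CLICHES),
    }


sample = (
    "She felt the door was opened by someone. "
    "He was running. A shiver down her spine."
)
print(find_bad_habits(sample))
```

How raw counts like these would be turned into the percentage scores in the table below is not stated on this page, so the sketch stops at counting.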
| Model | Total | Literary fiction: old friends reunite | Thriller: chase through city streets | Romance: separated couple reunites | Fantasy: entering an ancient ruin | Mystery: examining a crime scene | Horror: alone in an eerie place at night |
|---|---|---|---|---|---|---|---|
| Rocinante 12B | 87% | 88% | 90% | 86% | 84% | 88% | 87% |
| Mistral Large | 87% | 86% | 87% | 85% | 85% | 89% | 88% |
| Mistral Large 2 | 87% | 86% | 89% | 87% | 86% | 86% | 86% |
| Ministral 8B | 86% | 86% | 89% | 83% | 84% | 84% | 90% |
| Ministral 3B | 86% | 82% | 89% | 84% | 84% | 87% | 89% |
| Claude 3.5 Sonnet | 84% | 84% | 85% | 86% | 80% | 87% | 84% |
| Mistral NeMo | 84% | 87% | 80% | 85% | 80% | 84% | 85% |
| Claude 3.7 Sonnet | 83% | 80% | 87% | 79% | 84% | 85% | 85% |
| DeepSeek-V2 Chat | 83% | 80% | 87% | 81% | 82% | 85% | 84% |
| Hermes 3 70B | 83% | 81% | 83% | 83% | 82% | 85% | 84% |
| Hermes 3 405B | 83% | 80% | 83% | 81% | 82% | 87% | 82% |
| Llama 3.1 70B | 83% | 85% | 83% | 87% | 75% | 87% | 79% |
| Qwen 2.5 72B | 83% | 84% | 80% | 81% | 81% | 83% | 85% |
| Llama 3.1 8B | 83% | 81% | 89% | 79% | 78% | 83% | 85% |
| Cohere Command R+ (Aug. 2024) | 82% | 81% | 82% | 84% | 79% | 85% | 82% |
Showing models 1–15 of 25 (page 1 of 2).
Cost vs Performance
Plots each model's total cost for this test against its test score. Quadrant lines are drawn at the median cost and the median score. Only models with available cost data are shown.
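The median-based quadrant split can be sketched as follows. This is an illustrative assumption about how such a chart might be computed, not the site's actual code: the function name `assign_quadrants`, the tie-handling rule (values equal to the median count as "low cost" and "high score"), and the example numbers are all hypothetical placeholders.

```python
from statistics import median


def assign_quadrants(models):
    """Place each model in a cost/score quadrant split at the medians.

    `models` maps a model name to a (total_cost, score) pair.
    Ties with the median go to "low cost" / "high score" (an assumption).
    """
    cost_median = median(cost for cost, _ in models.values())
    score_median = median(score for _, score in models.values())

    quadrants = {}
    for name, (cost, score) in models.items():
        cost_side = "low cost" if cost <= cost_median else "high cost"
        score_side = "high score" if score >= score_median else "low score"
        quadrants[name] = f"{cost_side}, {score_side}"
    return quadrants, cost_median, score_median


# Placeholder data: these names, costs, and scores are invented for the example.
example = {
    "model_a": (1.0, 80),
    "model_b": (2.0, 90),
    "model_c": (3.0, 70),
}
quadrants, cost_median, score_median = assign_quadrants(example)
for name, quadrant in quadrants.items():
    print(f"{name}: {quadrant}")
```

Splitting at the medians rather than at fixed thresholds means roughly half the plotted models fall on each side of each line, so the quadrants describe a model relative to the others shown, not against an absolute standard.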
[Charts: Performance Score Distribution (Top 20), one per prompt: Literary fiction: old friends reunite; Thriller: chase through city streets; Romance: separated couple reunites; Fantasy: entering an ancient ruin; Mystery: examining a crime scene; Horror: alone in an eerie place at night]