Bad Writing Habits
Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
Romance: separated couple reunites
Performance Score Distribution (Top 20)
Click a model name to view its detail page.
Price-Performance Score Distribution (Top 20)
Click a model name to view its detail page.
| Score | Cost | Time | ||
|---|---|---|---|---|
| Rocinante 12B | 84% | $0.0007 | 17.7s | |
| GPT-5.4 Mini (Reasoning) | 89% | $0.019 | 21.8s | |
| GPT-5.4 Mini (Reasoning, Low) | 90% | $0.017 | 19.1s | |
| GPT-5.4 Mini | 88% | $0.018 | 19.0s | |
| Mistral Small Creative | 80% | $0.0007 | 11.0s | |
| DeepSeek V3 (2025-03-24) | 85% | $0.0011 | 38.6s | |
| Mistral Large | 82% | $0.011 | 28.6s | |
| Mistral Small 4 | 81% | $0.0016 | 23.0s | |
| Claude 3.5 Haiku | 82% | $0.0021 | 8.5s | |
| Mistral Small 4 (Reasoning) | 82% | $0.0025 | 37.5s | |
| Writer: Palmyra X5 | 80% | $0.011 | 23.9s | |
| ByteDance Seed 1.6 Flash | 81% | $0.0012 | 26.6s | |
| Ministral 3 3B | 79% | $0.0002 | 4.1s | |
| Ministral 3 14B | 79% | $0.0005 | 12.6s | |
| Z.AI GLM 5 Turbo | 84% | $0.0082 | 37.6s | |
| MiniMax M2.7 | 83% | $0.0039 | 1.1m | |
| Qwen 3.5 35B | 82% | $0.0096 | 39.3s | |
| Qwen3 235B A22B Instruct 2507 | 83% | $0.0006 | 52.1s | |
| Grok 4 Fast | 79% | $0.0016 | 22.9s | |
| Mistral Large 3 | 81% | $0.0028 | 32.4s | |
Most Stable Models (Top 20)
Ranked by stability (median × consistency). Click a model name to view its detail page.
| Score | Consistency | Stability | ||
|---|---|---|---|---|
| GPT-5.4 | 92% | 97% | 90% | |
| GPT-5.4 (Reasoning, Low) | 90% | 97% | 88% | |
| GPT-5.4 Mini (Reasoning, Low) | 90% | 98% | 88% | |
| GPT-5.4 (Reasoning) | 91% | 97% | 87% | |
| GPT-5.4 Mini (Reasoning) | 89% | 96% | 86% | |
| GPT-5.4 Mini | 88% | 96% | 85% | |
| DeepSeek V3 (2025-03-24) | 85% | 96% | 81% | |
| Claude Opus 4 | 85% | 96% | 80% | |
| Qwen 3.5 397B A17B | 83% | 96% | 79% | |
| Mistral Small 4 | 81% | 97% | 79% | |
| Mistral Small 4 (Reasoning) | 82% | 96% | 79% | |
| Qwen 3.5 35B | 82% | 97% | 79% | |
| Qwen3 235B A22B Instruct 2507 | 83% | 93% | 78% | |
| MiniMax M2.7 | 83% | 93% | 78% | |
| Mistral Medium 3.1 | 81% | 96% | 78% | |
| Qwen 3.5 122B | 80% | 98% | 78% | |
| ByteDance Seed 1.6 Flash | 81% | 96% | 77% | |
| GPT-5.1 | 83% | 93% | 77% | |
| Qwen 3.5 Flash | 80% | 97% | 77% | |
| Claude Sonnet 4 | 79% | 97% | 77% | |
Top Overall Models (Top 20)
Ranked by composite score (performance, cost, speed & stability). Click a model name to view its detail page.
| Score | Cost | Speed | Stability | ||
|---|---|---|---|---|---|
| GPT-5.4 Mini (Reasoning, Low) | 90% | $0.017 | 19.1s | 88% | |
| GPT-5.4 Mini (Reasoning) | 89% | $0.019 | 21.8s | 86% | |
| GPT-5.4 Mini | 88% | $0.018 | 19.0s | 85% | |
| DeepSeek V3 (2025-03-24) | 85% | $0.0011 | 38.6s | 81% | |
| GPT-5.4 | 92% | $0.065 | 1.9m | 90% | |
| GPT-5.4 (Reasoning, Low) | 90% | $0.067 | 1.7m | 88% | |
| Mistral Small 4 | 81% | $0.0016 | 23.0s | 79% | |
| Mistral Small 4 (Reasoning) | 82% | $0.0025 | 37.5s | 79% | |
| ByteDance Seed 1.6 Flash | 81% | $0.0012 | 26.6s | 77% | |
| Qwen3 235B A22B Instruct 2507 | 83% | $0.0006 | 52.1s | 78% | |
| Qwen 3.5 35B | 82% | $0.0096 | 39.3s | 79% | |
| Z.AI GLM 5 Turbo | 84% | $0.0082 | 37.6s | 76% | |
| Mistral Large | 82% | $0.011 | 28.6s | 77% | |
| Rocinante 12B | 84% | $0.0007 | 17.7s | 72% | |
| Ministral 3 3B | 79% | $0.0002 | 4.1s | 75% | |
| Mistral Small Creative | 80% | $0.0007 | 11.0s | 75% | |
| Claude 3.5 Haiku | 82% | $0.0021 | 8.5s | 73% | |
| MiniMax M2.7 | 83% | $0.0039 | 1.1m | 78% | |
| Mistral Large 3 | 81% | $0.0028 | 32.4s | 76% | |
| Qwen 3.5 Flash | 80% | $0.0020 | 36.1s | 77% | |