Bad Writing Habits
Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
Thriller: chase through city streets
Performance Score Distribution (Top 20)
Price-Performance Score Distribution (Top 20)
| Model | Score | Cost | Time | |
|---|---|---|---|---|
| GPT-5.4 Mini (Reasoning) | 90% | $0.022 | 26.3s | |
| GPT-5.4 Mini | 89% | $0.015 | 18.3s | |
| GPT-5.4 Mini (Reasoning, Low) | 87% | $0.013 | 16.0s | |
| Writer: Palmyra X5 | 86% | $0.011 | 22.5s | |
| Gemini 2.5 Flash | 81% | $0.0039 | 8.1s | |
| Qwen3 235B A22B Instruct 2507 | 86% | $0.0011 | 1.1m | |
| GPT-5.4 | 92% | $0.050 | 1.4m | |
| GPT-5.4 (Reasoning, Low) | 91% | $0.051 | 1.4m | |
| Mistral Small 4 (Reasoning) | 83% | $0.0023 | 32.8s | |
| Qwen 3.5 397B A17B | 87% | $0.016 | 1.5m | |
| Z.AI GLM 5 Turbo | 85% | $0.0068 | 30.7s | |
| GPT-4.1 | 83% | $0.018 | 46.3s | |
| Qwen 3.5 35B | 84% | $0.0068 | 26.9s | |
| Mistral Small 4 | 79% | $0.0011 | 17.2s | |
| Gemini 2.5 Flash Lite (Reasoning) | 81% | $0.0027 | 28.7s | |
| Ministral 8B | 79% | $0.0002 | 8.4s | |
| MiniMax M2.5 | 81% | $0.0026 | 35.8s | |
| GPT-4.1 Mini | 81% | $0.0025 | 14.4s | |
| GPT-5.4 Nano (Reasoning, Low) | 80% | $0.0049 | 17.8s | |
| Claude Sonnet 4.5 | 85% | $0.027 | 34.3s | |
Most Stable Models (Top 20)
Ranked by stability (median × consistency).
| Model | Score | Consistency | Stability | |
|---|---|---|---|---|
| GPT-5.4 | 92% | 98% | 90% | |
| GPT-5.4 (Reasoning) | 91% | 95% | 88% | |
| GPT-5.4 (Reasoning, Low) | 91% | 94% | 86% | |
| GPT-5.4 Mini | 89% | 96% | 86% | |
| GPT-5.4 Mini (Reasoning) | 90% | 93% | 85% | |
| GPT-5.4 Mini (Reasoning, Low) | 87% | 94% | 83% | |
| Qwen 3.5 397B A17B | 87% | 94% | 82% | |
| Writer: Palmyra X5 | 86% | 94% | 81% | |
| Gemini 3.1 Pro (Preview) | 86% | 91% | 80% | |
| GPT-5 | 82% | 96% | 80% | |
| o4 Mini High | 81% | 97% | 79% | |
| Qwen3 235B A22B Instruct 2507 | 86% | 93% | 79% | |
| GPT-5.1 | 83% | 93% | 78% | |
| MiniMax M2.5 | 81% | 95% | 78% | |
| GPT-4.1 | 83% | 92% | 78% | |
| Claude Sonnet 4.5 | 85% | 93% | 78% | |
| Z.AI GLM 5 Turbo | 85% | 94% | 78% | |
| Gemini 2.5 Flash Lite (Reasoning) | 81% | 96% | 78% | |
| GPT-5.4 Nano (Reasoning, Low) | 80% | 96% | 78% | |
| Qwen 3.5 Flash | 80% | 95% | 77% | |
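The stability ranking above is described as median × consistency. A minimal sketch of that arithmetic follows, assuming both values are expressed as fractions between 0 and 1. The function name `stability` is hypothetical, and how the page derives the consistency figure is not stated here, so the sketch takes it as an input. The Score column may not be the exact median used, and the table values are rounded, so recomputing from the displayed numbers can differ from the listed stability by a point or two.

```python
def stability(median_score: float, consistency: float) -> float:
    """Return median_score * consistency, both given as fractions in [0, 1].

    Assumption: this mirrors the "median x consistency" description on the
    page; the page's own rounding and inputs are not known.
    """
    for name, value in (("median_score", median_score), ("consistency", consistency)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")
    return median_score * consistency


# Rounded figures copied from the GPT-5.4 row of the table.
gpt_5_4 = stability(0.92, 0.98)
print(f"GPT-5.4 stability from rounded inputs: {gpt_5_4:.1%}")
```

For the GPT-5.4 row, the product of the rounded inputs lands at roughly 90%, which matches the listed stability.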
Top Overall Models (Top 20)
Ranked by composite score (performance, cost, speed & stability).
| Model | Score | Cost | Speed | Stability | |
|---|---|---|---|---|---|
| GPT-5.4 Mini | 89% | $0.015 | 18.3s | 86% | |
| GPT-5.4 Mini (Reasoning) | 90% | $0.022 | 26.3s | 85% | |
| GPT-5.4 | 92% | $0.050 | 1.4m | 90% | |
| GPT-5.4 Mini (Reasoning, Low) | 87% | $0.013 | 16.0s | 83% | |
| Writer: Palmyra X5 | 86% | $0.011 | 22.5s | 81% | |
| GPT-5.4 (Reasoning, Low) | 91% | $0.051 | 1.4m | 86% | |
| Qwen3 235B A22B Instruct 2507 | 86% | $0.0011 | 1.1m | 79% | |
| Z.AI GLM 5 Turbo | 85% | $0.0068 | 30.7s | 78% | |
| Qwen 3.5 397B A17B | 87% | $0.016 | 1.5m | 82% | |
| Qwen 3.5 35B | 84% | $0.0068 | 26.9s | 76% | |
| Claude Sonnet 4.5 | 85% | $0.027 | 34.3s | 78% | |
| MiniMax M2.5 | 81% | $0.0026 | 35.8s | 78% | |
| GPT-4.1 Mini | 81% | $0.0025 | 14.4s | 77% | |
| Gemini 2.5 Flash Lite (Reasoning) | 81% | $0.0027 | 28.7s | 78% | |
| GPT-5.4 Nano (Reasoning, Low) | 80% | $0.0049 | 17.8s | 78% | |
| Mistral Small 4 (Reasoning) | 83% | $0.0023 | 32.8s | 75% | |
| GPT-4.1 | 83% | $0.018 | 46.3s | 78% | |
| Gemini 2.5 Flash (Reasoning) | 81% | $0.0100 | 19.8s | 76% | |
| Ministral 8B | 79% | $0.0002 | 8.4s | 75% | |
| o4 Mini High | 81% | $0.022 | 37.6s | 79% | |