N-Length Sentences

Write sentences with exactly N words

Price-Performance Score Distribution (Top 20)

Click a model name to view its detail page.

ScoreCostTime
Gemini 3.1 Flash Lite100%$0.00011.5s
Gemini 3.1 Flash Lite (Reasoning)98%$0.00012.0s
Gemini 3.1 Flash Lite (Preview)100%$0.00011.2s
Inception Mercury98%$0.00011.6s
Stealth: Aurora Alpha98%—1.7s
Gemma 4 26B96%$0.00003.8s
Gemini 3 Flash (Preview)99%$0.00041.9s
Inception Mercury 2100%$0.00071.2s
Llama 3.1 Nemotron 70B81%$0.00015.7s
GPT-5.4 Nano (Reasoning, Low)100%$0.00085.7s
GPT-5.4 Nano (Reasoning)100%$0.00107.5s
Llama 3.1 70B84%$0.00022.1s
Nemotron 3 Super100%$0.000016.0s
GPT-5.4 Mini (Reasoning, Low)100%$0.00256.3s
Gemma 4 31B89%$0.00018.6s
GPT-5.4 Mini (Reasoning)100%$0.00388.1s
GPT-OSS 120B100%$0.000420.3s
Claude Opus 4.594%$0.00526.8s
Stealth: Healer Alpha79%$0.000012.1s
GPT-5 Nano100%$0.001028.2s
0.700.800.901.00

Cost vs Performance

Compares total cost for this test against the test score. Quadrant lines are drawn at the median values. Only models with available cost data are shown.

Most Stable Models (Top 20)

Ranked by stability (median × consistency). Click a model name to view its detail page.

ScoreConsistencyStability
Qwen3.6 Max Preview100%100%100%
Gemini 3.1 Pro (Preview)100%100%100%
Z.AI GLM 5 Turbo100%100%100%
Claude Sonnet 4.6 (Reasoning)100%100%100%
Grok 4.3 (Reasoning)100%100%100%
MoonshotAI: Kimi K2.6100%100%100%
Qwen 3.5 397B A17B100%100%100%
Gemma 4 31B (Reasoning)100%100%100%
Qwen 3.5 122B100%100%100%
GPT-5.4 (Reasoning, Low)100%100%100%
Qwen 3.5 27B100%100%100%
GPT-5.4 Mini (Reasoning)100%100%100%
Gemini 3 Flash (Preview, Reasoning)100%100%100%
o4 Mini High100%100%100%
GPT-5.2100%100%100%
o4 Mini100%100%100%
Qwen 3.5 Flash100%100%100%
Qwen 3.5 9B100%100%100%
Gemini 3.1 Flash Lite (Preview)100%100%100%
Gemini 3.1 Flash Lite100%100%100%
100%

Top Overall Models (Top 20)

Ranked by composite score (performance, cost, speed & stability). Click a model name to view its detail page.

ScoreCostSpeedStability
Gemini 3.1 Flash Lite (Preview)100%$0.00011.2s100%
Gemini 3.1 Flash Lite100%$0.00011.5s100%
Inception Mercury 2100%$0.00071.2s98%
GPT-5.4 Nano (Reasoning)100%$0.00107.5s100%
GPT-5.4 Mini (Reasoning, Low)100%$0.00256.3s100%
Gemini 3 Flash (Preview)99%$0.00041.9s95%
Nemotron 3 Super100%$0.000016.0s100%
GPT-5.4 Mini (Reasoning)100%$0.00388.1s100%
GPT-5.4 Nano (Reasoning, Low)100%$0.00085.7s95%
Inception Mercury98%$0.00011.6s90%
GPT-OSS 120B100%$0.000420.3s98%
GPT-5 Nano100%$0.001028.2s100%
Stealth: Aurora Alpha98%—1.7s90%
GPT-5.4 (Reasoning, Low)100%$0.009310.4s100%
Gemini 3.1 Flash Lite (Reasoning)98%$0.00012.0s85%
o4 Mini100%$0.008320.8s100%
GPT-5.2100%$0.01115.0s100%
Gemini 3 Flash (Preview, Reasoning)100%$0.01017.7s100%
Qwen 3.5 Flash100%$0.002438.6s100%
GPT-5 Mini99%$0.004326.2s94%
70%80%90%100%
Model Total â–¼Write sentences with 5 words eachWrite sentences with 10 words eachWrite sentences with 20 words each
Qwen3.6 Max Preview100%100%100%100%
Gemini 3.1 Pro (Preview)100%100%100%100%
Z.AI GLM 5 Turbo100%100%100%100%
Claude Sonnet 4.6 (Reasoning)100%100%100%100%
Grok 4.3 (Reasoning)100%100%100%100%
MoonshotAI: Kimi K2.6100%100%100%100%
Qwen 3.5 397B A17B100%100%100%100%
Gemma 4 31B (Reasoning)100%100%100%100%
Qwen 3.5 122B100%100%100%100%
GPT-5.4 (Reasoning, Low)100%100%100%100%
Qwen 3.5 27B100%100%100%100%
GPT-5.4 Mini (Reasoning)100%100%100%100%
Gemini 3 Flash (Preview, Reasoning)100%100%100%100%
o4 Mini High100%100%100%100%
GPT-5.2100%100%100%100%
1–15 of 147
Page 1 / 10

Write sentences with 5 words each

Write sentences with 10 words each

Write sentences with 20 words each