N-Length Sentences

Write sentences with exactly N words
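
The page does not say how a response is checked, so the following is only a minimal sketch of one plausible scorer. It assumes that each non-empty line of a response is one sentence and that a "word" is any whitespace-separated token containing at least one letter or digit. The names `count_words` and `score_response` are hypothetical and do not come from the benchmark.

```python
import re


def count_words(sentence: str) -> int:
    """Count whitespace-separated tokens that contain a letter or digit."""
    # Assumption: a lone punctuation token such as "-" is not a word.
    return sum(1 for token in sentence.split() if re.search(r"\w", token))


def score_response(response: str, n: int) -> float:
    """Return the fraction of non-empty lines that have exactly n words."""
    lines = [line.strip() for line in response.splitlines() if line.strip()]
    if not lines:
        return 0.0
    hits = sum(1 for line in lines if count_words(line) == n)
    return hits / len(lines)
```

Under these assumptions, a two-line response in which only one line has five words would score 0.5 on the 5-word test. A real scorer may treat hyphenated words, contractions, or numbers differently.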

Price-Performance Score Distribution (Top 20)


| Model | Score | Cost | Time |
| --- | --- | --- | --- |
| Gemini 3.1 Flash Lite (Preview) | 100% | $0.0001 | 1.2s |
| Inception Mercury | 98% | $0.0001 | 1.6s |
| Stealth: Aurora Alpha | 98% | — | 1.7s |
| Inception Mercury 2 | 100% | $0.0007 | 1.2s |
| Gemini 3 Flash (Preview) | 99% | $0.0004 | 1.9s |
| GPT-5.4 Nano (Reasoning, Low) | 100% | $0.0008 | 5.7s |
| Llama 3.1 70B | 84% | $0.0002 | 2.1s |
| Llama 3.1 Nemotron 70B | 81% | $0.0001 | 5.7s |
| GPT-5.4 Nano (Reasoning) | 100% | $0.0010 | 7.5s |
| GPT-5.4 Mini (Reasoning, Low) | 100% | $0.0025 | 6.3s |
| Nemotron 3 Super | 100% | $0.0000 | 16.0s |
| GPT-5.4 Mini (Reasoning) | 100% | $0.0038 | 8.1s |
| Claude Opus 4.5 | 94% | $0.0052 | 6.8s |
| Llama 3.1 8B | 80% | $0.0000 | 910ms |
| GPT-5 Nano | 100% | $0.0010 | 28.2s |
| Stealth: Healer Alpha | 79% | $0.0000 | 12.1s |
| GPT-4o, May 13th (temp=0) | 87% | $0.0025 | 4.6s |
| GPT-5.4 (Reasoning, Low) | 100% | $0.0093 | 10.4s |
| Claude Opus 4.6 | 84% | $0.0055 | 7.6s |
| GPT-5 Mini | 99% | $0.0043 | 26.2s |

Cost vs Performance

Compares total cost for this test against the test score. Quadrant lines are drawn at the median values. Only models with available cost data are shown.
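
As an illustration of the median-based quadrants described above, here is a small sketch that labels each model by which side of the median cost and median score it falls on. The function name `quadrants`, the label wording, and the tie handling (values equal to a median count as "low cost" and "high score") are assumptions. The sample uses five rows from the price-performance table, so its medians differ from the ones the chart computes over every model with cost data.

```python
from statistics import median


def quadrants(models: dict[str, tuple[float, float]]) -> dict[str, str]:
    """Map each model name to a quadrant label, given (cost, score) pairs."""
    cost_median = median(cost for cost, _ in models.values())
    score_median = median(score for _, score in models.values())
    labels = {}
    for name, (cost, score) in models.items():
        # Assumption: ties go to the cheaper and higher-scoring side.
        cost_side = "low cost" if cost <= cost_median else "high cost"
        score_side = "high score" if score >= score_median else "low score"
        labels[name] = f"{cost_side}, {score_side}"
    return labels


# Cost in dollars (as rounded on the page) and score in percent.
sample = {
    "Gemini 3.1 Flash Lite (Preview)": (0.0001, 100),
    "Llama 3.1 70B": (0.0002, 84),
    "Claude Opus 4.5": (0.0052, 94),
    "GPT-5.4 (Reasoning, Low)": (0.0093, 100),
    "Llama 3.1 8B": (0.0000, 80),
}
labels = quadrants(sample)
```

For this sample the median cost is $0.0002 and the median score is 94%, so Gemini 3.1 Flash Lite (Preview) lands in the low-cost, high-score quadrant and Llama 3.1 8B in the low-cost, low-score one.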

Most Stable Models (Top 20)

Ranked by stability (median × consistency).

| Model | Score | Consistency | Stability |
| --- | --- | --- | --- |
| Gemini 3.1 Pro (Preview) | 100% | 100% | 100% |
| Z.AI GLM 5 Turbo | 100% | 100% | 100% |
| Claude Sonnet 4.6 (Reasoning) | 100% | 100% | 100% |
| Qwen 3.5 397B A17B | 100% | 100% | 100% |
| Qwen 3.5 122B | 100% | 100% | 100% |
| GPT-5.4 (Reasoning, Low) | 100% | 100% | 100% |
| Qwen 3.5 27B | 100% | 100% | 100% |
| GPT-5.4 Mini (Reasoning) | 100% | 100% | 100% |
| Gemini 3 Flash (Preview, Reasoning) | 100% | 100% | 100% |
| o4 Mini High | 100% | 100% | 100% |
| GPT-5.2 | 100% | 100% | 100% |
| o4 Mini | 100% | 100% | 100% |
| Qwen 3.5 Flash | 100% | 100% | 100% |
| Qwen 3.5 9B | 100% | 100% | 100% |
| Gemini 3.1 Flash Lite (Preview) | 100% | 100% | 100% |
| GPT-5.4 Mini (Reasoning, Low) | 100% | 100% | 100% |
| Nemotron 3 Super | 100% | 100% | 100% |
| GPT-5 Nano | 100% | 100% | 100% |
| GPT-5.4 Nano (Reasoning) | 100% | 100% | 100% |
| GPT-5 | 100% | 99% | 99% |
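
The stability figure is described as median × consistency. The sketch below works that product through for two rows of the table. It assumes the Score column is the median score and that both inputs are fractions between 0 and 1. How the page derives consistency is not stated, so it is taken as given here, and the function name `stability` is hypothetical.

```python
def stability(median_score: float, consistency: float) -> float:
    """Stability as the product of median score and consistency (both 0 to 1)."""
    return median_score * consistency


# (model, median score, consistency), copied from the table above.
rows = [
    ("GPT-5", 1.00, 0.99),
    ("GPT-5.2", 1.00, 1.00),
]

# Rank from most to least stable.
ranked = sorted(rows, key=lambda row: stability(row[1], row[2]), reverse=True)
```

With these inputs GPT-5 comes out at 1.00 × 0.99 = 0.99, matching the 99% shown for it, and it ranks below GPT-5.2 at 1.00.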

Top Overall Models (Top 20)

Ranked by composite score (performance, cost, speed, and stability).

| Model | Score | Cost | Speed | Stability |
| --- | --- | --- | --- | --- |
| Gemini 3.1 Flash Lite (Preview) | 100% | $0.0001 | 1.2s | 100% |
| Inception Mercury 2 | 100% | $0.0007 | 1.2s | 98% |
| GPT-5.4 Nano (Reasoning) | 100% | $0.0010 | 7.5s | 100% |
| GPT-5.4 Mini (Reasoning, Low) | 100% | $0.0025 | 6.3s | 100% |
| Gemini 3 Flash (Preview) | 99% | $0.0004 | 1.9s | 95% |
| Nemotron 3 Super | 100% | $0.0000 | 16.0s | 100% |
| GPT-5.4 Mini (Reasoning) | 100% | $0.0038 | 8.1s | 100% |
| GPT-5.4 Nano (Reasoning, Low) | 100% | $0.0008 | 5.7s | 95% |
| Inception Mercury | 98% | $0.0001 | 1.6s | 90% |
| GPT-5 Nano | 100% | $0.0010 | 28.2s | 100% |
| Stealth: Aurora Alpha | 98% | — | 1.7s | 90% |
| GPT-5.4 (Reasoning, Low) | 100% | $0.0093 | 10.4s | 100% |
| o4 Mini | 100% | $0.0083 | 20.8s | 100% |
| GPT-5.2 | 100% | $0.0111 | 5.0s | 100% |
| Gemini 3 Flash (Preview, Reasoning) | 100% | $0.0101 | 7.7s | 100% |
| Qwen 3.5 Flash | 100% | $0.0024 | 38.6s | 100% |
| GPT-5 Mini | 99% | $0.0043 | 26.2s | 94% |
| Z.AI GLM 5 Turbo | 100% | $0.0112 | 6.6s | 100% |
| GPT-5.4 (Reasoning) | 100% | $0.0131 | 9.5s | 98% |
| o4 Mini High | 100% | $0.0112 | 7.6s | 100% |

| Model | Total | Write sentences with 5 words each | Write sentences with 10 words each | Write sentences with 20 words each |
| --- | --- | --- | --- | --- |
| Gemini 3.1 Pro (Preview) | 100% | 100% | 100% | 100% |
| Z.AI GLM 5 Turbo | 100% | 100% | 100% | 100% |
| Claude Sonnet 4.6 (Reasoning) | 100% | 100% | 100% | 100% |
| Qwen 3.5 397B A17B | 100% | 100% | 100% | 100% |
| Qwen 3.5 122B | 100% | 100% | 100% | 100% |
| GPT-5.4 (Reasoning, Low) | 100% | 100% | 100% | 100% |
| Qwen 3.5 27B | 100% | 100% | 100% | 100% |
| GPT-5.4 Mini (Reasoning) | 100% | 100% | 100% | 100% |
| Gemini 3 Flash (Preview, Reasoning) | 100% | 100% | 100% | 100% |
| o4 Mini High | 100% | 100% | 100% | 100% |
| GPT-5.2 | 100% | 100% | 100% | 100% |
| o4 Mini | 100% | 100% | 100% | 100% |
| Qwen 3.5 Flash | 100% | 100% | 100% | 100% |
| Qwen 3.5 9B | 100% | 100% | 100% | 100% |
| Gemini 3.1 Flash Lite (Preview) | 100% | 100% | 100% | 100% |

Showing models 1–15 of 118.

Write sentences with 5 words each

Write sentences with 10 words each

Write sentences with 20 words each