Text Replacement

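This benchmark tests deterministic text transformations: renaming characters and locations, expanding contractions, rewriting tense, shifting POV, swapping gender, converting passive voice to active voice, combining transformations, and avoiding specific words. Each expected change is checked independently, and the score reflects how many of those checks pass.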
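The page does not publish its scoring code, so the following is only a minimal sketch of what "checking each expected change independently" could look like for a rename test. The `ExpectedChange` type, the `score_output` function, and the simple substring matching are all assumptions made for illustration, not the benchmark's actual implementation.

```python
from dataclasses import dataclass


@dataclass
class ExpectedChange:
    """Hypothetical check: `old` must be gone and `new` must be present."""
    old: str
    new: str


def score_output(output: str, changes: list[ExpectedChange]) -> float:
    """Return the fraction of expected changes that pass, each judged on its own."""
    if not changes:
        return 1.0
    text = output.lower()
    passed = sum(
        1
        for change in changes
        # Assumed rule: case-insensitive substring match on both terms.
        if change.old.lower() not in text and change.new.lower() in text
    )
    return passed / len(changes)


changes = [ExpectedChange("Elena", "Mirabel"), ExpectedChange("Gregor", "Aldric")]
output = "Mirabel crossed the market square while Gregor waited."
print(score_output(output, changes))  # 0.5: one rename applied, one missed
```

Because every change is scored separately, a model that applies most of a transformation still earns partial credit instead of failing the whole test.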

Price-Performance Score Distribution (Top 20)

| Model | Score | Cost | Time |
|---|---|---|---|
| Gemini 2.5 Flash Lite | 97% | $0.0003 | 1.7s |
| Gemini 3.1 Flash Lite (Preview) | 99% | $0.0010 | 1.8s |
| Mistral Small 4 | 96% | $0.0004 | 3.3s |
| Mistral Small 3.2 24B | 97% | $0.0002 | 5.0s |
| Gemini 2.5 Flash | 99% | $0.0015 | 2.2s |
| Grok 4 Fast | 99% | $0.0008 | 6.5s |
| Gemini 3 Flash (Preview) | 99% | $0.0019 | 3.4s |
| GPT-4.1 Mini | 98% | $0.0011 | 7.0s |
| Inception Mercury 2 | 95% | $0.0017 | 2.3s |
| Mistral Large 3 | 98% | $0.0011 | 7.7s |
| Gemma 3 12B | 95% | $0.0001 | 9.0s |
| Qwen 3.5 Plus (2026-02-15) | 99% | $0.0015 | 7.2s |
| Qwen 2.5 72B | 98% | $0.0003 | 10.9s |
| GPT-4o Mini (temp=1) | 95% | $0.0004 | 9.5s |
| Grok 4.20 (Beta) | 98% | $0.0034 | 1.8s |
| Claude Haiku 4.5 | 99% | $0.0036 | 3.2s |
| Stealth: Hunter Alpha | 98% | $0.0000 | 19.5s |
| Mistral Medium 3.1 | 97% | $0.0013 | 5.9s |
| Grok 4.1 Fast | 99% | $0.0010 | 12.4s |
| Mistral Small Creative | 96% | $0.0002 | 3.1s |

Cost vs Performance

Compares total cost for this test against the test score. Quadrant lines are drawn at the median values. Only models with available cost data are shown.

12 low-scoring outliers hidden: Ministral 3 8B (87.0%), Ministral 8B (86.7%), Mistral NeMO (86.6%), Arcee AI: Trinity Mini (85.7%), Nemotron 3 Nano (83.3%), Ministral 3 3B (81.2%), Ministral 3B (80.9%), Cohere Command R+ (Aug. 2024) (73.7%), LFM2 24B (71.7%), Hermes 3 70B (69.5%), Rocinante 12B (66.3%), Claude 3 Haiku (61.1%).
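As a rough illustration of how median-based quadrant lines split a cost versus score plot, the sketch below classifies a handful of models taken from the Top 20 list on this page. The `quadrants` function and its labels are hypothetical, and the medians here come from this four-model sample only, so they will not match the lines on the real chart, which uses every model with cost data.

```python
from statistics import median

# (model, cost in USD, score) sampled from the Top 20 list on this page.
models = [
    ("Gemini 2.5 Flash Lite", 0.0003, 0.97),
    ("Gemini 2.5 Flash", 0.0015, 0.99),
    ("Claude Haiku 4.5", 0.0036, 0.99),
    ("Gemma 3 12B", 0.0001, 0.95),
]


def quadrants(rows):
    """Label each model by which side of the median cost and score it falls on."""
    cost_mid = median(cost for _, cost, _ in rows)
    score_mid = median(score for _, _, score in rows)
    labels = {}
    for name, cost, score in rows:
        cost_side = "cheap" if cost <= cost_mid else "expensive"
        score_side = "high" if score >= score_mid else "low"
        labels[name] = f"{cost_side}/{score_side}"
    return labels


for name, label in quadrants(models).items():
    print(f"{name}: {label}")
```

The "cheap/high" quadrant is the one of interest for price-performance; in this small sample no model lands there because the cheaper models also score slightly lower.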

Most Stable Models (Top 20)

Ranked by stability (median × consistency).

| Model | Score | Consistency | Stability |
|---|---|---|---|
| Claude Opus 4.6 | 100% | 99% | 99% |
| Claude Opus 4.5 | 100% | 98% | 98% |
| Claude Sonnet 4 | 100% | 98% | 98% |
| Gemini 3 Pro (Preview) | 100% | 98% | 98% |
| Claude Opus 4.6 (Reasoning) | 100% | 98% | 98% |
| Claude Sonnet 4.5 | 100% | 98% | 98% |
| Grok 4 | 100% | 98% | 98% |
| Claude Sonnet 4.6 (Reasoning) | 100% | 98% | 98% |
| Qwen 3.5 27B | 99% | 98% | 98% |
| Z.AI GLM 5 | 99% | 98% | 98% |
| Gemini 2.5 Pro | 99% | 97% | 97% |
| Qwen 3.5 Plus (2026-02-15) | 99% | 97% | 97% |
| Gemini 3 Flash (Preview, Reasoning) | 99% | 97% | 97% |
| Z.AI GLM 4.7 | 99% | 97% | 97% |
| Claude Sonnet 4.6 | 99% | 97% | 97% |
| Grok 4.20 (Beta, Reasoning) | 99% | 97% | 97% |
| Claude Haiku 4.5 | 99% | 97% | 97% |
| Gemini 3 Flash (Preview) | 99% | 97% | 97% |
| GPT-5 | 99% | 96% | 96% |
| GPT-5.1 | 99% | 96% | 96% |
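The stability ranking is described as median × consistency. The short sketch below works through that product using figures from this list; the `stability` function name is made up for illustration, and how the page derives the median and the consistency values themselves is not stated here.

```python
def stability(median_score: float, consistency: float) -> float:
    """Stability as described on this page: median score times consistency."""
    return median_score * consistency


# Claude Opus 4.6: median 100%, consistency 99%.
print(f"{stability(1.00, 0.99):.0%}")  # 99%

# Qwen 3.5 27B: the displayed 99% and 98% multiply to about 97%.
print(f"{stability(0.99, 0.98):.0%}")  # 97%
```

The second result is one point below the 98% listed for Qwen 3.5 27B. A plausible explanation is that the page multiplies unrounded values and rounds only for display, so recomputing from the displayed percentages can drift by a point.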

Top Overall Models (Top 20)

Ranked by composite score (performance, cost, speed, and stability).

| Model | Score | Cost | Speed | Stability |
|---|---|---|---|---|
| Gemini 3.1 Flash Lite (Preview) | 99% | $0.0010 | 1.8s | 96% |
| Gemini 3 Flash (Preview) | 99% | $0.0019 | 3.4s | 97% |
| Qwen 3.5 Plus (2026-02-15) | 99% | $0.0015 | 7.2s | 97% |
| Claude Haiku 4.5 | 99% | $0.0036 | 3.2s | 97% |
| Grok 4 Fast | 99% | $0.0008 | 6.5s | 95% |
| Gemini 2.5 Flash | 99% | $0.0015 | 2.2s | 92% |
| GPT-4.1 Mini | 98% | $0.0011 | 7.0s | 94% |
| Stealth: Healer Alpha | 99% | $0.0000 | 14.3s | 94% |
| Grok 4.1 Fast | 99% | $0.0010 | 12.4s | 92% |
| Mistral Medium 3.1 | 97% | $0.0013 | 5.9s | 91% |
| Gemini 2.5 Flash Lite | 97% | $0.0003 | 1.7s | 86% |
| Claude Sonnet 4.5 | 100% | $0.011 | 4.9s | 98% |
| Claude Sonnet 4 | 100% | $0.011 | 6.1s | 98% |
| GPT-4.1 | 98% | $0.0054 | 4.4s | 92% |
| Grok 4.20 (Beta) | 98% | $0.0034 | 1.8s | 89% |
| Qwen 2.5 72B | 98% | $0.0003 | 10.9s | 89% |
| Mistral Large 3 | 98% | $0.0011 | 7.7s | 88% |
| Claude Sonnet 4.6 | 99% | $0.011 | 4.7s | 97% |
| Mistral Large | 98% | $0.0044 | 7.6s | 90% |
| Stealth: Hunter Alpha | 98% | $0.0000 | 19.5s | 89% |
Per-test scores, sorted by Total (descending). Each test has two columns: S = Specific Prompt, G = Generic Prompt. Showing models 1 to 15 of 116.

Tests:
1. Character rename: Elena->Mirabel, Gregor->Aldric
2. Location rename: market square, outer ring, bridge, northern mines
3. Expand all contractions
4. Tense rewriting: past to present
5. POV shift: 3rd person to 1st person (Elena's perspective)
6. Multi-character gender swap: Priya(F)->Rohan(M), Mara unchanged
7. Combined: 3rd person past → 1st person present
8. Passive voice → active voice
9. Avoid said/asked/replied/answered

| Model | Total | 1 S | 1 G | 2 S | 2 G | 3 S | 3 G | 4 S | 4 G | 5 S | 5 G | 6 S | 6 G | 7 S | 7 G | 8 S | 8 G | 9 S | 9 G |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 98% | 97% | 100% | 100% |
| Claude Opus 4.6 | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 99% | 100% | 100% | 100% | 99% | 98% | 98% | 100% | 100% |
| Claude Sonnet 4.5 | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 99% | 99% | 96% | 100% | 100% |
| Claude Opus 4.6 (Reasoning) | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 99% | 99% | 96% | 100% | 100% |
| Grok 4 | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 99% | 96% | 100% | 100% |
| Gemini 3 Pro (Preview) | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 100% | 99% | 99% | 98% | 96% | 100% | 100% |
| Claude Sonnet 4.6 (Reasoning) | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 99% | 100% | 100% | 100% | 100% | 100% | 99% | 99% | 96% | 100% | 100% |
| Claude Opus 4.5 | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 99% | 100% | 100% | 100% | 99% | 97% | 97% | 100% | 100% |
| Z.AI GLM 5 | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 100% | 100% | 100% | 100% | 100% | 99% | 98% | 96% | 100% | 100% |
| Gemini 2.5 Pro | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 99% | 99% | 95% | 100% | 100% |
| GPT-5 | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 96% | 100% | 100% | 100% | 100% | 100% | 98% | 98% | 97% | 100% | 100% |
| Qwen 3.5 27B | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 99% | 100% | 99% | 98% | 97% | 100% | 100% |
| Z.AI GLM 4.7 | 99% | 100% | 100% | 100% | 99% | 100% | 100% | 100% | 97% | 100% | 100% | 100% | 100% | 100% | 99% | 98% | 96% | 100% | 100% |
| Qwen 3.5 Plus (2026-02-15) | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 99% | 100% | 100% | 99% | 100% | 99% | 99% | 97% | 96% | 100% | 100% |
| Grok 4.20 (Beta, Reasoning) | 99% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 94% | 100% | 100% | 100% | 100% | 99% | 99% | 99% | 96% | 100% | 100% |

Per-test breakdowns are given for both the Generic Prompt and the Specific Prompt. Each covers the same nine tests:

Character rename: Elena->Mirabel, Gregor->Aldric

Location rename: market square, outer ring, bridge, northern mines

Expand all contractions

Tense rewriting: past to present

POV shift: 3rd person to 1st person (Elena's perspective)

Multi-character gender swap: Priya(F)->Rohan(M), Mara unchanged

Combined: 3rd person past → 1st person present

Passive voice → active voice

Avoid said/asked/replied/answered