Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Price-Performance Score Distribution (Top 20)

Click a model name to view its detail page.

ScoreCostTime
Gemini 2.5 Flash Lite97%$0.00031.7s
Mistral Small 3.2 24B97%$0.00025.0s
Gemini 2.5 Flash99%$0.00152.2s
Gemini 3 Flash (Preview)99%$0.00193.4s
Grok 4 Fast99%$0.00086.5s
GPT-4.1 Mini98%$0.00117.0s
Mistral Large 398%$0.00117.7s
Gemma 3 12B95%$0.00019.0s
Qwen 3.5 Plus (2026-02-15)99%$0.00157.2s
Qwen 2.5 72B98%$0.000310.9s
GPT-4o Mini (temp=1)95%$0.00049.5s
Claude Haiku 4.599%$0.00363.2s
Mistral Medium 3.197%$0.00135.9s
Mistral Small Creative96%$0.00023.1s
Grok 4.1 Fast99%$0.001012.4s
Ministral 3 14B93%$0.00024.1s
Gemini 2.5 Flash Lite (Reasoning)95%$0.001917.4s
DeepSeek V3 (2024-12-26)96%$0.000816.0s
GPT-4.198%$0.00544.4s
Mistral Large 298%$0.00447.6s
0.700.800.901.00

Cost vs Performance

Compares total cost for this test against the test score. Quadrant lines are drawn at the median values. Only models with available cost data are shown.

10 low-scoring outliers hidden: Ministral 3 8B (87.0%), Ministral 8B (86.7%), Mistral NeMO (86.6%), Arcee AI: Trinity Mini (85.7%), Ministral 3 3B (81.2%), Ministral 3B (80.9%), Cohere Command R+ (Aug. 2024) (73.7%), Hermes 3 70B (69.5%), Rocinante 12B (66.3%), Claude 3 Haiku (61.1%).

Most Stable Models (Top 20)

Ranked by stability (median × consistency). Click a model name to view its detail page.

ScoreConsistencyStability
Claude Opus 4.6100%99%99%
Claude Opus 4.5100%98%98%
Claude Sonnet 4100%98%98%
Gemini 3 Pro (Preview)100%98%98%
Claude Opus 4.6 (Reasoning)100%98%98%
Claude Sonnet 4.5100%98%98%
Grok 4100%98%98%
Claude Sonnet 4.6 (Reasoning)100%98%98%
Z.AI GLM 599%98%98%
Gemini 2.5 Pro99%97%97%
Qwen 3.5 Plus (2026-02-15)99%97%97%
Gemini 3 Flash (Preview, Reasoning)99%97%97%
Z.AI GLM 4.799%97%97%
Claude Sonnet 4.699%97%97%
Claude Haiku 4.599%97%97%
Gemini 3 Flash (Preview)99%97%97%
GPT-599%96%96%
GPT-5.199%96%96%
Claude 3.7 Sonnet99%96%96%
ByteDance Seed 1.699%95%95%
90%100%

Top Overall Models (Top 20)

Ranked by composite score (performance, cost, speed & stability). Click a model name to view its detail page.

ScoreCostSpeedStability
Gemini 3 Flash (Preview)99%$0.00193.4s97%
Qwen 3.5 Plus (2026-02-15)99%$0.00157.2s97%
Claude Haiku 4.599%$0.00363.2s97%
Grok 4 Fast99%$0.00086.5s95%
Gemini 2.5 Flash99%$0.00152.2s92%
GPT-4.1 Mini98%$0.00117.0s94%
Grok 4.1 Fast99%$0.001012.4s92%
Mistral Medium 3.197%$0.00135.9s91%
Gemini 2.5 Flash Lite97%$0.00031.7s86%
Claude Sonnet 4.5100%$0.0114.9s98%
Claude Sonnet 4100%$0.0116.1s98%
GPT-4.198%$0.00544.4s92%
Qwen 2.5 72B98%$0.000310.9s89%
Mistral Large 398%$0.00117.7s88%
Claude Sonnet 4.699%$0.0114.7s97%
Mistral Large98%$0.00447.6s90%
Gemini 2.5 Flash (Reasoning)98%$0.006311.7s93%
Claude 3.7 Sonnet99%$0.0115.9s96%
Mistral Small Creative96%$0.00023.1s84%
GPT-4o, May 13th (temp=0)99%$0.0113.5s94%
60%70%80%90%100%
Specific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric Prompt
Model Total ▼Character rename: Elena->Mirabel, Gregor->AldricCharacter rename: Elena->Mirabel, Gregor->AldricLocation rename: market square, outer ring, bridge, northern minesLocation rename: market square, outer ring, bridge, northern minesExpand all contractionsExpand all contractionsTense rewriting: past to presentTense rewriting: past to presentPOV shift: 3rd person to 1st person (Elena's perspective)POV shift: 3rd person to 1st person (Elena's perspective)Multi-character gender swap: Priya(F)->Rohan(M), Mara unchangedMulti-character gender swap: Priya(F)->Rohan(M), Mara unchangedCombined: 3rd person past → 1st person presentCombined: 3rd person past → 1st person presentPassive voice → active voicePassive voice → active voiceAvoid said/asked/replied/answeredAvoid said/asked/replied/answered
Claude Sonnet 4100%100%100%100%100%100%100%100%100%100%100%100%100%100%99%98%97%100%100%
Claude Opus 4.6100%100%100%100%100%100%100%100%99%100%99%100%100%100%99%98%98%100%100%
Claude Sonnet 4.5100%100%100%100%100%100%100%100%100%100%100%100%100%99%99%99%96%100%100%
Claude Opus 4.6 (Reasoning)100%100%100%100%100%100%100%100%100%100%99%100%100%100%99%99%96%100%100%
Grok 4100%100%100%100%100%100%100%99%100%100%100%100%100%100%99%99%96%100%100%
Gemini 3 Pro (Preview)100%100%100%100%100%100%100%100%99%100%100%100%100%99%99%98%96%100%100%
Claude Sonnet 4.6 (Reasoning)100%100%100%100%100%100%99%100%99%100%100%100%100%100%99%99%96%100%100%
Claude Opus 4.5100%100%100%100%100%100%100%100%99%100%99%100%100%100%99%97%97%100%100%
Z.AI GLM 599%100%100%100%100%100%100%100%98%100%100%100%100%100%99%98%96%100%100%
Gemini 2.5 Pro99%100%100%100%100%100%100%100%96%100%100%100%100%100%99%99%95%100%100%
GPT-599%100%100%100%100%100%100%100%96%100%100%100%100%100%98%98%97%100%100%
Z.AI GLM 4.799%100%100%100%99%100%100%100%97%100%100%100%100%100%99%98%96%100%100%
Qwen 3.5 Plus (2026-02-15)99%100%100%100%100%100%100%100%99%100%100%99%100%99%99%97%96%100%100%
Gemini 3 Flash (Preview, Reasoning)99%100%100%100%100%100%100%100%98%100%100%100%100%100%99%97%95%100%100%
Gemini 3.1 Pro (Preview)99%100%100%94%100%100%100%100%99%100%100%100%100%100%99%99%96%100%100%
1–15 of 84
Page 1 / 6

Generic Prompt

Character rename: Elena->Mirabel, Gregor->Aldric

Text Editing

Location rename: market square, outer ring, bridge, northern mines

Text Editing

Expand all contractions

Text Editing

Tense rewriting: past to present

Text Editing

POV shift: 3rd person to 1st person (Elena's perspective)

Text Editing

Multi-character gender swap: Priya(F)->Rohan(M), Mara unchanged

Text Editing

Combined: 3rd person past → 1st person present

Text Editing

Passive voice → active voice

Text EditingHallucination

Avoid said/asked/replied/answered

Text Editing

Specific Prompt

Character rename: Elena->Mirabel, Gregor->Aldric

Text Editing

Location rename: market square, outer ring, bridge, northern mines

Text Editing

Expand all contractions

Text Editing

Tense rewriting: past to present

Text Editing

POV shift: 3rd person to 1st person (Elena's perspective)

Text Editing

Multi-character gender swap: Priya(F)->Rohan(M), Mara unchanged

Text Editing

Combined: 3rd person past → 1st person present

Text Editing

Passive voice → active voice

Text EditingHallucination

Avoid said/asked/replied/answered

Text Editing