Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Price-Performance Score Distribution (Top 20)

Click a model name to view its detail page.

ScoreCostTime
Gemini 2.5 Flash Lite97%$0.00031.7s
Gemini 3.1 Flash Lite (Preview)99%$0.00101.8s
Gemini 3.1 Flash Lite (Reasoning)99%$0.00103.1s
Mistral Small 496%$0.00043.3s
Gemini 3.1 Flash Lite99%$0.00102.8s
Mistral Small 3.2 24B97%$0.00025.0s
Gemini 2.5 Flash99%$0.00152.2s
DeepSeek V4 Flash97%$0.00028.1s
Grok 4 Fast99%$0.00086.5s
Gemini 3 Flash (Preview)99%$0.00193.4s
GPT-4.1 Mini98%$0.00117.0s
Mistral Large 398%$0.00117.7s
Gemma 3 12B95%$0.00019.0s
Inception Mercury 295%$0.00172.3s
Grok 4.2098%$0.00204.4s
Qwen 2.5 72B98%$0.000310.9s
Qwen 3.5 Plus (2026-02-15)99%$0.00157.2s
GPT-4o Mini (temp=1)95%$0.00049.5s
Grok 4.395%$0.00214.7s
Stealth: Hunter Alpha98%$0.000019.5s
0.700.800.901.00

Cost vs Performance

Compares total cost for this test against the test score. Quadrant lines are drawn at the median values. Only models with available cost data are shown.

16 low-scoring outliers hidden: DeepSeek V3.1 (89.5%), GPT-4.1 Nano (89.3%), Gemma 3 4B (89.3%), Skyfall 36B V2 (87.3%), Ministral 3 8B (87.0%), Ministral 8B (86.7%), Mistral NeMO (86.6%), Arcee AI: Trinity Mini (85.7%), Nemotron 3 Nano (83.3%), Ministral 3 3B (81.2%), Ministral 3B (80.9%), Cohere Command R+ (Aug. 2024) (73.7%), LFM2 24B (71.7%), Hermes 3 70B (69.5%), Rocinante 12B (66.3%), Claude 3 Haiku (61.1%).

Most Stable Models (Top 20)

Ranked by stability (median × consistency). Click a model name to view its detail page.

ScoreConsistencyStability
Claude Opus 4.6100%99%99%
Gemma 4 31B100%98%98%
Claude Opus 4.5100%98%98%
Gemma 4 31B (Reasoning)100%98%98%
Claude Sonnet 4100%98%98%
Gemini 3 Pro (Preview)100%98%98%
Claude Opus 4.6 (Reasoning)100%98%98%
Claude Sonnet 4.5100%98%98%
Grok 4100%98%98%
Claude Sonnet 4.6 (Reasoning)100%98%98%
Qwen3.6 Max Preview100%98%98%
Z.AI GLM 5.1100%98%98%
Qwen 3.5 27B99%98%98%
Z.AI GLM 599%98%98%
Claude Opus 4.7 (Reasoning)99%98%98%
Gemma 4 26B (Reasoning)99%97%97%
Claude Opus 4.799%97%97%
Grok 4.20 (Reasoning)99%97%97%
Gemini 2.5 Pro99%97%97%
Qwen3.7 Max99%97%97%
90%100%

Top Overall Models (Top 20)

Ranked by composite score (performance, cost, speed & stability). Click a model name to view its detail page.

ScoreCostSpeedStability
Gemini 3.1 Flash Lite (Preview)99%$0.00101.8s96%
Gemini 3.1 Flash Lite99%$0.00102.8s96%
Gemini 3.1 Flash Lite (Reasoning)99%$0.00103.1s96%
Gemini 3 Flash (Preview)99%$0.00193.4s97%
Qwen 3.5 Plus (2026-02-15)99%$0.00157.2s97%
Claude Haiku 4.599%$0.00363.2s97%
Grok 4 Fast99%$0.00086.5s95%
Gemini 2.5 Flash99%$0.00152.2s92%
Gemma 4 26B99%$0.000317.1s97%
GPT-4.1 Mini98%$0.00117.0s94%
Grok 4.2098%$0.00204.4s92%
Stealth: Healer Alpha99%$0.000014.3s94%
DeepSeek V4 Pro99%$0.001321.1s97%
Grok 4.1 Fast99%$0.001012.4s92%
Gemma 4 31B100%$0.000330.2s98%
Mistral Medium 3.197%$0.00135.9s91%
Gemini 2.5 Flash Lite97%$0.00031.7s86%
Claude Sonnet 4.5100%$0.0114.9s98%
Claude Sonnet 4100%$0.0116.1s98%
GPT-4.198%$0.00544.4s92%
60%70%80%90%100%
Specific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric PromptSpecific PromptGeneric Prompt
Model Total ▼Character rename: Elena->Mirabel, Gregor->AldricCharacter rename: Elena->Mirabel, Gregor->AldricLocation rename: market square, outer ring, bridge, northern minesLocation rename: market square, outer ring, bridge, northern minesExpand all contractionsExpand all contractionsTense rewriting: past to presentTense rewriting: past to presentPOV shift: 3rd person to 1st person (Elena's perspective)POV shift: 3rd person to 1st person (Elena's perspective)Multi-character gender swap: Priya(F)->Rohan(M), Mara unchangedMulti-character gender swap: Priya(F)->Rohan(M), Mara unchangedCombined: 3rd person past → 1st person presentCombined: 3rd person past → 1st person presentPassive voice → active voicePassive voice → active voiceAvoid said/asked/replied/answeredAvoid said/asked/replied/answered
Claude Sonnet 4100%100%100%100%100%100%100%100%100%100%100%100%100%100%99%98%97%100%100%
Gemma 4 31B100%100%100%100%100%100%100%100%98%100%100%100%100%100%99%99%98%100%100%
Claude Opus 4.6100%100%100%100%100%100%100%100%99%100%99%100%100%100%99%98%98%100%100%
Claude Sonnet 4.5100%100%100%100%100%100%100%100%100%100%100%100%100%99%99%99%96%100%100%
Claude Opus 4.6 (Reasoning)100%100%100%100%100%100%100%100%100%100%99%100%100%100%99%99%96%100%100%
Gemma 4 31B (Reasoning)100%100%100%100%100%100%99%100%99%100%100%100%100%100%99%99%97%100%100%
Grok 4100%100%100%100%100%100%100%99%100%100%100%100%100%100%99%99%96%100%100%
Gemini 3 Pro (Preview)100%100%100%100%100%100%100%100%99%100%100%100%100%99%99%98%96%100%100%
Claude Sonnet 4.6 (Reasoning)100%100%100%100%100%100%99%100%99%100%100%100%100%100%99%99%96%100%100%
Claude Opus 4.5100%100%100%100%100%100%100%100%99%100%99%100%100%100%99%97%97%100%100%
Qwen3.6 Max Preview100%100%100%100%100%100%100%100%98%100%100%100%100%100%99%98%96%100%100%
Z.AI GLM 5.1100%100%100%100%100%100%100%100%97%100%100%100%100%100%99%99%97%100%100%
Gemma 4 26B (Reasoning)99%100%100%100%100%100%100%100%99%100%100%100%100%99%99%98%95%100%100%
Z.AI GLM 599%100%100%100%100%100%100%100%98%100%100%100%100%100%99%98%96%100%100%
Grok 4.20 (Reasoning)99%100%100%100%100%100%100%99%97%100%100%100%100%100%99%99%97%100%100%
1–15 of 151
Page 1 / 11

Generic Prompt

Character rename: Elena->Mirabel, Gregor->Aldric

Location rename: market square, outer ring, bridge, northern mines

Expand all contractions

Tense rewriting: past to present

POV shift: 3rd person to 1st person (Elena's perspective)

Multi-character gender swap: Priya(F)->Rohan(M), Mara unchanged

Combined: 3rd person past → 1st person present

Passive voice → active voice

Avoid said/asked/replied/answered

Specific Prompt

Character rename: Elena->Mirabel, Gregor->Aldric

Location rename: market square, outer ring, bridge, northern mines

Expand all contractions

Tense rewriting: past to present

POV shift: 3rd person to 1st person (Elena's perspective)

Multi-character gender swap: Priya(F)->Rohan(M), Mara unchanged

Combined: 3rd person past → 1st person present

Passive voice → active voice

Avoid said/asked/replied/answered