anthropic/claude-sonnet-5

Claude Sonnet 5

Release Date

Jun 30th, 2026

Context Size

1m

Reasoning

No

Benchmark Cost

$10.88

Speed

56.4 tok/s

Categories

20%40%60%80%100%Creative Writing82.6%Tooling96.0%Language95.5%Utility88.6%Reasoning75.3%Text Editing97.6%Rule Following74.4%Hallucination88.7%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
848282828182%
929288858488%
868686838285%
959187868288%
858180777579%
949289888289%
Detailed Writing Rules85.28%
genre
837876706775%
908583807983%
767673717073%
838383818182%
757474737374%
878680797682%
genre78.20%
Novelcrafter Default Prompt
817878777678%
908282817582%
848483797681%
928886858587%
828180807480%
898685858185%
Novelcrafter Default Prompt82.23%
81.90%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989797979597%
989797979797%
999999989899%
999898949497%
97.33%

Relationship tree

Extracts a deterministic XML family and relationship tree from cumulative literary prose.

Scenario #1 #2 #3 #4 #5 Total
848280807680%
787474736973%
76.87%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
10010010010096969698%
100100100100100100100100%
9898979694949396%
100100100100100100100100%
9999999999999999%
Generic Prompt99.05%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999998989899%
10010010010010010099100%
100100100100100100100100%
100100100100100100100100%
9898989898979798%
100100100100100100100100%
100100100100999999100%
Specific Prompt99.56%
99.31%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%