anthropic/claude-sonnet-5

Claude Sonnet 5 (Reasoning, Low)

Release Date

Jun 30th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$19.49

Speed

67.0 tok/s

Categories

20%40%60%80%100%Creative Writing81.4%Tooling99.5%Language98.1%Utility93.5%Reasoning87.8%Text Editing97.9%Rule Following66.6%Hallucination96.4%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
908685848486%
919189898789%
868685838384%
949390908891%
898784848386%
938787858086%
Detailed Writing Rules87.09%
genre
838073736675%
888179756678%
807975757276%
858281807581%
838075747377%
847979767579%
genre77.45%
Novelcrafter Default Prompt
797876747376%
878382807982%
787775747375%
878583807782%
818079777679%
848383757379%
Novelcrafter Default Prompt79.05%
81.20%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979797969696%
979797979697%
999998989898%
1009999989899%
97.58%

Relationship tree

Extracts a deterministic XML family and relationship tree from cumulative literary prose.

Scenario #1 #2 #3 #4 #5 Total
989893888592%
928786858186%
89.36%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999899%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9898969695959496%
10010010010010010099100%
9999999999999999%
Generic Prompt99.27%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999898989899%
10010010010010010099100%
100100100100100100100100%
100100100100100100100100%
9898989898989898%
100100100100100100100100%
10010010010099999899%
Specific Prompt99.56%
99.41%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%