anthropic/claude-sonnet-5

Claude Sonnet 5 (Reasoning)

Release Date

Jun 30th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$19.31

Speed

66.8 tok/s

Categories

20%40%60%80%100%Creative Writing81.3%Tooling99.2%Language99.0%Utility92.9%Reasoning87.8%Text Editing97.9%Rule Following68.3%Hallucination96.8%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
888786847884%
898886858386%
878683818184%
949187878589%
948684838286%
919088858087%
Detailed Writing Rules85.90%
genre
818076767277%
797974726874%
787676746574%
787877746875%
777674737074%
868482807481%
genre75.87%
Novelcrafter Default Prompt
868481787781%
878677757280%
838378787580%
888482828183%
807979777678%
858381807781%
Novelcrafter Default Prompt80.47%
80.75%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989897969697%
999897979597%
989898979497%
1009999989899%
97.51%

Relationship tree

Extracts a deterministic XML family and relationship tree from cumulative literary prose.

Scenario #1 #2 #3 #4 #5 Total
989895939095%
888685848385%
90.08%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9896959595949495%
100100100100100100100100%
9999999999999999%
Generic Prompt99.21%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999998989899%
10010010010010010099100%
100100100100100100100100%
100100100100100100100100%
9998989898989898%
100100100100100100100100%
10099999999999999%
Specific Prompt99.59%
99.40%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%