anthropic/claude-sonnet-4.5

Claude Sonnet 4.5

Release Date

Sep 29th, 2025

Context Size

1M tokens

Reasoning

No

Benchmark Cost

$13.53

Speed

56.5 tok/s

Categories

Creative Writing: 84.2%
Tooling: 100.0%
Language: 92.4%
Utility: 83.8%
Reasoning: 92.5%
Text Editing: 99.0%
Rule Following: 76.8%
Hallucination: 75.6%

Subcategories

AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario   #1   #2   #3   #4   #5   Total

Detailed Writing Rules
           91   88   87   87   82   87%
           92   92   89   89   83   89%
           90   90   86   83   82   86%
           94   91   89   88   88   90%
           94   90   88   87   83   88%
           94   90   89   89   87   90%
Detailed Writing Rules: 88.39%

genre
           83   81   79   75   68   77%
           85   80   79   77   73   79%
           86   82   82   78   77   81%
           87   85   84   84   82   85%
           89   82   80   78   72   80%
           91   87   84   84   81   85%
genre: 81.16%

Novelcrafter Default Prompt
           78   75   74   72   72   74%
           90   89   85   82   82   86%
           88   85   84   82   79   84%
           89   82   81   81   81   83%
           92   83   83   80   79   83%
           87   86   84   83   83   85%
Novelcrafter Default Prompt: 82.41%

Overall: 83.99%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
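As a rough illustration of what "well-formed XML" means for this kind of output, the sketch below parses a candidate response with Python's standard `xml.etree.ElementTree` and rejects anything that fails to parse. The tag and attribute names (`codex`, `entry`, `type`, `name`) and the helper `parse_codex` are hypothetical and chosen only for this example; the benchmark's actual schema and scoring are not described here.

```python
import xml.etree.ElementTree as ET
from typing import Optional


def parse_codex(xml_text: str) -> Optional[list]:
    """Return the extracted entries, or None if the XML is not well-formed.

    The <codex>/<entry>/<name> layout is an assumed schema for illustration.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed tags, stray text outside the root, etc. end up here.
        return None
    return [
        {"type": entry.get("type", ""), "name": entry.findtext("name", default="")}
        for entry in root.iter("entry")
    ]


sample = """<codex>
  <entry type="character"><name>Mara</name></entry>
  <entry type="location"><name>Eastport</name></entry>
</codex>"""

entries = parse_codex(sample)           # two entries
broken = parse_codex("<codex><entry>")  # None: tags are never closed
```

A real grader would presumably also compare the extracted entries against a reference set; this sketch only covers the well-formedness step.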

Scenario   #1   #2   #3   #4   #5   Total
           96   95   95   95   95   95%
           97   97   96   96   96   96%
           98   98   97   97   96   97%
          100  100  100   99   99   99%

Overall: 96.97%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
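One way to picture "checking each expected change independently" is a scorer that treats every required substitution as its own pass/fail check and reports the percentage that passed. The sketch below is an assumption about the general approach, not the benchmark's actual grader; the function `score_changes` and its parameters are hypothetical.

```python
def score_changes(output: str, must_contain: list, must_not_contain: list) -> float:
    """Score an output as the percentage of independent checks that pass.

    Each string in must_contain has to appear in the output; each string in
    must_not_contain has to be absent. Every check counts equally.
    """
    checks = [text in output for text in must_contain]
    checks += [text not in output for text in must_not_contain]
    if not checks:
        return 100.0  # nothing to verify
    return 100.0 * sum(checks) / len(checks)


# Hypothetical task: rename Anna -> Mara and expand contractions.
output = "Mara walked to Eastport. She did not look back."
score = score_changes(
    output,
    must_contain=["Mara", "Eastport", "did not"],
    must_not_contain=["Anna", "didn't"],
)
```

Because each check is scored on its own, a response that applies most changes but misses one still earns partial credit, which would be consistent with scores such as 96% or 99% in the table.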

Scenario   #1   #2   #3   #4   #5   #6   #7   Total

Generic Prompt
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
           99   99   99   99   99   99   99    99%
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
           98   97   96   96   96   96   95    96%
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100   99   100%
Generic Prompt: 99.46%

Specific Prompt
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
          100   99   99   99   99   99   99    99%
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
          100   99   99   99   98   98   97    99%
          100  100  100  100  100  100  100   100%
          100  100  100  100  100  100  100   100%
Specific Prompt: 99.77%

Overall: 99.62%

Tool usage within Novelcrafter

Evaluates the model's output messages related to tool usage within Novelcrafter.

Scenario   #1   #2   #3   #4   #5   #6   #7   #8   #9   #10   Total
          100  100  100  100  100  100  100  100  100  100    100%

Overall: 100.00%