anthropic/claude-opus-4.5

Claude Opus 4.5

Release Date: Nov 24th, 2025
Context Size: 200k
Reasoning: No
Benchmark Cost: $23.48
Speed: 49.4 tok/s

Categories

Creative Writing: 81.7%
Tooling: 100.0%
Language: 99.7%
Utility: 89.8%
Reasoning: 93.9%
Text Editing: 97.7%
Rule Following: 72.6%
Hallucination: 82.1%

Subcategories

(Chart of subcategory scores on a 20% to 100% axis; the values did not survive extraction. Subcategories shown: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption.)

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
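
As a rough illustration of one of the anti-patterns named above, the sketch below measures how often filter words appear in a passage. It is not the benchmark's implementation: the word list `FILTER_WORDS` and the function `filter_word_rate` are hypothetical names chosen for this example, and the real test presumably uses its own lists and weighting.

```python
import re

# Hypothetical word list; the benchmark's actual lists are not given here.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}


def filter_word_rate(text: str) -> float:
    """Return the fraction of words in `text` that are filter words."""
    # Lowercase the text and split it into simple word tokens.
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return hits / len(words)


sample = "She felt the cold and saw the door."
print(f"{filter_word_rate(sample):.2f}")
```

A detector along these lines would need one such check per anti-pattern (passive voice, dialogue tags, clichés, and so on), each with its own threshold.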

Scenario                       #1   #2   #3   #4   #5   Total

Detailed Writing Rules
                               86   85   85   85   74   83%
                               91   88   88   87   83   87%
                               90   90   86   85   83   87%
                               88   87   85   82   81   85%
                               93   86   86   85   84   87%
                               89   88   88   86   84   87%
Detailed Writing Rules average: 85.90%

genre
                               83   75   70   68   66   72%
                               88   87   84   82   80   84%
                               80   79   78   72   72   76%
                               82   80   77   77   75   78%
                               83   81   80   80   74   80%
                               80   80   79   79   75   79%
genre average: 78.12%

Novelcrafter Default Prompt
                               82   81   80   77   74   79%
                               89   86   83   80   79   83%
                               84   84   83   80   78   82%
                               84   81   79   79   76   80%
                               81   80   80   79   77   80%
                               85   84   81   81   79   82%
Novelcrafter Default Prompt average: 80.87%

Overall: 81.63%
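
The 81.63% figure appears consistent with an unweighted mean of the three prompt-group averages reported for this test (85.90%, 78.12%, and 80.87%). The snippet below only checks that arithmetic; whether the benchmark actually aggregates this way is an assumption.

```python
# Group averages as reported for the Bad Writing Habits test.
group_averages = {
    "Detailed Writing Rules": 85.90,
    "genre": 78.12,
    "Novelcrafter Default Prompt": 80.87,
}

# Assumption: the overall score is the unweighted mean of the group averages.
overall = round(sum(group_averages.values()) / len(group_averages), 2)
print(overall)
```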

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
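
To make "well-formed XML" concrete, here is a minimal sketch of how a model's output could be checked and read back. The element and attribute names (`codex`, `entry`, `type`, `name`) are assumptions made for illustration; the schema the benchmark actually expects is not given here.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text: str):
    """Return a list of (type, name) pairs, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed tags, stray characters, etc. make the output unusable.
        return None
    # Hypothetical schema: <entry type="..."><name>...</name></entry>
    return [
        (entry.get("type"), (entry.findtext("name") or "").strip())
        for entry in root.iter("entry")
    ]


good = """<codex>
  <entry type="character"><name>Mira</name></entry>
  <entry type="location"><name>Harrow Keep</name></entry>
</codex>"""
print(parse_codex(good))
print(parse_codex("<codex><entry>"))
```

A parse failure is treated as a hard error here because downstream tooling cannot consume malformed XML, however accurate the extracted content may be.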

Scenario    #1    #2    #3    #4    #5    Total
            99    98    98    98    98    98%
            100   99    98    98    98    99%
            99    99    99    99    97    98%
            100   100   100   99    99    99%

Overall: 98.56%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
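
Scoring each expected change independently could look roughly like the sketch below: every required string and every forbidden string is its own pass/fail check, and the score is the share of checks that pass. The function name `score_replacement` and its parameters are hypothetical, and plain substring matching is a simplification of whatever the benchmark really does.

```python
def score_replacement(output: str, must_contain: list[str], must_not_contain: list[str]) -> float:
    """Return the percentage of independent checks that pass."""
    checks = [s in output for s in must_contain]
    checks += [s not in output for s in must_not_contain]
    if not checks:
        return 100.0
    return 100.0 * sum(checks) / len(checks)


# Example task: rename Anna to Elena and expand contractions.
output = "Elena walked to Riverton. She didn't stop."
print(score_replacement(output, ["Elena", "Riverton", "did not"], ["Anna", "didn't"]))
```

Checking changes one by one gives partial credit: in the example, the rename succeeds but the contraction was left in place, so three of five checks pass.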

Scenario           #1    #2    #3    #4    #5    #6    #7    Total

Generic Prompt
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   99    99    99    99    99    99    99    99%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   98    98    98    98    98    97    97    97%
                   99    99    99    99    99    99    99    99%
                   100   99    99    99    99    99    99    99%
Generic Prompt average: 99.38%

Specific Prompt
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   98    98    98    97    97    97    97    97%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
Specific Prompt average: 99.69%

Overall: 99.54%

Tool usage within Novelcrafter

Evaluates the output messages a model produces for tool usage within Novelcrafter.

Scenario    #1    #2    #3    #4    #5    #6    #7    #8    #9    #10   Total
            100   100   100   100   100   100   100   100   100   100   100%

Overall: 100.00%