anthropic/claude-opus-4.6

Claude Opus 4.6

Release Date

Feb 4th, 2026

Context Size

1m

Reasoning

No

Benchmark Cost

$25.91

Speed

48.1 tok/s

Categories

20%40%60%80%100%Creative Writing83.6%Tooling100.0%Language96.1%Utility90.7%Reasoning93.3%Text Editing98.4%Rule Following83.1%Hallucination93.6%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
898886858486%
968887827786%
888786828185%
918988858487%
898887858487%
949089888689%
Detailed Writing Rules86.69%
genre
828077747077%
858483827982%
838380806979%
818078767478%
858381807781%
818079757277%
genre79.04%
Novelcrafter Default Prompt
858181807681%
888483837983%
878180777780%
908585838285%
898685787683%
898685837885%
Novelcrafter Default Prompt82.79%
82.84%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989898989898%
10010099999999%
999896969697%
999999999999%
98.29%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9898989898979798%
9999999999999999%
9999999999999999%
Generic Prompt99.47%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9898989898989898%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.83%
99.65%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%