anthropic/claude-opus-4.6

Claude Opus 4.6 (Reasoning)

Release Date

Feb 4th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$44.01

Speed

54.5 tok/s

Categories

20%40%60%80%100%Creative Writing84.5%Tooling100.0%Language96.1%Utility98.9%Reasoning93.8%Text Editing98.9%Rule Following89.8%Hallucination98.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
878584848385%
908988878688%
908784838185%
949492898992%
919189898589%
939389888890%
Detailed Writing Rules88.04%
genre
828078767478%
868383828283%
818078787478%
848481797681%
828079757578%
858281797881%
genre79.94%
Novelcrafter Default Prompt
888685847984%
878582797782%
858380787781%
898685848486%
888786828085%
838383828082%
Novelcrafter Default Prompt83.25%
83.74%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
999898989798%
1009999999799%
999999999999%
999999999999%
98.48%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9897979696959596%
9999999999999999%
1001001001001009999100%
Generic Prompt99.34%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100999998989899%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.86%
99.60%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%