anthropic/claude-opus-4.8

Claude Opus 4.8 (Reasoning)

Release Date

May 27th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$39.20

Speed

64.2 tok/s

Categories

20%40%60%80%100%Creative Writing85.3%Tooling99.4%Language96.4%Utility99.3%Reasoning93.5%Text Editing98.8%Rule Following70.3%Hallucination94.8%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
919086868287%
939089868188%
928888858387%
939291908891%
898988848287%
908988878488%
Detailed Writing Rules87.90%
genre
848280797780%
868380777280%
858282797581%
838281817881%
858383787781%
848484787280%
genre80.50%
Novelcrafter Default Prompt
858480797881%
888483817783%
918685848386%
878684838084%
858483817982%
898988878187%
Novelcrafter Default Prompt83.81%
84.07%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
999999999999%
100100100999999%
999999989898%
10010095959597%
98.46%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
1001001001001009999100%
100100100100100100100100%
100100100100100100100100%
10098989898989798%
100100100100100100100100%
9999999999999999%
Generic Prompt99.67%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9998959595959596%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9999999999999899%
100100100100100100100100%
10099999999999999%
Specific Prompt99.39%
99.53%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%