anthropic/claude-sonnet-4.6

Claude Sonnet 4.6 (Reasoning)

Release Date

Feb 17th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$43.81

Speed

69.8 tok/s

Categories

20%40%60%80%100%Creative Writing83.1%Tooling100.0%Language97.6%Utility97.9%Reasoning92.8%Text Editing98.3%Rule Following85.7%Hallucination94.0%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
918786858487%
959290888790%
868685848385%
938986848387%
898988888087%
939291908690%
Detailed Writing Rules87.64%
genre
807877747476%
888584817883%
857874717176%
817777746775%
807776767477%
838280797079%
genre77.68%
Novelcrafter Default Prompt
848380787881%
908782827884%
858280777580%
858584818083%
838079777579%
877877757378%
Novelcrafter Default Prompt80.72%
82.01%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989897979797%
989898979797%
989898979798%
1009896969597%
97.39%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
10099999999999999%
100100999999999999%
100100100100100100100100%
100100100100100100100100%
9796969695959496%
10010010010010010099100%
10010010010099979799%
Generic Prompt99.22%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
10099999898989899%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.85%
99.54%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%