anthropic/claude-sonnet-4.6

Claude Sonnet 4.6

Release Date

Feb 17th, 2026

Context Size

1M tokens

Benchmark Cost

$7.62

Speed

44.8 tok/s

Creative writing

79.45%

Rule following

82.45%

Utility

86.62%

Mathematics

50.00%

Tooling

84.79%

Language

100.00%

Logic

83.24%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario                                  #1   #2   #3   #4   #5   Total
Detailed Writing Rules
0-shot  Creative writing, Rule following  89   89   87   86   84   87%
0-shot  Creative writing, Rule following  94   89   87   86   85   88%
0-shot  Creative writing, Rule following  90   88   85   83   82   86%
0-shot  Creative writing, Rule following  93   93   88   88   86   90%
0-shot  Creative writing, Rule following  86   86   84   84   80   84%
0-shot  Creative writing, Rule following  95   94   89   87   84   90%
Detailed Writing Rules: 87.45%
genre
0-shot  Creative writing, Rule following  85   80   78   77   77   79%
0-shot  Creative writing, Rule following  85   82   81   80   78   81%
0-shot  Creative writing, Rule following  82   80   78   77   73   78%
0-shot  Creative writing, Rule following  87   84   83   77   77   82%
0-shot  Creative writing, Rule following  76   75   75   74   70   74%
0-shot  Creative writing, Rule following  90   79   76   75   74   79%
genre: 78.92%
Novelcrafter Default Prompt
0-shot  Creative writing, Rule following  84   84   82   73   73   79%
0-shot  Creative writing, Rule following  87   86   82   81   75   82%
0-shot  Creative writing, Rule following  82   81   80   79   78   80%
0-shot  Creative writing, Rule following  90   88   87   83   78   85%
0-shot  Creative writing, Rule following  78   77   76   75   72   76%
0-shot  Creative writing, Rule following  90   88   85   82   78   84%
Novelcrafter Default Prompt: 81.09%
82.49%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.
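
The page does not show the XML schema this test expects, so the following is only a minimal sketch of how such output could be parsed and sanity-checked. The element and attribute names (`violations`, `violation`, `paragraph`, `substring`), the sample data, and both helper functions are assumptions made for illustration, not the benchmark's actual format or scoring.

```python
import xml.etree.ElementTree as ET

# Hypothetical output format: the real schema is not shown on this page.
SAMPLE_OUTPUT = """
<violations>
  <violation paragraph="2" substring="her green eyes" />
  <violation paragraph="5" substring="the northern gate" />
</violations>
"""


def parse_violations(xml_text):
    """Return (paragraph_number, substring) pairs from a violations document."""
    root = ET.fromstring(xml_text.strip())
    results = []
    for node in root.iter("violation"):
        results.append((int(node.attrib["paragraph"]), node.attrib["substring"]))
    return results


def substrings_present(violations, paragraphs):
    """Check that each reported substring occurs in the paragraph it cites (1-indexed)."""
    return all(
        1 <= number <= len(paragraphs) and text in paragraphs[number - 1]
        for number, text in violations
    )
```

A check like `substrings_present` only confirms that a reported span exists in the cited paragraph. Whether the span actually contradicts the story bible would still need a reference answer to compare against.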

Scenario                                         #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
matrix
0-shot  Tooling, Utility, Logic, Rule following  94   94   92   92   91   91   90   90   90   88   91%
0-shot  Tooling, Utility, Logic, Rule following  100  100  100  100  98   98   98   98   96   94   98%
0-shot  Tooling, Utility, Logic, Rule following  100  97   92   92   92   92   92   92   92   92   93%
0-shot  Tooling, Utility, Logic, Rule following  92   92   92   92   92   92   92   92   92   92   92%
matrix: 93.51%
tiers
0-shot  Tooling, Utility, Logic, Rule following  83   83   83   83   83   83   83   83   83   83   83%
0-shot  Tooling, Utility, Logic, Rule following  86   86   86   86   86   86   86   86   86   86   86%
0-shot  Tooling, Utility, Logic, Rule following  100  100  97   97   94   94   92   89   86   86   93%
0-shot  Tooling, Utility, Logic, Rule following  97   97   97   97   97   97   97   97   88   88   95%
tiers: 89.46%
91.48%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario                                  #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
dialogue-200
0-shot  Creative writing, Rule following  99   99   98   94   90   87   87   79   76   76   88%
0-shot  Creative writing, Rule following  100  99   98   95   94   80   75   59   47   44   79%
0-shot  Creative writing, Rule following  100  93   86   84   83   76   73   73   67   67   80%
dialogue-200: 82.64%
dialogue-500
0-shot  Creative writing, Rule following  93   62   58   55   53   50   50   50   48   45   56%
0-shot  Creative writing, Rule following  89   80   56   56   55   52   49   47   47   41   57%
0-shot  Creative writing, Rule following  72   67   66   65   52   42   41   41   31   5    48%
dialogue-500: 53.83%
Ungrouped
0-shot  Creative writing, Rule following  100  100  100  100  100  100  100  100  61   61   92%
71.65%

Language Comprehension

Does the model understand more than just English?

Scenario          #1   #2   #3   #4   #5   Total
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
100.00%

Language Writing

Can the model generate text in different languages?

Scenario          #1   #2   #3   #4   #5   Total
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
0-shot  Language  100  100  100  100  100  100%
100.00%

N-Length Sentences

Write sentences with exactly N words.
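
How the benchmark counts words is not stated on this page. As a rough sketch, one plausible check splits on whitespace, so hyphenated words and words with attached punctuation each count once. That tokenization rule and both function names are assumptions for illustration only.

```python
def word_count(sentence):
    """Count whitespace-separated tokens (assumed rule: 'well-worn' counts as one word)."""
    return len(sentence.split())


def has_exactly_n_words(sentence, n):
    """Return True when the sentence contains exactly n whitespace-separated tokens."""
    return word_count(sentence) == n
```

A stricter or looser tokenizer (for example, one that splits on hyphens) would change which outputs pass, which is one reason scores on this kind of test can differ between harnesses.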

Scenario                #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  97   74   74   74   74   74   74   74   74   79%
0-shot  Rule following  87   77   37   34   28   23   22   16   1    0    32%
70.47%

Novel outline

Handle questions about the outline of a novel in various formats.

Scenario                  #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
outline-count
0-shot  Tooling, Utility  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Tooling, Utility  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Tooling, Utility  100  100  100  0    0    0    0    0    0    0    30%
outline-count: 76.67%
pov-count
0-shot  Tooling, Utility  100  100  100  100  100  100  100  100  0    0    80%
0-shot  Tooling, Utility  100  100  100  0    0    0    0    0    0    0    30%
0-shot  Tooling, Utility  100  100  100  100  100  100  100  100  100  100  100%
pov-count: 70.00%
73.33%

Tool usage within Novelcrafter

Output messages related to tool usage within Novelcrafter.

Scenario                  #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
0-shot  Tooling, Utility  100  100  100  100  100  100  100  100  100  100  100%
100.00%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario           #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
0-shot    Utility  100  100  100  100  100  100  100  100  100  100  100%
0-shot    Utility  100  100  100  100  100  100  100  100  100  100  100%
1-shot    Utility  100  100  100  100  100  100  100  100  100  100  100%
Few-shot  Utility  100  100  100  100  100  100  100  100  100  100  100%
0-shot    Utility  100  100  100  100  100  100  100  100  100  100  100%
100.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario                #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
paragraphs
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
paragraphs: 100.00%
sentences
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  100  100  100  100  100  100  100  100  100  100%
0-shot  Rule following  100  100  98   98   98   92   92   54   0    0    73%
sentences: 94.66%
words
0-shot  Rule following  100  100  100  100  100  100  98   98   92   92   98%
0-shot  Rule following  100  100  100  100  100  100  100  100  98   98   100%
0-shot  Rule following  100  100  100  98   92   92   77   77   77   2    82%
0-shot  Rule following  100  92   77   77   77   54   54   27   9    9    58%
0-shot  Rule following  0    0    0    0    0    0    0    0    0    0    0%
words: 67.41%
85.41%
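
The page does not say how the final figure for a test is aggregated. For this test, the 85.41% total is consistent with a mean weighted by the number of rows in each group (3 paragraphs rows, 5 sentences rows, 5 words rows) rather than a plain mean of the three group averages, which would come to about 87.36%. The sketch below reproduces that arithmetic. The function name is hypothetical, and the weighting rule is an inference from the numbers, not a documented method.

```python
def row_weighted_mean(groups):
    """Average group percentages, weighting each group by its number of rows.

    groups: list of (group_average_percent, number_of_rows) pairs.
    """
    total_rows = sum(rows for _, rows in groups)
    return sum(avg * rows for avg, rows in groups) / total_rows


# Group averages and row counts as listed in the "Write N of X" table.
write_n_of_x = [(100.00, 3), (94.66, 5), (67.41, 5)]

print(round(row_weighted_mean(write_n_of_x), 2))  # 85.41
```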