anthropic/claude-3-haiku

Claude 3 Haiku

Release Date

Mar 13th, 2024

Context Size

200k

Reasoning

No

Benchmark Cost

$1.27

Speed

1382.0 tok/s

Categories

20%40%60%80%100%Creative Writing74.5%Tooling99.5%Language72.8%Utility68.5%Reasoning77.9%Text Editing64.4%Rule Following51.2%Hallucination60.8%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
787574747375%
817971666472%
907878777780%
807979747377%
848377767679%
858377777479%
Detailed Writing Rules77.11%
genre
787775726974%
807875737175%
767573727173%
857775737276%
807573717174%
847974736876%
genre74.82%
Novelcrafter Default Prompt
777272706872%
777672717073%
737271666469%
838080757579%
777471686771%
847675757577%
Novelcrafter Default Prompt73.61%
75.18%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
929090908990%
949290898991%
939187858288%
949289878188%
89.24%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100645048222258%
1001000000029%
93920000026%
98980000028%
00000000%
8686868686867885%
1818181818181818%
00000000%
00000000%
Generic Prompt27.11%
Specific Prompt
10010010010075695385%
1001001001001001003390%
9494939393939394%
1001001001001009999100%
100100100100100100100100%
100100100100100100100100%
9391898988888689%
9797979797979797%
100100100100100100100100%
Specific Prompt95.02%
61.07%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%