mistral-large-2402

Mistral Large
via mistral

Release Date

Feb 26th, 2024

Context Size

32k

Reasoning

No

Benchmark Cost

$11.97

Speed

4413.5 tok/s

Categories

20%40%60%80%100%Creative Writing82.0%Tooling98.7%Language88.6%Utility73.0%Reasoning76.3%Text Editing95.1%Rule Following49.9%Hallucination77.5%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
828076767578%
888887868086%
909088847986%
938887828086%
878383827983%
848282807480%
Detailed Writing Rules83.13%
genre
807776767376%
858078787780%
908985847985%
878080767580%
878483827582%
817978777578%
genre80.22%
Novelcrafter Default Prompt
818078777177%
848180787580%
838279777779%
908683818184%
918783818185%
858280807981%
Novelcrafter Default Prompt81.01%
81.45%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
97959493076%
979794949395%
919191908990%
969695959596%
89.27%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9898989898989898%
100100100100100100100100%
100100100100100100100100%
1001001007474747485%
9292919191919191%
10010010010010010099100%
9999999999999999%
Generic Prompt97.00%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
1001009999999999100%
9696969695959495%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.37%
98.18%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%