mistralai/mistral-small-2603

Mistral Small 4 (Reasoning)

Release Date

Mar 16th, 2026

Context Size

265k

Reasoning

Yes

Benchmark Cost

$1.29

Speed

144.6 tok/s

Categories

20%40%60%80%100%Creative Writing81.7%Tooling99.7%Language60.5%Utility85.6%Reasoning87.8%Text Editing90.6%Rule Following60.3%Hallucination93.0%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
807676757276%
868584827883%
888786858085%
888382828183%
858483838183%
868483827782%
Detailed Writing Rules82.13%
genre
787372717073%
858180797380%
878280797781%
878683827883%
858282817982%
898483837483%
genre80.24%
Novelcrafter Default Prompt
787876737376%
858584797782%
908785838085%
898584827884%
908883837985%
878685827984%
Novelcrafter Default Prompt82.44%
81.60%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
929090898790%
949390909091%
949486868589%
969492888691%
90.28%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9998989896962587%
100100999998989599%
100100100100100969699%
100100999995733786%
9391919086797887%
1001001009985686889%
10099979693939095%
Generic Prompt93.50%
Specific Prompt
10010010010010010097100%
100100100100100100100100%
10099999693898394%
10099999999986995%
100100100100100969699%
100100999999999999%
9393939290894084%
1001001001001001006595%
100100100100100979599%
Specific Prompt96.09%
94.79%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%