mistralai/mistral-medium-3.1

Mistral Medium 3.1

Release Date

Aug 13th, 2025

Context Size

131k

Reasoning

No

Benchmark Cost

$1.53

Speed

64.8 tok/s

Categories

20%40%60%80%100%Creative Writing81.7%Tooling97.5%Language49.5%Utility80.1%Reasoning89.3%Text Editing93.8%Rule Following48.6%Hallucination82.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
787772696873%
848483817982%
928888878187%
868080797881%
868383838384%
888785848486%
Detailed Writing Rules82.00%
genre
767676756874%
848379777680%
888383828183%
828181777479%
838281807781%
848180797780%
genre79.56%
Novelcrafter Default Prompt
807977747376%
858584847883%
878686838285%
868585848084%
848484827682%
858483817682%
Novelcrafter Default Prompt81.95%
81.17%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979797979697%
949493939393%
979696969696%
1009999969698%
96.02%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9797969696969696%
9898989898989898%
9696969696969696%
10010010010010010099100%
8483837979787881%
9696959595959595%
9696969695959596%
Generic Prompt95.76%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
10010010010097979799%
9494949494939294%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.07%
97.41%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
1001001001001001001001001006797%
96.67%