mistralai/mistral-medium-3.1

Mistral Medium 3.1

Release Date

Aug 13th, 2025

Context Size

131k

Benchmark Cost

$1.06

Speed

54.2 tok/s

Creative writing

70.17%

Rule following

78.10%

Utility

71.82%

Mathematics

100.00%

Tooling

75.89%

Language

56.48%

Logic

83.86%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
0-shot (Creative writing, Rule following) 78 77 72 69 68 73%
0-shot (Creative writing, Rule following) 84 84 83 81 79 82%
0-shot (Creative writing, Rule following) 93 88 88 87 81 87%
0-shot (Creative writing, Rule following) 86 80 80 79 78 81%
0-shot (Creative writing, Rule following) 86 84 83 83 83 84%
0-shot (Creative writing, Rule following) 88 87 85 84 84 86%
Detailed Writing Rules: 82.03%
genre
0-shot (Creative writing, Rule following) 76 76 76 75 68 74%
0-shot (Creative writing, Rule following) 84 83 79 77 76 80%
0-shot (Creative writing, Rule following) 88 83 83 83 82 84%
0-shot (Creative writing, Rule following) 82 81 81 77 76 79%
0-shot (Creative writing, Rule following) 83 82 81 80 77 81%
0-shot (Creative writing, Rule following) 84 81 81 79 77 80%
genre: 79.74%
Novelcrafter Default Prompt
0-shot (Creative writing, Rule following) 80 79 77 76 73 77%
0-shot (Creative writing, Rule following) 85 85 84 84 78 83%
0-shot (Creative writing, Rule following) 87 86 86 83 82 85%
0-shot (Creative writing, Rule following) 86 85 85 84 80 84%
0-shot (Creative writing, Rule following) 84 84 84 82 76 82%
0-shot (Creative writing, Rule following) 86 84 83 81 78 82%
Novelcrafter Default Prompt: 82.13%
Overall: 81.30%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
matrix
0-shot (Tooling, Utility, Logic, Rule following) 81 81 79 78 78 77 76 75 73 67 77%
0-shot (Tooling, Utility, Logic, Rule following) 90 87 87 87 87 87 87 83 82 80 86%
0-shot (Tooling, Utility, Logic, Rule following) 88 88 88 88 88 88 88 83 83 3 79%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 92 92 92 92 97%
matrix: 84.42%
tiers
0-shot (Tooling, Utility, Logic, Rule following) 100 83 83 83 83 83 83 83 76 76 83%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 94 94 99%
0-shot (Tooling, Utility, Logic, Rule following) 96 96 92 92 92 92 92 92 89 88 92%
0-shot (Tooling, Utility, Logic, Rule following) 92 91 83 83 83 76 75 75 68 67 79%
tiers: 88.51%
Overall: 86.47%
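
The benchmark description says the model must return structured XML naming each violation by paragraph number and substring. The page does not show the schema, so the following is only a sketch under assumptions: the `violations` and `violation` element names and the `paragraph` attribute are hypothetical, chosen for illustration. It shows how such output could be parsed and checked against the prose.

```python
import xml.etree.ElementTree as ET


def parse_violations(xml_text):
    """Return (paragraph_number, substring) pairs from a hypothetical schema."""
    root = ET.fromstring(xml_text)
    found = []
    for node in root.iter("violation"):  # element name is an assumption
        paragraph = int(node.get("paragraph"))  # attribute name is an assumption
        substring = (node.text or "").strip()
        found.append((paragraph, substring))
    return found


def grounded(found, paragraphs):
    """Keep only violations whose substring occurs in the cited paragraph (1-based)."""
    return [
        (p, s)
        for p, s in found
        if 1 <= p <= len(paragraphs) and s in paragraphs[p - 1]
    ]


# Made-up sample data, not taken from the benchmark.
paragraphs = [
    "Mara tied back her red hair.",
    "Her blond hair caught the light.",
]
model_output = """
<violations>
  <violation paragraph="2">blond hair</violation>
  <violation paragraph="1">green eyes</violation>
</violations>
"""

found = parse_violations(model_output)
print(grounded(found, paragraphs))
```

Checking that the reported substring really appears in the cited paragraph is one plausible way to reject hallucinated violations; whether the benchmark scores this way is not stated on the page.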

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
dialogue-200
0-shot (Creative writing, Rule following) 48 38 34 18 2 1 0 0 0 0 14%
0-shot (Creative writing, Rule following) 83 64 57 56 51 50 50 50 50 1 51%
0-shot (Creative writing, Rule following) 91 87 80 54 51 49 44 30 25 5 52%
dialogue-200: 38.89%
dialogue-500
0-shot (Creative writing, Rule following) 58 50 50 41 2 0 0 0 0 0 20%
0-shot (Creative writing, Rule following) 50 50 24 7 0 0 0 0 0 0 13%
0-shot (Creative writing, Rule following) 85 82 51 50 46 45 25 19 7 0 41%
dialogue-500: 24.73%
Ungrouped
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
Overall: 41.55%
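
The 41.55% figure is not the plain average of the three group scores (that would be about 54.5%). It is consistent with weighting each group by its number of scenario rows (three, three, and one). The sketch below reproduces the figure under that assumption; the aggregation rule is inferred from the numbers on this page, not documented by it, and the function name is illustrative.

```python
# Group score (percent) and number of scenario rows, as listed for Dialogue tags.
groups = {
    "dialogue-200": (38.89, 3),
    "dialogue-500": (24.73, 3),
    "Ungrouped": (100.00, 1),
}


def section_total(groups):
    """Row-weighted mean of group scores (assumed aggregation rule)."""
    total_rows = sum(rows for _, rows in groups.values())
    weighted = sum(score * rows for score, rows in groups.values())
    return weighted / total_rows


print(round(section_total(groups), 2))  # 41.55
```

The same rule also matches the Write N of X section, where groups of three, five, and five rows scoring 100.00%, 97.23%, and 70.59% give 87.62%.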

Language Comprehension

Does the model understand more than just English?

Scenario #1 #2 #3 #4 #5 Total
0-shot (Language) 0 0 0 0 0 0%
0-shot (Language) 0 0 0 0 0 0%
0-shot (Language) 100 100 100 100 100 100%
0-shot (Language) 100 100 100 100 100 100%
Overall: 50.00%

Language Writing

Can the model generate text in different languages?

Scenario #1 #2 #3 #4 #5 Total
0-shot (Language) 100 91 67 50 50 72%
0-shot (Language) 100 100 50 50 0 60%
0-shot (Language) 100 100 86 67 0 70%
0-shot (Language) 91 50 50 0 0 38%
0-shot (Language) 100 91 50 50 50 68%
Overall: 61.67%

N-Length Sentences

Write sentences with exactly N words.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot (Rule following) 100 100 100 100 100 100 98 98 97 97 99%
0-shot (Rule following) 95 92 90 85 79 79 78 77 72 68 82%
0-shot (Rule following) 76 68 67 65 64 64 63 58 53 34 61%
Overall: 80.55%
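
A task like "write sentences with exactly N words" could be checked roughly as follows. This is a minimal sketch, not the benchmark's actual scorer: the sentence splitter, the whitespace word count, and the function names are all assumptions made for illustration.

```python
import re


def split_sentences(text):
    # Naive split after ., ! or ? followed by whitespace (an assumption;
    # a real scorer may handle abbreviations and quotes differently).
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def fraction_with_n_words(text, n):
    """Share of sentences whose whitespace-separated word count equals n."""
    sentences = split_sentences(text)
    if not sentences:
        return 0.0
    hits = sum(1 for s in sentences if len(s.split()) == n)
    return hits / len(sentences)


# Made-up sample: sentences of four, five, and two words.
sample = "The cat sat down. Rain fell on the roof. She smiled."
print(fraction_with_n_words(sample, 4))
```

A partial-credit score of this kind would explain per-run values between 0 and 100 in the table, though the page does not say how runs are graded.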

Novel outline

Handle questions about the outline of a novel in various formats.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
outline-count
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 0 0 0 0 0 0 0 0 0 10%
outline-count: 70.00%
pov-count
0-shot (Tooling, Utility) 50 50 0 0 0 0 0 0 0 0 10%
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 100 100 0 0 0 0 0 0 0 30%
pov-count: 46.67%
Overall: 58.33%

Tool usage within Novelcrafter

Output messages related to tool usage within Novelcrafter.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 67 97%
Overall: 96.67%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot (Utility) 0 0 0 0 0 0 0 0 0 0 0%
0-shot (Utility) 0 0 0 0 0 0 0 0 0 0 0%
1-shot (Utility) 100 100 0 0 0 0 0 0 0 0 20%
Few-shot (Utility) 100 100 100 100 100 100 100 100 100 0 90%
0-shot (Utility) 0 0 0 0 0 0 0 0 0 0 0%
Overall: 22.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
paragraphs
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
paragraphs: 100.00%
sentences
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 98 98 98 99%
0-shot (Rule following) 100 100 100 100 100 98 98 92 77 0 87%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
sentences: 97.23%
words
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 98 100%
0-shot (Rule following) 100 100 100 100 100 100 100 98 98 92 99%
0-shot (Rule following) 100 100 100 98 92 92 77 77 77 54 87%
0-shot (Rule following) 98 98 77 77 27 2 0 0 0 0 38%
0-shot (Rule following) 92 92 54 27 27 2 0 0 0 0 29%
words: 70.59%
Overall: 87.62%