Mistral Medium 3.1

mistralai/mistral-medium-3.1

Mistral Medium 3.1

via OpenRouter

Release Date

Aug 13th, 2025

Context Size

131k

Benchmark Cost

$1.06

Speed

54.2 tok/s

Creative writing

70.17%

Rule following

78.10%

Utility

71.82%

Mathematics

100.00%

Tooling

75.89%

Language

56.48%

Logic

83.86%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario	#1	#2	#3	#4	#5	Total
Detailed Writing Rules
Fantasy: entering an ancient ruin 0-shot Creative writingRule following	78	77	72	69	68	73%
Horror: alone in an eerie place at night 0-shot Creative writingRule following	84	84	83	81	79	82%
Literary fiction: old friends reunite 0-shot Creative writingRule following	93	88	88	87	81	87%
Mystery: examining a crime scene 0-shot Creative writingRule following	86	80	80	79	78	81%
Romance: separated couple reunites 0-shot Creative writingRule following	86	84	83	83	83	84%
Thriller: chase through city streets 0-shot Creative writingRule following	88	87	85	84	84	86%
Detailed Writing Rules						82.03%
genre
Fantasy: entering an ancient ruin 0-shot Creative writingRule following	76	76	76	75	68	74%
Horror: alone in an eerie place at night 0-shot Creative writingRule following	84	83	79	77	76	80%
Literary fiction: old friends reunite 0-shot Creative writingRule following	88	83	83	83	82	84%
Mystery: examining a crime scene 0-shot Creative writingRule following	82	81	81	77	76	79%
Romance: separated couple reunites 0-shot Creative writingRule following	83	82	81	80	77	81%
Thriller: chase through city streets 0-shot Creative writingRule following	84	81	81	79	77	80%
genre						79.74%
Novelcrafter Default Prompt
Fantasy: entering an ancient ruin 0-shot Creative writingRule following	80	79	77	76	73	77%
Horror: alone in an eerie place at night 0-shot Creative writingRule following	85	85	84	84	78	83%
Literary fiction: old friends reunite 0-shot Creative writingRule following	87	86	86	83	82	85%
Mystery: examining a crime scene 0-shot Creative writingRule following	86	85	85	84	80	84%
Romance: separated couple reunites 0-shot Creative writingRule following	84	84	84	82	76	82%
Thriller: chase through city streets 0-shot Creative writingRule following	86	84	83	81	78	82%
Novelcrafter Default Prompt						82.13%
81.30%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
matrix
Large codex (40 entries), long passage (1,019 words) 0-shot ToolingUtilityLogicRule following	81	81	79	78	78	77	76	75	73	67	77%
Large codex (40 entries), short passage (165 words) 0-shot ToolingUtilityLogicRule following	90	87	87	87	87	87	87	83	82	80	86%
Small codex (7 entries), long passage (734 words) 0-shot ToolingUtilityLogicRule following	88	88	88	88	88	88	88	83	83	3	79%
Small codex (7 entries), short passage (165 words) 0-shot ToolingUtilityLogicRule following	100	100	100	100	100	100	92	92	92	92	97%
matrix											84.42%
tiers
5 codex entries 0-shot ToolingUtilityLogicRule following	100	83	83	83	83	83	83	83	76	76	83%
10 codex entries 0-shot ToolingUtilityLogicRule following	100	100	100	100	100	100	100	100	94	94	99%
20 codex entries 0-shot ToolingUtilityLogicRule following	96	96	92	92	92	92	92	92	89	88	92%
40 codex entries 0-shot ToolingUtilityLogicRule following	92	91	83	83	83	76	75	75	68	67	79%
tiers											88.51%
86.47%

Data extraction

Extract key details from a given block of text.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
All valid emails 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Contextual pronoun 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Fruits excluding citrus 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Future event time 0-shot UtilityLogic	50	50	50	50	50	50	50	50	50	50	50%
Guess the pet 0-shot UtilityLogic	100	100	100	100	100	100	100	100	100	100	100%
Highest-rated movie 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Indirect birth year 0-shot UtilityMathematicsLogic	100	100	100	100	100	100	100	100	100	100	100%
What instrument does Lucy play? 0-shot UtilityLogic	100	100	100	100	100	100	100	100	100	100	100%
What's the color of the car? 0-shot UtilityLogic	100	100	100	100	100	100	100	100	100	100	100%
What's the correct time? 0-shot UtilityLogic	0	0	0	0	0	0	0	0	0	0	0%
Who's the sister? 0-shot UtilityLogic	100	100	100	100	100	100	100	100	100	100	100%
Who's the tallest? 0-shot UtilityLogic	100	100	100	100	100	100	100	100	100	100	100%
87.50%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
dialogue-200
Write 200 words with 10% dialogue 0-shot Creative writingRule following	48	38	34	18	2	1	0	0	0	0	14%
Write 200 words with 50% dialogue 0-shot Creative writingRule following	83	64	57	56	51	50	50	50	50	1	51%
Write 200 words with 90% dialogue 0-shot Creative writingRule following	91	87	80	54	51	49	44	30	25	5	52%
dialogue-200											38.89%
dialogue-500
Write 500 words with 30% dialogue 0-shot Creative writingRule following	58	50	50	41	2	0	0	0	0	0	20%
Write 500 words with 50% dialogue 0-shot Creative writingRule following	50	50	24	7	0	0	0	0	0	0	13%
Write 500 words with 70% dialogue 0-shot Creative writingRule following	85	82	51	50	46	45	25	19	7	0	41%
dialogue-500											24.73%
Ungrouped
Write unattributed dialogue 0-shot Creative writingRule following	100	100	100	100	100	100	100	100	100	100	100%
41.55%

Language Comprehension

Does the model understand more than just English?

Scenario	#1	#2	#3	#4	#5	Total
Asking for directions (Dutch) 0-shot Language	0	0	0	0	0	0%
Asking for directions (German) 0-shot Language	0	0	0	0	0	0%
Friend got new kittens (German) 0-shot Language	100	100	100	100	100	100%
Friend got new kittens (Tagalog) 0-shot Language	100	100	100	100	100	100%
50.00%

Language Writing

Can the model generate text in different languages?

Scenario	#1	#2	#3	#4	#5	Total
Character dialogue (French) in a story 0-shot Language	100	91	67	50	50	72%
Character dialogue (German) in a story 0-shot Language	100	100	50	50	0	60%
Character dialogue (Hindi) in a story 0-shot Language	100	100	86	67	0	70%
Character dialogue (Italian) in a story 0-shot Language	91	50	50	0	0	38%
Character dialogue (Spanish) in a story 0-shot Language	100	91	50	50	50	68%
61.67%

N-Length Sentences

Write sentences with exactly N words

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
Write sentences with 5 words each 0-shot Rule following	100	100	100	100	100	100	98	98	97	97	99%
Write sentences with 10 words each 0-shot Rule following	95	92	90	85	79	79	78	77	72	68	82%
Write sentences with 20 words each 0-shot Rule following	76	68	67	65	64	64	63	58	53	34	61%
80.55%

Novel outline

Handle questions about the outline of a novel in various formats

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
outline-count
Count acts 0-shot ToolingUtility	100	100	100	100	100	100	100	100	100	100	100%
Count chapters 0-shot ToolingUtility	100	100	100	100	100	100	100	100	100	100	100%
Count scenes 0-shot ToolingUtility	100	0	0	0	0	0	0	0	0	0	10%
outline-count											70.00%
pov-count
Count point of views for Jack and Olivia 0-shot ToolingUtility	50	50	0	0	0	0	0	0	0	0	10%
Count point of views for Jack Harper 0-shot ToolingUtility	100	100	100	100	100	100	100	100	100	100	100%
Count point of views for Olivia 0-shot ToolingUtility	100	100	100	0	0	0	0	0	0	0	30%
pov-count											46.67%
58.33%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
Create alternate prose sections 0-shot ToolingUtility	100	100	100	100	100	100	100	100	100	67	97%
96.67%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
Multiple speakers 0-shot Utility	0	0	0	0	0	0	0	0	0	0	0%
Simple 0-shot Utility	0	0	0	0	0	0	0	0	0	0	0%
Simple (1-shot) 1-shot Utility	100	100	0	0	0	0	0	0	0	0	20%
Simple (5-shot) Few-shot Utility	100	100	100	100	100	100	100	100	100	0	90%
Unattributed dialogue 0-shot Utility	0	0	0	0	0	0	0	0	0	0	0%
22.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
paragraphs
1 paragraph summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
3 paragraph summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
5 paragraph summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
paragraphs											100.00%
sentences
1 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
3 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
10 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	98	98	98	99%
20 sentence summary 0-shot Rule following	100	100	100	100	100	98	98	92	77	0	87%
50 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
sentences											97.23%
words
10 word summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	98	100%
20 word summary 0-shot Rule following	100	100	100	100	100	100	100	98	98	92	99%
50 word summary 0-shot Rule following	100	100	100	98	92	92	77	77	77	54	87%
100 word summary 0-shot Rule following	98	98	77	77	27	2	0	0	0	0	38%
200 word summary 0-shot Rule following	92	92	54	27	27	2	0	0	0	0	29%
words											70.59%
87.62%