mistralai/ministral-8b-2512

Ministral 3 8B

Release Date: Dec 2nd, 2025
Context Size: 262k
Benchmark Cost: $0.20
Speed: 144.7 tok/s

Creative writing: 60.74%
Rule following: 63.75%
Utility: 64.75%
Mathematics: 0.00%
Tooling: 68.13%
Language: 63.34%
Logic: 60.74%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario                                      #1   #2   #3   #4   #5  Total

Detailed Writing Rules
0-shot (Creative writing, Rule following)     77   77   70   68   68    72%
0-shot (Creative writing, Rule following)     82   80   75   75   65    75%
0-shot (Creative writing, Rule following)     88   84   82   77   68    80%
0-shot (Creative writing, Rule following)     80   77   76   75   75    77%
0-shot (Creative writing, Rule following)     80   79   72   71   69    74%
0-shot (Creative writing, Rule following)     83   80   79   74   72    78%
Detailed Writing Rules: 76.03%

genre
0-shot (Creative writing, Rule following)     74   72   69   69   67    70%
0-shot (Creative writing, Rule following)     87   80   76   76   71    78%
0-shot (Creative writing, Rule following)     87   84   78   77   76    80%
0-shot (Creative writing, Rule following)     86   82   80   76   75    80%
0-shot (Creative writing, Rule following)     83   80   79   77   77    79%
0-shot (Creative writing, Rule following)     85   80   79   78   77    80%
genre: 77.79%

Novelcrafter Default Prompt
0-shot (Creative writing, Rule following)     78   75   73   73   67    73%
0-shot (Creative writing, Rule following)     83   82   80   77   70    78%
0-shot (Creative writing, Rule following)     87   82   82   79   77    81%
0-shot (Creative writing, Rule following)     83   81   79   78   76    79%
0-shot (Creative writing, Rule following)     83   80   80   78   75    79%
0-shot (Creative writing, Rule following)     83   79   77   77   74    78%
Novelcrafter Default Prompt: 78.13%

Overall: 77.32%
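
The 77.32% section score is consistent with an unweighted mean of the three group scores listed for Bad Writing Habits. The sketch below only checks that arithmetic. The averaging rule is an assumption inferred from the numbers, not a documented scoring method, and other sections may aggregate differently.

```python
from statistics import mean

# Group scores (percent) as reported for Bad Writing Habits.
group_scores = {
    "Detailed Writing Rules": 76.03,
    "genre": 77.79,
    "Novelcrafter Default Prompt": 78.13,
}

# Assumption: the section score is the plain average of the group scores.
section_score = round(mean(group_scores.values()), 2)
print(section_score)
```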

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.
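
To make the expected output shape concrete, here is a minimal sketch of reading such a response. The page does not show the actual XML schema, so the `violations`, `violation`, and `substring` element names and the `paragraph` attribute are hypothetical, as is the sample text.

```python
import xml.etree.ElementTree as ET


def parse_violations(xml_text: str) -> list[tuple[int, str]]:
    """Return (paragraph number, offending substring) pairs.

    The element and attribute names are illustrative assumptions,
    not the benchmark's real schema.
    """
    root = ET.fromstring(xml_text)
    results = []
    for node in root.iter("violation"):
        paragraph = node.get("paragraph")
        if paragraph is None:
            continue  # skip entries without a paragraph number
        substring = (node.findtext("substring") or "").strip()
        results.append((int(paragraph), substring))
    return results


# Hypothetical model response.
sample = """<violations>
  <violation paragraph="2"><substring>her blue eyes</substring></violation>
  <violation paragraph="5"><substring>the northern gate</substring></violation>
</violations>"""

print(parse_violations(sample))
```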

Scenario                                             #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total

matrix
0-shot (Tooling, Utility, Logic, Rule following)     66   64   62   58   58   57   57   56   52   46    58%
0-shot (Tooling, Utility, Logic, Rule following)     79   74   74   74   74   74   71   71   71    0    66%
0-shot (Tooling, Utility, Logic, Rule following)     72   71   68   68   65   62   59   57   53   47    62%
0-shot (Tooling, Utility, Logic, Rule following)     84   83   83   79   76   76   75   75   73   63    77%
matrix: 65.68%

tiers
0-shot (Tooling, Utility, Logic, Rule following)     49   44   44   44   43   42   40   40   40   39    43%
0-shot (Tooling, Utility, Logic, Rule following)     86   86   86   81   81   81   81   81   76   76    81%
0-shot (Tooling, Utility, Logic, Rule following)     81   78   71   70   66   66   62   56   55   52    66%
0-shot (Tooling, Utility, Logic, Rule following)     84   84   84   83   83   79   67   67   63    0    69%
tiers: 64.79%

Overall: 65.24%

Data extraction

Extract key details from a given block of text.

Scenario                                 #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total
0-shot (Utility)                        100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility)                        100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility)                        100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility, Logic)                  50   50   50   50   50   50   50   50   50   50    50%
0-shot (Utility, Logic)                 100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility)                        100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility, Mathematics, Logic)      0    0    0    0    0    0    0    0    0    0     0%
0-shot (Utility, Logic)                   0    0    0    0    0    0    0    0    0    0     0%
0-shot (Utility, Logic)                 100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility, Logic)                   0    0    0    0    0    0    0    0    0    0     0%
0-shot (Utility, Logic)                 100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility, Logic)                 100  100  100  100  100  100  100  100  100  100   100%

Overall: 70.83%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario                                      #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total

dialogue-200
0-shot (Creative writing, Rule following)     50   48   18    0    0    0    0    0    0    0    12%
0-shot (Creative writing, Rule following)     50   39   34   33    7    7    6    1    0    0    18%
0-shot (Creative writing, Rule following)     54    1    0    0    0    0    0    0    0    0     6%
dialogue-200: 11.58%

dialogue-500
0-shot (Creative writing, Rule following)     50   50    5    0    0    0    0    0    0    0    10%
0-shot (Creative writing, Rule following)     47   43   41   29    1    0    0    0    0    0    16%
0-shot (Creative writing, Rule following)     67   49   43   28   27   22   21   17   13    1    29%
dialogue-500: 18.39%

Ungrouped
0-shot (Creative writing, Rule following)    100   61   61   61   61   14   14    0    0    0    37%

Overall: 18.13%

Language Comprehension

Does the model understand more than just English?

Scenario              #1   #2   #3   #4   #5  Total
0-shot (Language)      0    0    0    0    0     0%
0-shot (Language)      0    0    0    0    0     0%
0-shot (Language)    100  100  100  100  100   100%
0-shot (Language)    100  100  100  100  100   100%

Overall: 50.00%

Language Writing

Can the model generate text in different languages?

Scenario              #1   #2   #3   #4   #5  Total
0-shot (Language)    100  100  100   93    0    79%
0-shot (Language)    100  100   63    0    0    53%
0-shot (Language)    100  100    0    0    0    40%
0-shot (Language)    100  100  100  100  100   100%
0-shot (Language)    100  100  100  100   94    99%

Overall: 74.01%

N-Length Sentences

Write sentences with exactly N words.
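
As an illustration of what "exactly N words" can mean in practice, the sketch below measures the share of sentences that hit a target length. It assumes words are whitespace-separated tokens. The benchmark's real tokenization and scoring are not described on this page, and the function name and sample sentences are hypothetical.

```python
def fraction_with_n_words(sentences: list[str], n: int) -> float:
    """Share of sentences containing exactly n whitespace-separated words."""
    if not sentences:
        return 0.0
    hits = sum(1 for sentence in sentences if len(sentence.split()) == n)
    return hits / len(sentences)


# Hypothetical model output for a "four words per sentence" request.
output = ["The cat sat down.", "Birds sing.", "We walked home slowly."]
score = fraction_with_n_words(output, 4)
print(f"{score:.2%}")
```

A stricter checker might strip punctuation or treat hyphenated words differently, and either choice would change borderline results.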

Scenario                    #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total
0-shot (Rule following)     96   93   90   85   55   36   34    9    7    7    51%
0-shot (Rule following)     92   87   87   85   78   75   72   63   50   42    73%
0-shot (Rule following)     10    3    0    0    0    0    0    0    0    0     1%

Overall: 41.91%

Novel outline

Handle questions about the outline of a novel in various formats.

Scenario                      #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total

outline-count
0-shot (Tooling, Utility)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Tooling, Utility)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Tooling, Utility)    100    0    0    0    0    0    0    0    0    0    10%
outline-count: 70.00%

pov-count
0-shot (Tooling, Utility)    100   50   50   50   50   50   50   50   50    0    50%
0-shot (Tooling, Utility)    100  100  100  100  100  100    0    0    0    0    60%
0-shot (Tooling, Utility)    100  100  100  100  100  100  100  100    0    0    80%
pov-count: 63.33%

Overall: 66.67%

Tool usage within Novelcrafter

Output messages related to tool usage within Novelcrafter.

Scenario                      #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total
0-shot (Tooling, Utility)    100  100  100  100  100  100  100  100  100  100   100%

Overall: 100.00%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario               #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total
0-shot (Utility)        0    0    0    0    0    0    0    0    0    0     0%
0-shot (Utility)        0    0    0    0    0    0    0    0    0    0     0%
1-shot (Utility)      100  100  100  100  100  100  100  100  100  100   100%
Few-shot (Utility)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Utility)        0    0    0    0    0    0    0    0    0    0     0%

Overall: 40.00%

Write N of X

Write exactly N words/sentences/paragraphs...
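
The three unit types in this task can be counted with a few lines of code, shown here as a rough sketch. The splitting rules are assumptions (blank lines separate paragraphs, sentences end in `.`, `!`, or `?`, words are whitespace-separated) and may not match how the benchmark counts.

```python
import re


def count_units(text: str) -> dict[str, int]:
    """Count paragraphs, sentences, and words using simple heuristics."""
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    words = text.split()
    return {
        "paragraphs": len(paragraphs),
        "sentences": len(sentences),
        "words": len(words),
    }


# Hypothetical model output.
sample_text = "The door creaked. Nobody moved.\n\nOutside, the rain kept falling."
counts = count_units(sample_text)
print(counts)
```

Abbreviations such as "Dr." or dialogue with embedded punctuation would be miscounted by this sentence heuristic, which may explain part of why exact-count tasks are harder to score than they look.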

Scenario                    #1   #2   #3   #4   #5   #6   #7   #8   #9  #10  Total

paragraphs
0-shot (Rule following)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Rule following)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Rule following)    100  100  100  100  100  100  100  100  100  100   100%
paragraphs: 100.00%

sentences
0-shot (Rule following)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Rule following)    100  100  100  100  100  100  100  100  100  100   100%
0-shot (Rule following)    100   98   98   98   98   98   98   92   54   27    86%
0-shot (Rule following)    100  100  100  100   92   77   77   77    9    9    74%
0-shot (Rule following)    100    0    0    0    0    0    0    0    0    0    10%
sentences: 74.12%

words
0-shot (Rule following)    100  100   98   98   98   98   98   98   98   92    98%
0-shot (Rule following)    100  100  100   98   92   92   92   92   92   77    94%
0-shot (Rule following)     98   77    9    9    2    0    0    0    0    0    20%
0-shot (Rule following)    100   98   77   54   27    9    9    0    0    0    37%
0-shot (Rule following)     98   98   92   92    0    0    0    0    0    0    38%
words: 57.39%

Overall: 73.65%