mistralai/mistral-large-2512

Mistral Large 3

Release Date

Dec 1st, 2025

Context Size

262k

Benchmark Cost

$0.89

Speed

45.3 tok/s

Creative writing

67.38%

Rule following

76.15%

Utility

80.98%

Mathematics

100.00%

Tooling

76.08%

Language

92.28%

Logic

84.45%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
0-shot Creative writingRule following
837875727076%
0-shot Creative writingRule following
888585837783%
0-shot Creative writingRule following
948685848286%
0-shot Creative writingRule following
908483818083%
0-shot Creative writingRule following
848382807881%
0-shot Creative writingRule following
848482797581%
Detailed Writing Rules81.75%
genre
0-shot Creative writingRule following
867471696773%
0-shot Creative writingRule following
847976767478%
0-shot Creative writingRule following
868481737279%
0-shot Creative writingRule following
888382818183%
0-shot Creative writingRule following
858281807781%
0-shot Creative writingRule following
848479787079%
genre78.95%
Novelcrafter Default Prompt
0-shot Creative writingRule following
787776757176%
0-shot Creative writingRule following
878380808082%
0-shot Creative writingRule following
878685848285%
0-shot Creative writingRule following
848382797881%
0-shot Creative writingRule following
919187787584%
0-shot Creative writingRule following
858480797781%
Novelcrafter Default Prompt81.46%
80.72%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
matrix
0-shot ToolingUtilityLogicRule following
8887878686858584828185%
0-shot ToolingUtilityLogicRule following
8785858585858383838184%
0-shot ToolingUtilityLogicRule following
9288848484797979797983%
0-shot ToolingUtilityLogicRule following
9797979797929292929294%
matrix86.57%
tiers
0-shot ToolingUtilityLogicRule following
9191919191919191919191%
0-shot ToolingUtilityLogicRule following
9286868686867979797984%
0-shot ToolingUtilityLogicRule following
9789898988868181818186%
0-shot ToolingUtilityLogicRule following
10097979797939288888894%
tiers88.75%
87.66%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
dialogue-200
0-shot Creative writingRule following
5050453800000018%
0-shot Creative writingRule following
1009859504945383018749%
0-shot Creative writingRule following
49262214141000012%
dialogue-20026.71%
dialogue-500
0-shot Creative writingRule following
49412626100000015%
0-shot Creative writingRule following
4925161030000010%
0-shot Creative writingRule following
705545382726000026%
dialogue-50017.14%
Ungrouped
0-shot Creative writingRule following
100100100100100100100100100100100%
33.08%

Language Comprehension

Does the model understand more than just English?

Scenario #1 #2 #3 #4 #5 Total
0-shot Language
100100100100100100%
0-shot Language
100100100100100100%
0-shot Language
100100100100100100%
0-shot Language
100100100100100100%
100.00%

Language Writing

Can the model generate text in different languages?

Scenario #1 #2 #3 #4 #5 Total
0-shot Language
1009282797686%
0-shot Language
1001001001009499%
0-shot Language
1006050505062%
0-shot Language
100100100938495%
0-shot Language
10010090866788%
86.11%

N-Length Sentences

Write sentences with exactly N words

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot Rule following
9693939391918989898991%
0-shot Rule following
9090908784848266595979%
0-shot Rule following
22210000001%
57.03%

Novel outline

Handle questions about the outline of a novel in various formats

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
outline-count
0-shot ToolingUtility
100100100100100100100100100100100%
0-shot ToolingUtility
100100100100100100100100100100100%
0-shot ToolingUtility
10000000000010%
outline-count70.00%
pov-count
0-shot ToolingUtility
100100100100100100100100100090%
0-shot ToolingUtility
10010010010000000040%
0-shot ToolingUtility
00000000000%
pov-count43.33%
56.67%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot ToolingUtility
100100100100100100100100100100100%
100.00%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot Utility
00000000000%
0-shot Utility
100100100100100100100100100100100%
1-shot Utility
100100100100100100100100100100100%
Few-shot Utility
100100100100100100100100100100100%
0-shot Utility
100100100100100100100100100100100%
80.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
paragraphs
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
paragraphs100.00%
sentences
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
10010010010010010010010010098100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
sentences99.95%
words
0-shot Rule following
10010010010010010010098989899%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
92929292929292542270%
0-shot Rule following
100100987790000038%
0-shot Rule following
1001001009898927700067%
words74.94%
90.34%