mistralai/mistral-small-3.2-24b-instruct

Mistral Small 3.2 24B

Release Date

Jun 20th, 2025

Context Size

131k

Benchmark Cost

$0.74

Speed

64.6 tok/s

Creative writing

59.81%

Rule following

67.63%

Utility

71.19%

Mathematics

50.00%

Tooling

61.20%

Language

73.50%

Logic

75.19%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
0-shot Creative writingRule following
676664625663%
0-shot Creative writingRule following
716967676167%
0-shot Creative writingRule following
787574736272%
0-shot Creative writingRule following
777774736874%
0-shot Creative writingRule following
816868676369%
0-shot Creative writingRule following
726867676067%
Detailed Writing Rules68.78%
genre
0-shot Creative writingRule following
726866656066%
0-shot Creative writingRule following
797372705770%
0-shot Creative writingRule following
807979787278%
0-shot Creative writingRule following
777169686069%
0-shot Creative writingRule following
787676685971%
0-shot Creative writingRule following
717169686669%
genre70.60%
Novelcrafter Default Prompt
0-shot Creative writingRule following
696761616164%
0-shot Creative writingRule following
847267656470%
0-shot Creative writingRule following
777674726573%
0-shot Creative writingRule following
837272726873%
0-shot Creative writingRule following
797170676470%
0-shot Creative writingRule following
807978747176%
Novelcrafter Default Prompt71.15%
70.18%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
matrix
0-shot ToolingUtilityLogicRule following
7066656562585754494059%
0-shot ToolingUtilityLogicRule following
8581808078787674747478%
0-shot ToolingUtilityLogicRule following
8888888483808079757582%
0-shot ToolingUtilityLogicRule following
9797979388888888888891%
matrix77.48%
tiers
0-shot ToolingUtilityLogicRule following
7669696767676060605064%
0-shot ToolingUtilityLogicRule following
10094949286868674726685%
0-shot ToolingUtilityLogicRule following
8281797777747373736976%
0-shot ToolingUtilityLogicRule following
8075757571666363634468%
tiers73.27%
75.37%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
dialogue-200
0-shot Creative writingRule following
50434126187100019%
0-shot Creative writingRule following
505050452626000025%
0-shot Creative writingRule following
975554494848444130047%
dialogue-20029.98%
dialogue-500
0-shot Creative writingRule following
3426100000006%
0-shot Creative writingRule following
433730520000012%
0-shot Creative writingRule following
92463835282100024%
dialogue-50014.07%
Ungrouped
0-shot Creative writingRule following
100100100100100100100100100100100%
33.17%

Language Comprehension

Does the model understand more than just English?

Scenario #1 #2 #3 #4 #5 Total
0-shot Language
000000%
0-shot Language
100100100100100100%
0-shot Language
100100100100100100%
0-shot Language
100100100100100100%
75.00%

Language Writing

Can the model generate text in different languages?

Scenario #1 #2 #3 #4 #5 Total
0-shot Language
1009400039%
0-shot Language
100100100100080%
0-shot Language
59555050043%
0-shot Language
100100100100100100%
0-shot Language
100100100100100100%
72.30%

N-Length Sentences

Write sentences with exactly N words

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot Rule following
9593919087858380776385%
0-shot Rule following
8079777064595752514864%
0-shot Rule following
5147271975100016%
54.71%

Novel outline

Handle questions about the outline of a novel in various formats

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
outline-count
0-shot ToolingUtility
100100100100100100100100100100100%
0-shot ToolingUtility
100100100100100100100100100100100%
0-shot ToolingUtility
00000000000%
outline-count66.67%
pov-count
0-shot ToolingUtility
500000000005%
0-shot ToolingUtility
00000000000%
0-shot ToolingUtility
10000000000010%
pov-count5.00%
35.83%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot ToolingUtility
100100100100100100100100100100100%
100.00%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot Utility
00000000000%
0-shot Utility
100100100100100100000060%
1-shot Utility
100100100100100100100100100100100%
Few-shot Utility
100100100100100100100100100100100%
0-shot Utility
100100100100100100100100100100100%
72.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
paragraphs
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
paragraphs100.00%
sentences
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
10010010010010010010010010098100%
sentences99.97%
words
0-shot Rule following
100100100100100100100100100100100%
0-shot Rule following
10098989898989892929297%
0-shot Rule following
10010010098929922051%
0-shot Rule following
279000000004%
0-shot Rule following
00000000000%
words50.30%
80.87%