Minimax M2.5 - NC Bench

minimax/minimax-m2.5

Minimax M2.5

via OpenRouter

Release Date: Feb 12th, 2026
Context Size: 196k
Benchmark Cost: $2.02
Speed: 57.9 tok/s

Creative writing: 78.68%
Rule following: 81.75%
Utility: 85.21%
Mathematics: 100.00%
Tooling: 91.77%
Language: 95.44%
Logic: 89.06%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario	#1	#2	#3	#4	#5	Total
Detailed Writing Rules
Fantasy: entering an ancient ruin 0-shot Creative writing, Rule following	79	75	74	72	68	74%
Horror: alone in an eerie place at night 0-shot Creative writing, Rule following	87	85	79	78	77	81%
Literary fiction: old friends reunite 0-shot Creative writing, Rule following	89	87	87	83	81	85%
Mystery: examining a crime scene 0-shot Creative writing, Rule following	88	86	84	83	82	85%
Romance: separated couple reunites 0-shot Creative writing, Rule following	89	87	86	86	80	85%
Thriller: chase through city streets 0-shot Creative writing, Rule following	89	88	87	86	82	86%
Detailed Writing Rules						82.73%
genre
Fantasy: entering an ancient ruin 0-shot Creative writing, Rule following	85	75	74	73	71	76%
Horror: alone in an eerie place at night 0-shot Creative writing, Rule following	84	81	80	78	72	79%
Literary fiction: old friends reunite 0-shot Creative writing, Rule following	84	81	80	79	76	80%
Mystery: examining a crime scene 0-shot Creative writing, Rule following	89	83	83	81	75	82%
Romance: separated couple reunites 0-shot Creative writing, Rule following	82	78	76	75	74	77%
Thriller: chase through city streets 0-shot Creative writing, Rule following	84	83	82	80	79	81%
genre						79.17%
Novelcrafter Default Prompt
Fantasy: entering an ancient ruin 0-shot Creative writing, Rule following	86	81	76	76	66	77%
Horror: alone in an eerie place at night 0-shot Creative writing, Rule following	88	85	84	79	79	83%
Literary fiction: old friends reunite 0-shot Creative writing, Rule following	85	83	82	76	75	80%
Mystery: examining a crime scene 0-shot Creative writing, Rule following	80	80	80	77	76	79%
Romance: separated couple reunites 0-shot Creative writing, Rule following	87	83	81	81	78	82%
Thriller: chase through city streets 0-shot Creative writing, Rule following	91	88	86	85	81	86%
Novelcrafter Default Prompt						81.10%
81.00%
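
The page does not state how scores are aggregated, but the figures are consistent with unweighted means: each Total matches the mean of its run scores, and the section score matches the mean of the three group averages. A minimal Python sketch of that reading follows; the function name is illustrative, and the unweighted mean is an assumption rather than documented behavior.

```python
from statistics import mean

def scenario_total(run_scores):
    """Mean of the per-run scores for one scenario (assumed aggregation)."""
    return mean(run_scores)

# Run scores copied from the "Fantasy: entering an ancient ruin" row
# under Detailed Writing Rules.
fantasy_runs = [79, 75, 74, 72, 68]
fantasy_total = scenario_total(fantasy_runs)  # 73.6, displayed as 74%

# Group averages as displayed for the three prompt groups.
group_averages = [82.73, 79.17, 81.10]
section_score = mean(group_averages)  # about 81.0, displayed as 81.00%

print(round(fantasy_total, 1), round(section_score, 2))
```

Recomputing a group average from the displayed run scores can differ slightly from the printed value (82.8 versus 82.73 for Detailed Writing Rules), which suggests the displayed run scores are rounded.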

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
matrix
Large codex (40 entries), long passage (1,019 words) 0-shot Tooling, Utility, Logic, Rule following	90	89	87	83	83	81	79	77	76	75	82%
Large codex (40 entries), short passage (165 words) 0-shot Tooling, Utility, Logic, Rule following	96	96	93	92	92	91	90	88	85	84	91%
Small codex (7 entries), long passage (734 words) 0-shot Tooling, Utility, Logic, Rule following	93	93	90	90	88	88	87	87	84	84	88%
Small codex (7 entries), short passage (165 words) 0-shot Tooling, Utility, Logic, Rule following	100	100	97	97	97	92	90	88	88	88	94%
matrix											88.69%
tiers
5 codex entries 0-shot Tooling, Utility, Logic, Rule following	100	100	93	91	83	83	83	75	75	75	86%
10 codex entries 0-shot Tooling, Utility, Logic, Rule following	100	100	100	100	100	100	100	94	86	86	97%
20 codex entries 0-shot Tooling, Utility, Logic, Rule following	94	94	91	89	88	88	86	86	82	81	88%
40 codex entries 0-shot Tooling, Utility, Logic, Rule following	97	97	93	93	93	90	90	84	80	79	90%
tiers											90.04%
89.36%
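
The description above says the model must emit structured XML giving a paragraph number and a substring for each violation, but the page does not show the schema. The sketch below assumes a hypothetical layout (`<violations>`, `<violation paragraph="…">`, `<substring>`) and made-up sample output, purely to illustrate how such a response could be parsed and sanity-checked against the passage.

```python
import xml.etree.ElementTree as ET

# Hypothetical model output; the real tag and attribute names are not given on this page.
SAMPLE_OUTPUT = """
<violations>
  <violation paragraph="2"><substring>her green eyes</substring></violation>
  <violation paragraph="5"><substring>the northern gate</substring></violation>
</violations>
"""

def parse_violations(xml_text):
    """Return (paragraph_number, substring) pairs from the assumed XML layout."""
    root = ET.fromstring(xml_text.strip())
    results = []
    for node in root.iter("violation"):
        paragraph = int(node.get("paragraph"))
        substring = (node.findtext("substring") or "").strip()
        results.append((paragraph, substring))
    return results

def substring_in_paragraph(passage, paragraph, substring):
    """Check that a reported substring occurs in the numbered paragraph (1-based)."""
    paragraphs = [p for p in passage.split("\n\n") if p.strip()]
    return 1 <= paragraph <= len(paragraphs) and substring in paragraphs[paragraph - 1]

violations = parse_violations(SAMPLE_OUTPUT)
print(violations)
```

Checking that each reported substring really appears in the cited paragraph is one plausible way to catch hallucinated spans; whether the benchmark scores responses this way is not stated.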

Data extraction

Extract key details from a given block of text.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
All valid emails 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Contextual pronoun 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Fruits excluding citrus 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Future event time 0-shot Utility, Logic	100	100	100	100	100	100	100	100	100	100	100%
Guess the pet 0-shot Utility, Logic	100	100	100	100	100	100	100	100	100	100	100%
Highest-rated movie 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
Indirect birth year 0-shot Utility, Mathematics, Logic	100	100	100	100	100	100	100	100	100	100	100%
What instrument does Lucy play? 0-shot Utility, Logic	100	100	100	100	100	100	100	100	100	100	100%
What's the color of the car? 0-shot Utility, Logic	100	100	100	100	100	100	100	100	100	100	100%
What's the correct time? 0-shot Utility, Logic	100	0	0	0	0	0	0	0	0	0	10%
Who's the sister? 0-shot Utility, Logic	100	100	100	100	100	100	100	100	100	100	100%
Who's the tallest? 0-shot Utility, Logic	100	100	100	100	100	100	100	100	100	100	100%
92.50%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
dialogue-200
Write 200 words with 10% dialogue 0-shot Creative writing, Rule following	100	100	100	100	76	68	68	56	50	45	76%
Write 200 words with 50% dialogue 0-shot Creative writing, Rule following	100	100	100	100	100	100	100	100	100	76	98%
Write 200 words with 90% dialogue 0-shot Creative writing, Rule following	100	100	64	59	50	50	50	50	50	50	62%
dialogue-200											78.70%
dialogue-500
Write 500 words with 30% dialogue 0-shot Creative writing, Rule following	100	99	99	97	96	95	52	50	46	0	73%
Write 500 words with 50% dialogue 0-shot Creative writing, Rule following	100	100	94	75	63	52	50	50	50	0	63%
Write 500 words with 70% dialogue 0-shot Creative writing, Rule following	98	90	77	58	50	50	50	33	32	1	54%
dialogue-500											63.58%
Ungrouped
Write unattributed dialogue 0-shot Creative writing, Rule following	100	100	100	100	100	100	100	61	61	1	82%
72.72%
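
The dialogue-200 and dialogue-500 scenarios ask for a target share of dialogue. The page does not say how that share is measured; one simple reading, sketched here as an assumption, is the fraction of words that fall inside double quotation marks. The regular expression and function name are illustrative only.

```python
import re

# Matches text inside straight or curly double quotes (an assumed convention).
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')

def dialogue_share(text):
    """Fraction of whitespace-separated words that sit inside double quotes."""
    total_words = len(text.split())
    if total_words == 0:
        return 0.0
    dialogue_words = 0
    for match in QUOTED.finditer(text):
        spoken = match.group(1) or match.group(2) or ""
        dialogue_words += len(spoken.split())
    return dialogue_words / total_words

sample = '"Are you coming?" she asked. He shook his head and left.'
share = dialogue_share(sample)  # 3 quoted words out of 11
print(f"{share:.0%}")
```

A measure like this ignores single-quoted or dash-introduced dialogue, so a real checker would likely need more careful handling.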

Language Comprehension

Does the model understand more than just English?

Scenario	#1	#2	#3	#4	#5	Total
Asking for directions (Dutch) 0-shot Language	100	100	100	100	0	80%
Asking for directions (German) 0-shot Language	100	100	100	100	100	100%
Friend got new kittens (German) 0-shot Language	100	100	100	100	100	100%
Friend got new kittens (Tagalog) 0-shot Language	100	100	100	100	100	100%
95.00%

Language Writing

Can the model generate text in different languages?

Scenario	#1	#2	#3	#4	#5	Total
Character dialogue (French) in a story 0-shot Language	100	100	100	100	100	100%
Character dialogue (German) in a story 0-shot Language	100	100	100	93	67	92%
Character dialogue (Hindi) in a story 0-shot Language	100	100	100	93	53	89%
Character dialogue (Italian) in a story 0-shot Language	100	100	100	100	100	100%
Character dialogue (Spanish) in a story 0-shot Language	100	100	100	96	93	98%
95.80%

N-Length Sentences

Write sentences with exactly N words

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
Write sentences with 5 words each 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
Write sentences with 10 words each 0-shot Rule following	100	100	100	100	100	100	98	97	95	88	98%
Write sentences with 20 words each 0-shot Rule following	100	100	100	100	100	100	100	100	100	75	98%
98.41%
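
A check for "exactly N words per sentence" can be approximated by splitting on sentence-ending punctuation and counting whitespace-separated words. The sketch below is one such approximation, not the benchmark's actual scorer; the splitting rule and the function names are assumptions.

```python
import re

def sentence_word_counts(text):
    """Word count per sentence, using a naive split after '.', '!' or '?'."""
    # Abbreviations such as "Dr." would be split incorrectly by this rule.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    return [len(s.split()) for s in sentences]

def fraction_matching(text, n):
    """Share of sentences that contain exactly n words."""
    counts = sentence_word_counts(text)
    if not counts:
        return 0.0
    return sum(1 for c in counts if c == n) / len(counts)

sample = "The old gate creaked open. Rain fell on the stones. Nobody came."
counts = sentence_word_counts(sample)  # [5, 5, 2]
print(counts, fraction_matching(sample, 5))
```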

Novel outline

Handle questions about the outline of a novel in various formats

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
outline-count
Count acts 0-shot Tooling, Utility	100	100	100	100	100	100	100	100	100	100	100%
Count chapters 0-shot Tooling, Utility	100	100	100	100	100	100	100	100	100	100	100%
Count scenes 0-shot Tooling, Utility	100	100	100	100	100	100	100	100	100	0	90%
outline-count											96.67%
pov-count
Count points of view for Jack and Olivia 0-shot Tooling, Utility	100	100	100	100	100	100	100	50	50	50	85%
Count points of view for Jack Harper 0-shot Tooling, Utility	100	100	100	100	100	100	100	100	100	0	90%
Count points of view for Olivia 0-shot Tooling, Utility	100	100	100	100	100	100	100	100	100	100	100%
pov-count											91.67%
94.17%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
Create alternate prose sections 0-shot Tooling, Utility	100	100	100	100	100	100	100	100	100	67	97%
96.67%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
Multiple speakers 0-shot Utility	100	100	100	0	0	0	0	0	0	0	30%
Simple 0-shot Utility	100	100	100	100	100	0	0	0	0	0	50%
Simple (1-shot) 1-shot Utility	100	100	100	100	100	0	0	0	0	0	50%
Simple (5-shot) Few-shot Utility	100	0	0	0	0	0	0	0	0	0	10%
Unattributed dialogue 0-shot Utility	100	100	100	100	100	100	100	100	100	100	100%
48.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario	#1	#2	#3	#4	#5	#6	#7	#8	#9	#10	Total
paragraphs
1 paragraph summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
3 paragraph summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
5 paragraph summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
paragraphs											100.00%
sentences
1 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
3 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	100	100%
10 sentence summary 0-shot Rule following	100	100	100	100	100	100	100	98	98	92	99%
20 sentence summary 0-shot Rule following	100	100	100	100	98	92	0	0	0	0	59%
50 sentence summary 0-shot Rule following	100	100	0	0	0	0	0	0	0	0	20%
sentences											75.58%
words
10 word summary 0-shot Rule following	100	100	100	100	100	100	100	100	100	98	100%
20 word summary 0-shot Rule following	100	100	100	100	100	100	100	100	98	92	99%
50 word summary 0-shot Rule following	100	100	100	98	98	92	54	54	54	0	75%
100 word summary 0-shot Rule following	100	100	98	98	92	77	2	0	0	0	57%
200 word summary 0-shot Rule following	100	92	9	0	0	0	0	0	0	0	20%
words											70.15%
79.13%