minimax/minimax-m2.5

MiniMax M2.5

Release Date

Feb 12th, 2026

Context Size

196k

Reasoning

Yes

Benchmark Cost

$2.55

Speed

60.3 tok/s

Categories

20%40%60%80%100%Creative Writing81.2%Tooling97.9%Language96.1%Utility90.4%Reasoning92.4%Text Editing96.0%Rule Following62.7%Hallucination92.9%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
797574726874%
858479777780%
898787838185%
888684838285%
898786858085%
898887868286%
Detailed Writing Rules82.52%
genre
847573717175%
848078777278%
848180787580%
878383817582%
817776747376%
848382807781%
genre78.68%
Novelcrafter Default Prompt
867976766676%
888584797983%
858382767580%
808080777679%
878381807681%
918886858186%
Novelcrafter Default Prompt80.96%
80.72%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
959492929193%
969494949394%
989796959496%
999796968995%
94.55%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
1001001001001001008698%
10010010010010010098100%
9999969696959597%
1001001009999996795%
100100100100100100100100%
10099999999978597%
9796969594929194%
10010010010098989899%
10099969695958896%
Generic Prompt97.33%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999999989498%
100100100100999998100%
100100100100100100100100%
9999999999999999%
9696959594949295%
100100100100100100100100%
10099999999999999%
Specific Prompt99.00%
98.16%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
1001001001001001001001001006797%
96.67%