moonshotai/kimi-k2.5

MoonshotAI: Kimi K2.5

Release Date

Jan 27th, 2026

Context Size

262k

Reasoning

Yes

Benchmark Cost

$13.83

Speed

54.7 tok/s

Categories

Creative Writing: 81.3%
Tooling: 100.0%
Language: 97.1%
Utility: 96.6%
Reasoning: 95.4%
Text Editing: 97.8%
Rule Following: 72.0%
Hallucination: 88.0%

Subcategories

Chart of subcategory scores (axis 20% to 100%): AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario  #1  #2  #3  #4  #5  Total

Detailed Writing Rules
          83  78  78  77  72  78%
          86  86  86  82  79  84%
          87  87  85  84  80  85%
          88  86  85  83  82  85%
          90  89  88  83  81  86%
          91  90  86  85  81  87%
Detailed Writing Rules: 83.94%

genre
          78  78  75  74  68  75%
          79  79  79  78  74  78%
          85  84  82  81  74  81%
          86  77  76  70  68  75%
          82  78  78  70  69  75%
          85  83  80  76  75  80%
genre: 77.38%

Novelcrafter Default Prompt
          79  79  76  75  75  77%
          83  81  80  79  79  80%
          86  83  83  80  79  82%
          86  85  83  80  76  82%
          85  81  81  81  81  82%
          84  83  83  83  79  82%
Novelcrafter Default Prompt: 80.85%

Overall: 80.72%
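The 80.72% figure for this section matches the unweighted mean of the three group scores (83.94%, 77.38%, 80.85%). The short check below reproduces that arithmetic. Whether the benchmark actually aggregates this way is an assumption; the variable names are illustrative only.

```python
# Group scores copied from the Bad Writing Habits results.
group_scores = {
    "Detailed Writing Rules": 83.94,
    "genre": 77.38,
    "Novelcrafter Default Prompt": 80.85,
}

# Assumption: the section total is the plain (unweighted) mean of the groups.
overall = sum(group_scores.values()) / len(group_scores)
print(f"{overall:.2f}")  # prints 80.72
```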

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
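As a rough illustration of what "well-formed XML" means for this kind of task, the sketch below parses a model response with Python's standard `xml.etree.ElementTree` and rejects it if parsing fails. The `<codex>`, `<entry>`, and `<name>` element names and the `type` attribute are hypothetical; the benchmark's actual schema is not given here.

```python
import xml.etree.ElementTree as ET

def parse_codex(xml_text):
    """Return (type, name) pairs, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    # Hypothetical schema: <entry type="..."><name>...</name></entry>
    return [(e.get("type"), e.findtext("name")) for e in root.iter("entry")]

sample = """<codex>
  <entry type="character"><name>Mara</name></entry>
  <entry type="location"><name>Greyhaven</name></entry>
</codex>"""

entries = parse_codex(sample)
broken = parse_codex("<codex><entry></codex>")  # mismatched tags
```

A response with mismatched or unclosed tags yields `None`, so a grader built along these lines could separate formatting failures from extraction errors.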

Scenario  #1  #2  #3  #4  #5  Total
          98  97  96  95  95  96%
          97  97  97  96  91  96%
          98  97  97  96  95  96%
          98  98  98  96  95  97%

Overall: 96.32%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
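Scoring "each expected change independently" can be pictured as a list of separate pass/fail checks whose pass rate becomes the score. The sketch below is one possible reading, not the benchmark's grader: the function name, the substring-based checks, and the sample texts are all assumptions made for illustration.

```python
def score_replacement(output, expected_present, expected_absent):
    """Percentage of independent checks that pass.

    Each string in expected_present must appear in the output, and each
    string in expected_absent must not. Every check counts equally.
    """
    checks = [s in output for s in expected_present]
    checks += [s not in output for s in expected_absent]
    if not checks:
        return 100.0
    return 100.0 * sum(checks) / len(checks)

# Hypothetical task: rename Anna -> Elena, Southport -> Northport,
# and expand the contraction "didn't".
present = ["Elena", "Northport", "did not"]
absent = ["Anna", "Southport", "didn't"]

full = score_replacement(
    "Elena walked to Northport. She did not look back.", present, absent)
partial = score_replacement(
    "Elena walked to Southport. She did not look back.", present, absent)
```

In the second call the location rename was missed, so two of the six checks fail while the other four still earn credit. That is how a row can land at 95% rather than dropping to zero for a single slip.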

Scenario  #1   #2   #3   #4   #5   #6   #7   Total

Generic Prompt
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          99   99   99   99   99   99   97   99%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          97   96   96   95   95   95   93   95%
          100  100  100  100  100  100  100  100%
          99   97   96   96   96   89   89   95%
Generic Prompt: 98.71%

Specific Prompt
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  99   99   100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  99   100%
          99   98   98   98   97   96   80   95%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  97   96   99%
Specific Prompt: 99.34%

Overall: 99.02%

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario  #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
          100  100  100  100  100  100  100  100  100  100  100%

Overall: 100.00%