z-ai/glm-5.1

Z.AI GLM 5.1

Release Date

Apr 7th, 2026

Context Size

200k

Reasoning

Yes

Benchmark Cost

$13.71

Speed

45.1 tok/s

Categories

20%40%60%80%100%Creative Writing84.0%Tooling100.0%Language91.6%Utility97.5%Reasoning96.8%Text Editing98.9%Rule Following88.4%Hallucination97.7%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
868483817782%
929291919191%
918885848186%
948987858488%
939187858187%
919089898889%
Detailed Writing Rules87.38%
genre
787877767477%
878383827882%
878679777380%
898582777381%
868584827682%
848281807380%
genre80.50%
Novelcrafter Default Prompt
847877747377%
938885807784%
868383817481%
919087838287%
858583797681%
908383838284%
Novelcrafter Default Prompt82.47%
83.45%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989897979697%
989797969697%
999999989899%
100100100999999%
98.15%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
10010010010010010099100%
9898979797979597%
100100100100100100100100%
9999999996969297%
Generic Prompt99.20%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
1001001001001009999100%
100100100100100100100100%
100100100100100100100100%
10010010010010010099100%
10099999999989799%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.83%
99.52%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%