z-ai/glm-4.5

Z.AI GLM 4.5

Release Date

Jul 25th, 2025

Context Size

131k

Reasoning

No

Benchmark Cost

$2.03

Speed

37.4 tok/s

Categories

20%40%60%80%100%Creative Writing76.6%Tooling99.9%Language97.3%Utility79.2%Reasoning91.0%Text Editing95.3%Rule Following63.8%Hallucination87.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
747372726471%
888581786780%
858379797781%
848479797881%
838178787278%
888684847684%
Detailed Writing Rules79.03%
genre
726969686869%
797270706972%
797875747175%
797373726673%
787471706772%
837978787278%
genre73.24%
Novelcrafter Default Prompt
727169696770%
817878767578%
908180807781%
848280727078%
848079787780%
847877757478%
Novelcrafter Default Prompt77.24%
76.50%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979696969496%
989696959496%
969595949395%
999999999899%
96.09%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9696969390909093%
10067676767676771%
100100100100100100100100%
100100100100100100100100%
9595959594939294%
10010010010010010099100%
9793939389898992%
Generic Prompt94.47%
Specific Prompt
1001001001001001008398%
100100100100100100100100%
10010010010010010099100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9998989696949397%
100100100100100100100100%
10010010010099999399%
Specific Prompt99.21%
96.84%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%