z-ai/glm-4.6

Z.AI GLM 4.6

Release Date: Sep 30th, 2025
Context Size: 200k
Reasoning: Yes
Benchmark Cost: $4.92
Speed: 48.9 tok/s

Categories

Creative Writing: 78.9%
Tooling: 100.0%
Language: 96.6%
Utility: 88.6%
Reasoning: 95.1%
Text Editing: 97.8%
Rule Following: 65.9%
Hallucination: 90.1%

Subcategories

[Chart, 20% to 100% axis: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario                       #1    #2    #3    #4    #5    Total

Detailed Writing Rules
                               77    74    72    72    71    73%
                               83    81    81    79    73    79%
                               86    83    81    80    78    82%
                               82    81    80    79    79    80%
                               82    80    78    76    75    78%
                               89    86    77    77    75    81%
Detailed Writing Rules                                       78.93%

genre
                               78    73    70    68    65    71%
                               84    75    71    70    70    74%
                               75    75    72    72    70    73%
                               78    77    72    72    70    74%
                               78    77    75    73    66    74%
                               82    81    75    74    73    77%
genre                                                        73.68%

Novelcrafter Default Prompt
                               84    78    74    73    70    76%
                               84    82    80    79    77    80%
                               90    86    82    80    78    83%
                               85    83    81    79    79    81%
                               87    83    79    78    76    81%
                               93    90    81    80    77    84%
Novelcrafter Default Prompt                                  80.87%

                                                             77.83%
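
The final 77.83% is consistent with an unweighted mean of the three group scores reported in the table. The page does not state how the figure is aggregated, so the sketch below is only a check of that assumption, not a description of the benchmark's own method.

```python
from statistics import mean

# Group scores copied from the Bad Writing Habits table.
group_scores = {
    "Detailed Writing Rules": 78.93,
    "genre": 73.68,
    "Novelcrafter Default Prompt": 80.87,
}

# Assumption: the section total is the plain (unweighted) mean of the groups.
overall = round(mean(group_scores.values()), 2)
print(f"{overall:.2f}%")
```

The same check applied to the Text Replacement groups (98.29 and 99.53) gives 98.91, which matches the total reported there.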

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
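
As a rough illustration of the "well-formed XML" requirement, the following sketch uses Python's standard XML parser to accept or reject a model response. The element names (`codex`, `entry`, `name`) and the sample strings are hypothetical; the page does not show the schema the benchmark expects.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if xml_text parses as a single well-formed XML document."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# Hypothetical responses: the second one closes <entry> before <name>.
good = "<codex><entry type='character'><name>Mara</name></entry></codex>"
bad = "<codex><entry type='character'><name>Mara</entry></codex>"

print(is_well_formed(good), is_well_formed(bad))
```

Well-formedness is only the minimum bar; a real scorer would presumably also compare the extracted entries against the expected ones.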

Scenario    #1    #2    #3    #4    #5    Total
            97    96    96    95    95    96%
            99    98    98    96    92    97%
            98    97    96    96    95    96%
            99    99    97    96    93    97%

                                          96.42%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
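
One way to picture "checking each expected change independently" is to treat every expected edit as its own pass/fail check and report the fraction that pass. The function name, the substring-based checks, and the sample sentence below are all assumptions made for illustration; the benchmark's actual checker is not described on this page.

```python
def score_replacement(output: str,
                      must_contain: list[str],
                      must_not_contain: list[str]) -> float:
    """Score an output as the percentage of independent checks that pass."""
    # Each expected new string must appear in the output.
    checks = [s in output for s in must_contain]
    # Each string that should have been replaced must be gone.
    checks += [s not in output for s in must_not_contain]
    if not checks:
        raise ValueError("at least one expected change is required")
    return 100.0 * sum(checks) / len(checks)


# Hypothetical task: rename Anna to Elena and expand "didn't" to "did not".
output = "Elena did not see Anna leave."
score = score_replacement(
    output,
    must_contain=["Elena", "did not"],
    must_not_contain=["Anna", "didn't"],
)
print(score)  # one leftover "Anna" fails 1 of 4 checks
```

Scoring this way gives partial credit: a single missed rename lowers the score without zeroing out the changes that were applied correctly.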

Scenario            #1    #2    #3    #4    #5    #6    #7    Total

Generic Prompt
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   100   100   100%
                    99    99    96    96    96    95    93    96%
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   100   100   100%
                    98    96    95    95    94    94    93    95%
                    100   100   100   100   100   100   100   100%
                    100   100   99    92    89    86    86    93%
Generic Prompt                                                98.29%

Specific Prompt
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   100   99    100%
                    100   100   100   100   100   100   99    100%
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   100   95    99%
                    98    98    98    97    96    96    95    97%
                    100   100   100   100   100   100   100   100%
                    100   100   100   100   100   99    99    100%
Specific Prompt                                               99.53%

                                                              98.91%

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario    #1    #2    #3    #4    #5    #6    #7    #8    #9    #10   Total
            100   100   100   100   100   100   100   100   100   100   100%

                                                                        100.00%