z-ai/glm-4.5-air

Z.AI GLM 4.5 Air

Release Date

Jul 25th, 2025

Context Size

131k

Reasoning

No

Benchmark Cost

$1.36

Speed

52.3 tok/s

Categories

20%40%60%80%100%Creative Writing74.6%Tooling99.6%Language95.0%Utility76.6%Reasoning87.9%Text Editing94.4%Rule Following44.1%Hallucination92.7%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
767069696870%
827978777778%
898676767680%
848381797781%
878281797982%
868580787581%
Detailed Writing Rules78.73%
genre
717069676669%
746867636267%
817875747376%
787875736774%
767570686771%
757370686670%
genre71.25%
Novelcrafter Default Prompt
777668686771%
807571717073%
838180756978%
817974726774%
807573717174%
818075737276%
Novelcrafter Default Prompt74.47%
74.82%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
959492929193%
959393918892%
949493888691%
949291908089%
91.23%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999959595959597%
100100100100100100100100%
100100100100100100100100%
10010010010099999599%
9796959392807790%
100100100100100828295%
10097979797979397%
Generic Prompt97.46%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9998989898979598%
100100100100100100100100%
100100100100100100100100%
10099999594929296%
9896969694939095%
10010010010010010098100%
100100100100100979799%
Specific Prompt98.54%
98.00%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%