NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQRJ9X968FPQCX9QWXP2S8
Run Details
Model:
Z.AI GLM 5
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
40.1s
Cost:
$0.005544
Provider:
SiliconFlow
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The emergency lights bathed the abandoned Tube station in harsh red, turning the dust motes into swirling blood. Detective Harlow Quinn ducked beneath the yellow tape, her worn leather
watch
catching on the edge as she stepped into what the uniforms had already designated Crime Scene Seven. Seven. In one night. "You're late, Quinn." DCI Thorne stood by the platform edge, his face grim in the emergency lighting. He looked like he hadn't slept in days—probably because he hadn't. Neither had she. "Traffic." She moved past him, her sharp jaw set as she surveyed the scene. "What have we got?" "Male, mid-thirties. No ID, no wallet, no phone. Found by a transit maintenance crew doing routine structural checks." Thorne gestured toward the platform. "No visible wounds. No signs of struggle. He's just... sitting there." Quinn approached the body. The man sat slumped against the tiled wall, legs str
etched
across the grimy platform floor. He could have been waiting for a train that would never come. His clothes were ordinary—dark jacket, jeans, trainers unmarked by the filth that covered every surface down here. That was wrong already. No one sat on these platforms without collecting twenty years of grime. She crouched beside him, noting the absence of the expected decay in the air. The station had
been closed
since the nineties. The smell should have been
oppressive
—mold, standing water, rat droppings, the particular staleness of air trapped underground for decades. Instead, the air tasted faintly of copper and something else. Something that prickled at the back of her throat. "Homeless?" she asked, though she already knew the answer. "Not with those clothes. Not with those hands." Thorne had moved to stand behind her. "Clean nails, no calluses. Soft palms. This man worked at a desk." "So what's a clean-handed office worker doing in a sealed Tube station with no visible entry wounds?" "That's the question." Thorne's voice carried an edge of exhaustion. "Patrol checked all the entrances. Sealed tight. Padlocks are rusted shut, have been for years. Graffiti's undisturbed. No footprints in the dust except the maintenance crew who found him." Quinn leaned closer to the body. The man's face was peaceful, almost serene. His eyes were closed, lips
slightly
parted. She'd seen enough corpses to know death rarely looked this comfortable. Even the gentle ones usually carried some final tension in their features—the body's last protest against the inevitable. This man had simply stopped. "Has the ME seen him?" "On her way. But Quinn—" Thorne hesitated, and she heard him shift his
weight
behind her. "The maintenance crew said something odd. Initial statement, before they lawyered up." She turned her head
slightly
, not quite looking at him. "What?" "Claimed they heard music. Faint, like it was coming from deeper in the tunnels. But when they checked, the tunnels dead-ended. Collapsed in the seventies, never repaired." "Music." She made the word flat, unimpressed. Through habit, she found herself cataloguing details: the man's jacket collar was damp, though the platform was dry. A fine residue clung to his fingertips,
glinting
faintly when she angled her torch across his hands. Not dust. Something
crystalline
. "There's more." Thorne
was trying
to keep his voice steady, but she caught the tremor beneath it. "The crew leader had a... episode. Started ranting about gates and doorways. Saying things that didn't make sense. They're holding him for psychiatric evaluation." Quinn stood, her joints protesting after years of too many crime scenes and too few rest days. She was forty-one and some mornings she felt twice that. She moved to the platform edge and looked down at the tracks. The rusted rails disappeared into darkness on both sides. No third rail—this line had been older, one of the first, decommissioned before electrification reached every tunnel. "Show me where they entered." Thorne led her through a service door, down a narrow corridor, to a ladder ascending toward a street-level grate. The padlock was indeed rusted, undisturbed. The maintenance crew had used bolt cutters on a different entrance entirely—another service access two hundred meters down the line, in an active station. "Nobody came through here," Thorne said. "I know how it looks, but physically, it's impossible. No way in, no way out. Even if someone killed him elsewhere and dumped the body, they'd have to carry him through
tunnels that collapse
if you look at them wrong." Quinn ran her fingers along the corridor wall. The tile was original, Victorian, painted over countless times before the station closed. The graffiti layered on top was decades old, tags and political slogans faded to illegibility. But here, near the floor, something caught her eye. She crouched again, angling her torch. A thin line of verdigris, like aged copper, traced along the grouting between tiles. It was fresh—the oxidation hadn't settled, hadn't
been disturbed
by time or touch. The line curved and doubled back, forming a shape she almost recognized. "What
is that?" Thorne
had followed her gaze. "Don't know yet." She pulled an evidence bag from her pocket and scraped a sample of the residue from the dead man's fingertips. The crystals caught the light, refracting it strangely. In the bag, they
seemed to
shift, though she hadn't moved. "What do we know about this station's history?" "Standard closure. Structural concerns. Low ridership." Thorne shrugged. "Same story as a dozen others." But Quinn was already moving, following the verdigris line along the wall. It wound around a corner, deeper into the station, past collapsed ceiling panels and pools of stagnant water that reflected the red emergency lights. She was conscious of Thorne following, his breathing heavy in the enclosed space. The line terminated at a service door. Unlike the others, this one showed no rust. The hinges were newly oiled. The handle turned smoothly under her gloved hand. "Quinn—" Thorne's voice
was warning
now. "This isn't cleared. We don't know what's behind there." "That's why I'm opening it." The door swung inward on soundless hinges, and Quinn's torch beam cut through darkness that
seemed to
resist the light. The space beyond wasn't a maintenance closet or a utility passage. It was larger, much larger, and the air that flowed out was cold and smelled of something she couldn't name but recognized somewhere deeper than memory. Her hand went to her wrist, brushing the worn leather of her
watch
. Three years ago, she'd stood in another impossible space, another scene that didn't make sense. Morris had been beside her then. He'd walked into that room and hadn't walked out, and the official report said structural collapse, and she'd stopped arguing because arguing got her nothing but concerned looks and mandatory psych evaluations. "Close the door," she said quietly. "Quinn—" "Close it. Don't look. You didn't see this." She turned to face him, and whatever was in her expression made Thorne step back. "The victim wandered in here. Got lost. Probably overdosed on something synthetic. We'll find drugs in his system, and this door will be explained away as an old maintenance entrance the records lost. That's what happened. That's what you'll write in your report." "That's not—" He stopped. Swallowed. "That's not what happened." "No," she agreed. "But it's what happened on paper. And you don't want the alternative in your file, Thorne. Trust me." She waited until he retreated, until the door clicked shut behind him. Then she turned back to the darkness and raised her torch. The beam fell on something half-buried in debris—a small brass compass, its casing
etched
with markings that hurt to look at directly, verdigris blooming across its surface like disease. The needle spun wildly, then stopped, pointing not north but inward, toward the heart of the impossible space. Quinn pocketed the compass without touching it directly. Her heart
was hammering
, but her hands were steady. Eighteen years on the force, and she'd learned to trust the evidence even when it led somewhere that didn't exist. Somewhere in the tunnels, faint as a half-remembered dream, music
began to
play.
created_at
"2026-02-17T02:45:19.634452+00:00"
2/17/2026, 2:45:19 AM
model
"z-ai/glm-5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1010
moderation_latency
(null)
generation_time
40057
tokens_prompt
923
tokens_completion
2490
native_tokens_prompt
842
native_tokens_completion
2075
native_tokens_completion_images
(null)
native_tokens_reasoning
296
native_tokens_cached
128
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00554385
router
(null)
provider_responses
0
id
"019c697c4a2422c5b76394a2bd7b18a4"
status
200
is_byok
false
latency
1010
endpoint_id
"c173ea30-3dc0-4957-850e-f2df8c031f10"
provider_name
"SiliconFlow"
model_permaslug
"z-ai/glm-5-20260211"
api_type
"completions"
id
"gen-1771296278-8YwAJjAo0RHFBBkeMoyG"
upstream_id
"019c697c4a2422c5b76394a2bd7b18a4"
total_cost
0.00554385
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
50.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
4
adverbTagCount
1
adverbTags
0
"she said quietly [quietly]"
85.1522%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1347
totalAiIsmAdverbs
2
found
0
adverb
"slightly"
count
2
highlights
0
"slightly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
62.8805%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1347
totalAiIsms
5
found
0
word
"crystalline"
count
1
1
word
"glinting"
count
1
2
word
"oppressive"
count
1
3
word
"weight"
count
1
4
word
"etched"
count
1
highlights
0
"crystalline"
1
"glinting"
2
"oppressive"
3
"weight"
4
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
87
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
3
narrationSentences
87
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
1
"began to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
2
totalSentences
107
matches
0
"tunnels that collapse"
1
"is that?\" Thorne"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
42
mean
31.71
std
21.04
cv
0.663
sampleLengths
0
47
1
4
2
31
3
18
4
34
5
64
6
60
7
9
8
27
9
17
10
39
11
49
12
5
13
5
14
28
15
11
16
27
17
46
18
41
19
65
20
5
21
49
22
46
23
45
24
45
25
8
26
50
27
14
28
49
29
28
30
15
31
5
32
57
33
66
34
6
35
1
36
66
37
9
38
21
39
70
40
37
41
13
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
87
matches
0
"been closed"
1
"been disturbed"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
3
totalVerbs
182
matches
0
"was trying"
1
"was warning"
2
"was hammering"
98.7626%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
990
adjectiveStacks
0
stackExamples
(empty)
adverbCount
41
adverbRatio
0.04141414141414142
lyAdverbCount
16
lyAdverbRatio
0.01616161616161616
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
107
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
107
mean
12.45
std
8.06
cv
0.648
sampleLengths
0
18
1
29
2
1
3
3
4
17
5
11
6
3
7
14
8
4
9
23
10
11
11
4
12
15
13
12
14
17
15
4
16
12
17
14
18
8
19
19
20
10
21
9
22
9
23
15
24
12
25
17
26
10
27
29
28
6
29
7
30
7
31
11
32
18
33
5
34
5
35
16
36
12
37
10
38
1
39
27
40
7
41
18
42
17
43
2
44
2
45
17
46
24
47
17
48
10
49
12
69.7819%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
7
diversityRatio
0.45794392523364486
totalSentences
107
uniqueOpeners
49
50.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
4
fancyCount
2
fancyTags
0
"she said quietly (say quietly)"
1
"she agreed (agree)"
89.2104%