NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMR4M7EDEDWXWP8NNA6W8EB
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
1m 41s
Cost:
$0.015353
Provider:
Alibaba
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The damp cold of the abandoned Tube station seeped through the sole of Detective Harlow Quinn's boots, a persistent chill that had nothing to do with the London weather above. She checked her
watch
, the worn leather strap stiff against her wrist, the face glowing faintly in the beam of her torch. 03:14. Enough time for the shift change to bury the paperwork, not enough time to make sense of the corpse lying ten feet away on the tracks. Quinn holstered her light and stepped forward, her movement precise, economical. At five-nine, she towered over the scene techs who were currently packing up their kits with uneasy glances toward the tunnel shadows. She crouched, the knees of her trousers grazing the ballast. The body was a male, mid-thirties, dressed in a suit that cost more than Quinn's annual salary. There was no blood. No bruising. No sign of struggle. He looked like he
was sleeping
, save for the pallor of his skin and the stillness of his chest. Cause of death remains unknown, the preliminary report had said. Quinn hated those words. They were a placeholder for failure. You're looking at it wrong, Detective. The voice came from behind her, slight and tremulous. Quinn didn't startle. She turned her head
slowly
, her sharp jawline cutting a hard profile against the tunnel wall. Eva Kowalski stood near the edge of the platform, clutching a worn leather satchel to her chest like a shield. The young woman's curly red hair
was frizzing
in the humidity, and she pushed her round glasses up the bridge of her nose with a shaking hand. Quinn stood, brushing dust from her hands. I'm looking at a dead man in a closed station, Kowalski. That's the only way that matters. Eva tucked a stray curl behind her left ear, a nervous tic Quinn had noted in their previous meetings. The British Museum researcher wasn't police, but she had clearance Quinn didn't fully understand, granted by superiors who spoke in hushed tones about restricted archives. It's not a closed station, Eva said, her green eyes wide behind the lenses. Not
really
. Not tonight. Quinn crossed her arms. Explain. Eva stepped closer, though she kept her distance from the body. The air pressure dropped. Can't you
feel
it? The static. It's not electrical. It's residual energy. This location... it's a bleed point. Quinn scanned the tiled walls. White ceramic, cracked and grimy, covered in decades of graffiti. Nothing suggested a bleed point whatever that meant. It suggested neglect. It suggested a place where things went to die. Three years ago, DS Morris had vanished near a similar location. The file was cold, sealed, and buried under layers of bureaucratic concrete. Quinn hadn't forgotten. Energy doesn't kill men in suits, Quinn said flatly. Bullets do. Knives do. Poison does. Sometimes, Eva insisted, her voice gaining a fraction of strength. Sometimes the door opens where it shouldn't. She unbuckled her satchel and rummaged inside, pulling out a small brass object. Quinn's
eyes narrowed
. It was a compass, but unlike any navigation tool she'd seen in eighteen years of service. The casing was thick with patina, a deep verdigris eating at the edges of the metal. The face
was etched
with sigils that hurt the eyes if she focused on them too long. Where did you get that? Quinn asked, her hand drifting instinctively toward her belt. Evidence, Eva said
quickly
. Seized from a suspect last week. It's attuned. Watch. Eva set the compass on a flat concrete slab near the tracks. The needle didn't settle north. It spun, lazy at first, then accelerating into a blur. It
wasn't searching
for
magnetic
pole. It
was hunting
. Quinn
watch
ed the needle. She
watch
ed the dust motes
dancing
in the beam of the overhead work light. She
watch
ed Eva's hands, trembling
slightly
as she tucked her hair behind her ear again. It points to rifts, Eva whispered. Portals. Quinn leaned in. The needle
was vibrating
now, humming against the concrete. It pointed directly at the wall behind the corpse. Quinn moved around the body, positioning herself between the dead man and the tiles. She pulled her torch again, sweeping the beam across the ceramic. Nothing. Just grout and grime. If there was a door, Quinn said, there would be a draft. A change in temperature. Disturbance in the dust. She shone the light at the base of the wall. The dust layer was thick, undisturbed gray fuzz accumulated over years of abandonment. Except for one spot. Directly behind the victim's heels, the dust
was swept
clean in a perfect arc. Quinn knelt again, pulling a magnifying loupe from her pocket. She inspected the tiles. There were microscopic scratches on the glaze, faint lines radiating from a central point hidden behind the man's shoulder. He didn't walk here, Quinn murmured. No, Eva said. He fell out. Quinn ignored the implication. She focused on the physical. The dust arc. The scratches. The position of the body. He landed on his back. His shoes were clean, but the soles showed scuff marks consistent with friction burns, not walking. Quinn stood and walked to the wall. She pressed her palm against the tiles. Cold. Solid. But the compass needle spiked, swinging wildly toward her hand before settling back on the wall. You said this market moves, Quinn said, recalling the intel she'd scraped together from informants who smelled of sulfur and fear. The Veil Market. Eva nodded, clutching her satchel tighter. It moves every full moon. Tonight is the full moon. This station... it's the anchor. Quinn looked at the dead man's hands. His fingers
were curled
inward, gripping nothing. She pried the left hand open. Embedded in the palm, pressed deep into the flesh as if it had
been forced
there by immense pressure, was a sliver of white material. Bone. Quinn pulled her multi-tool and
carefully
extracted the fragment. It was small, carved with a rune that matched the
etching
on the compass face. Entry requirement, Quinn said, her voice low. A bone token. He
tried to
leave, Eva said
softly
. But the market closes. If you're still inside when the shift happens... You get left behind, Quinn finished. Or you get spit out. She looked from the bone fragment to the compass, then to Eva. The young researcher
was watching
her with a mixture of fear and hope. Eva knew the rules of this world. Quinn only knew the rules of evidence. But here, the evidence
was rewriting
the rulebook. The dust wasn't disturbed by wind. It
was displaced
by displacement. Matter moving where matter shouldn't be. The friction burns on the shoes confirmed high-velocity arrival. The bone token confirmed intent. This wasn't a murder scene in the traditional sense. It was an accident. A border control incident. Quinn dropped the bone fragment into an evidence bag and sealed it. She pocketed the compass, ignoring Eva's sharp intake of breath. I'm confiscating this, Quinn said. But you need it to find the others, Eva said, stepping forward. If the market moved, there might be more left behind. More people who didn't make the exit. Quinn looked at the compass in her hand. The verdigris felt warm against her skin. She thought of Morris. Three years. No body. No evidence. Just a gap in the world where a partner used to be. If there are others, Quinn said, meeting Eva's green eyes, I'll find them. But I do it my way. No sigils. No rituals. We follow the dust. Eva hesitated, then nodded, tucking her hair behind her ear one last time. Okay. Quinn turned back to the tunnel, the compass heavy in her pocket. The needle pressed against her thigh, vibrating like a second heartbeat. The evidence didn't add up to a standard homicide, but it added up to something. A pattern. A location. A method. She clicked off her torch, plunging the platform into darkness save for the work lights. The shadows
seemed to
stretch longer than they should, reaching for the tracks. Quinn didn't flinch. She checked her
watch
again. 03:22. Time to go to work.
created_at
"2026-02-17T02:52:55.687078+00:00"
2/17/2026, 2:52:55 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
695
moderation_latency
(null)
generation_time
101201
tokens_prompt
923
tokens_completion
4473
native_tokens_prompt
905
native_tokens_completion
4114
native_tokens_completion_images
(null)
native_tokens_reasoning
2293
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0153534
router
(null)
provider_responses
0
id
"chatcmpl-124746c2-baf0-98d2-bf02-ed547760feb5"
status
200
is_byok
false
latency
695
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771296674-ZACom9u4qCrftoy6parx"
upstream_id
"chatcmpl-124746c2-baf0-98d2-bf02-ed547760feb5"
total_cost
0.0153534
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
0
adverbTagCount
0
adverbTags
(empty)
55.7522%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1356
totalAiIsmAdverbs
6
found
0
adverb
"carefully"
count
1
1
adverb
"quickly"
count
1
2
adverb
"slightly"
count
1
3
adverb
"slowly"
count
1
4
adverb
"softly"
count
1
5
adverb
"really"
count
1
highlights
0
"carefully"
1
"quickly"
2
"slightly"
3
"slowly"
4
"softly"
5
"really"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
70.5015%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1356
totalAiIsms
4
found
0
word
"magnetic"
count
1
1
word
"dancing"
count
1
2
word
"etched"
count
1
3
word
"etching"
count
1
highlights
0
"magnetic"
1
"dancing"
2
"etched"
3
"etching"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"eyes widened/narrowed"
count
1
highlights
0
"eyes narrowed"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
172
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
3
hedgeCount
2
narrationSentences
172
filterMatches
0
"watch"
1
"feel"
hedgeMatches
0
"tried to"
1
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
172
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
47
mean
28.79
std
20.72
cv
0.72
sampleLengths
0
79
1
89
2
20
3
6
4
75
5
24
6
44
7
18
8
5
9
33
10
61
11
15
12
17
13
65
14
14
15
13
16
36
17
33
18
7
19
46
20
5
21
20
22
41
23
33
24
6
25
6
26
40
27
32
28
24
29
21
30
45
31
1
32
24
33
10
34
19
35
11
36
47
37
48
38
22
39
5
40
29
41
37
42
27
43
14
44
44
45
37
46
5
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
5
totalSentences
172
matches
0
"was etched"
1
"was swept"
2
"were curled"
3
"been forced"
4
"was displaced"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
7
totalVerbs
243
matches
0
"was sleeping"
1
"was frizzing"
2
"wasn't searching"
3
"was hunting"
4
"was vibrating"
5
"was watching"
6
"was rewriting"
94.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1356
adjectiveStacks
1
stackExamples
0
"thick, undisturbed gray fuzz"
adverbCount
37
adverbRatio
0.02728613569321534
lyAdverbCount
17
lyAdverbRatio
0.012536873156342183
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
172
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
172
mean
7.87
std
5.94
cv
0.755
sampleLengths
0
30
1
22
2
1
3
26
4
11
5
22
6
10
7
17
8
4
9
2
10
4
11
19
12
10
13
4
14
6
15
6
16
9
17
3
18
16
19
20
20
27
21
7
22
11
23
6
24
19
25
25
26
14
27
2
28
2
29
4
30
1
31
11
32
4
33
4
34
2
35
3
36
3
37
2
38
4
39
5
40
10
41
8
42
3
43
9
44
11
45
12
46
3
47
9
48
2
49
2
40.9804%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
17
diversityRatio
0.3058823529411765
totalSentences
170
uniqueOpeners
52
100.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
0
fancyCount
0
fancyTags
(empty)
91.8373%