NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMR66REGV8A4ZRWXVVDVG3W
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
1m 40s
Cost:
$0.018237
Provider:
Alibaba
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air beneath Camden tasted of rust and wet iron, a metallic tang that coated the back of Harlow Quinn's throat. She clicked her torch on, the beam cutting through the gloom of the abandoned Tube station. This was the third location this year that matched the pattern of the Veil Market, though the stalls
were gone
now, leaving only chalk residues and the lingering scent of ozone. Quinn adjusted the worn leather
watch
on her left wrist, the habit grounding her as she stepped over the yellow tape. Eighteen years on the force had taught her that the quiet scenes were always the loudest. A body lay sprawled near the edge of the platform, face down on the grimy tiles. Quinn crouched, her knees cracking
softly
in the silence. She was forty-one, and her body remembered every chase, every fall, but her mind remained sharp, military precision governing every movement. She glanced at the woman kneeling beside the corpse. Eva Kowalski was not police, but Quinn had brought her in as a specialist consultant. The red curls
were tucked
behind her left ear, a nervous tic Quinn had observed three times in the last ten minutes. It is not an overdose, Detective, Eva said, her voice
echoing
slightly
against the curved ceiling. She pushed her round glasses up the bridge of her nose. The sigils on the floor are active. This was a ritual breach. Quinn kept her expression neutral. She did not trust the occult researcher, not fully. Quinn suspected the clique Eva ran with
was involved
in the criminal activity bleeding into her city, and the unexplained death of her partner, DS Morris, three years ago still hung over her like a shroud. But the evidence was strange, and Quinn needed answers. PC
Evans
, the uniformed officer securing the perimeter, stepped closer. Looks like a bad trip to me, Guv. There are paraphernalia in the satchel. Could be laced with something synthetic. Quinn ignored him and focused on the body. A man, mid-thirties. No visible wounds. His skin was pale, almost grey, but there was no cyanosis typical of suffocation or drugs. She reached out, gloved fingers hovering over the victim's jacket pocket. Something hard protruded from the fabric. Eva shifted her
weight
, the worn leather of her satchel creaking. The energy here is residual. Whatever opened the door didn't close properly. That is what killed him. Shock to the system. Or someone pushed him, Quinn said, her voice low. She pulled the object from the pocket. It was a small brass compass, the casing covered in a patina of verdigris. The face
was etched
with protective sigils that
seemed to
shift
slightly
in the torchlight. A Veil Compass. Eva leaned in, her green eyes widening. That is Shade artisan work. It points to rifts. If the needle is spinning, it means the portal is unstable. Quinn held the compass steady. The needle
was not spinning
. It
was locked
, pointing rigidly toward the tunnel wall behind them. If the portal was unstable, the needle would jitter. If the man had died from magical backlash, the compass would likely be shattered or drained. This was
pristine
.
Evans
shined his light on the chalk markings near the body. See? Circle of protection. He stepped inside it. Bad luck. Quinn stood up, her height giving her a vantage point over the scene. She walked the perimeter, her boots clicking on the tiles. She stopped at the chalk line. It was broken. Not faded, not smudged by foot traffic, but intentionally severed on the northern arc. Eva followed her gaze. That breaks the containment. It would allow energy to escape. Exactly, Quinn said. But look at the dust. She pointed her torch at the floor near the break. The grime was undisturbed except for the victim's footprints. There were no other prints leading away from the circle. If the man broke the circle himself, he died instantly. If someone else broke it, they didn't walk away. Maybe they floated,
Evans
suggested, though he sounded less certain now. Quinn knelt again, examining the victim's hands. Under the fingernails, there was grey dust. Not tile dust. Brick dust. She looked at the wall where the compass needle pointed. There was a maintenance hatch there, painted over decades ago. He did not die from the ritual, Quinn said, standing up and brushing off her knees. He was killed before he was placed here. The ritual
was staged
. Eva frowned, tucking a stray curl behind her ear again. But the energy readings... Are residual, Quinn confirmed. From the market itself. Someone knew this location moves every full moon. They knew the energy would be high tonight. They used the atmosphere to mask the cause of death. She turned to the compass in her hand. The needle pointed at the hatch. He
was trying
to find the exit. Someone wanted him to stay in the circle. Quinn walked to the hatch. She ran a hand over the paint. Fresh scratches near the lock. He
tried to
open it. He
was locked
in. So it was murder, Eva said, her voice quiet. Not an accident. Quinn looked back at the body. The sharp line of her jaw tightened. This was not just murder. It was a message. The clique knew she
was sniffing
around. They
were using
the supernatural to hide mundane violence, betting on the police dismissing it as weirdness. They had almost succeeded.
Evans
was ready to call it an overdose. Eva was ready to call it magic. Quinn pocketed the compass. It was evidence now. She looked at Eva, noting the way the younger woman clutched her satchel full of books. Eva might know the magic, but Quinn knew the criminals. Bag the chalk, Quinn ordered
Evans
. Do not let anyone step in the circle. And get a forensics team down here to check the hatch for prints. What about the... supernatural aspect?
Evans
asked, hesitating. Quinn met his eyes. We process the physical evidence. The rest is theory. She turned to Eva. You said this market moves? Every full moon, Eva replied. Then we have until then to find out who sold him the ticket. Quinn checked her
watch
again. The leather was cool against her skin. She thought of Morris, of the files she kept locked in her desk drawer that made no sense. This compass was a thread. She would pull it until the whole
tapestry
unraveled. Let is go, Quinn said. There is nothing left for us in the dark. She led the way back toward the surface, leaving the body in the gloom. The mystery was not in the magic. It was in the motive. And Harlow Quinn never left a motive undiscovered.
created_at
"2026-02-17T02:53:46.57075+00:00"
2/17/2026, 2:53:46 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
651
moderation_latency
(null)
generation_time
100431
tokens_prompt
923
tokens_completion
5360
native_tokens_prompt
905
native_tokens_completion
4915
native_tokens_completion_images
(null)
native_tokens_reasoning
3469
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.018237
router
(null)
provider_responses
0
id
"chatcmpl-84bd03ac-405b-942a-b8d0-25319a647875"
status
200
is_byok
false
latency
651
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771296725-3Pud6A1kQj2LPOapjrf2"
upstream_id
"chatcmpl-84bd03ac-405b-942a-b8d0-25319a647875"
total_cost
0.018237
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
0
adverbTagCount
0
adverbTags
(empty)
73.3570%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1126
totalAiIsmAdverbs
3
found
0
adverb
"slightly"
count
2
1
adverb
"softly"
count
1
highlights
0
"slightly"
1
"softly"
80.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
0
"Evans"
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
55.5950%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1126
totalAiIsms
5
found
0
word
"tapestry"
count
1
1
word
"pristine"
count
1
2
word
"weight"
count
1
3
word
"echoing"
count
1
4
word
"etched"
count
1
highlights
0
"tapestry"
1
"pristine"
2
"weight"
3
"echoing"
4
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
134
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
2
narrationSentences
134
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
1
"tried to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
134
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
30
mean
37.47
std
23.47
cv
0.627
sampleLengths
0
105
1
92
2
39
3
59
4
30
5
47
6
32
7
48
8
27
9
49
10
21
11
46
12
14
13
56
14
11
15
39
16
28
17
14
18
63
19
26
20
12
21
65
22
34
23
27
24
8
25
22
26
5
27
57
28
14
29
34
93.5323%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
8
totalSentences
134
matches
0
"were gone"
1
"were tucked"
2
"was involved"
3
"was etched"
4
"was locked"
5
"was killed was placed"
6
"was staged"
7
"was locked"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
4
totalVerbs
216
matches
0
"was not spinning"
1
"was trying"
2
"was sniffing"
3
"were using"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1126
adjectiveStacks
0
stackExamples
(empty)
adverbCount
23
adverbRatio
0.020426287744227355
lyAdverbCount
11
lyAdverbRatio
0.009769094138543518
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
134
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
134
mean
8.39
std
5.47
cv
0.652
sampleLengths
0
21
1
16
2
31
3
21
4
16
5
16
6
9
7
21
8
9
9
15
10
22
11
16
12
11
13
7
14
5
15
5
16
9
17
36
18
9
19
10
20
8
21
6
22
6
23
8
24
3
25
3
26
16
27
11
28
6
29
11
30
5
31
7
32
5
33
4
34
9
35
7
36
14
37
15
38
3
39
7
40
5
41
4
42
11
43
5
44
5
45
11
46
9
47
16
48
3
49
11
52.4876%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
8
diversityRatio
0.35074626865671643
totalSentences
134
uniqueOpeners
47
100.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
0
fancyCount
0
fancyTags
(empty)
91.4689%