NC Bench
Run: 2026-02-17-01KHMMTT3ASAQKQVTAVJXGEHH1
Run Details
Model: Mistral Large
Test: Bad Writing Habits
Scenario: Mystery: examining a crime scene
Duration: 26.4s
Cost: $0.009694
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. 
She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The abandoned Tube station beneath Camden smelled of damp concrete and rust, the kind of scent that clung to the back of your throat like a bad memory. Detective Harlow Quinn stepped off the last rung of the maintenance ladder, her boots hitting the platform with a sharp *clack* that echoed through the hollow darkness. The beam of her torch cut through the gloom, slicing across peeling posters and graffiti-scarred tiles before landing on the body.

It lay sprawled near the edge of the platform, one arm twisted beneath it, the other flung out as if reaching for something just beyond its grasp. A man—mid-thirties, dark hair, cheap suit now stained with something darker. His face was frozen in an expression Quinn couldn’t quite read. Not fear. Not pain. Something closer to *recognition*, like he’d seen the thing that killed him and understood, too late, what it meant.

“You’re late.”

Quinn didn’t turn. She knew the voice. Knew the way it carried that edge of amusement, like the world was a joke only a few were in on. “Had to park three streets over. Someone blocked the entrance with a van.”

“Ah. That’d be me.” Detective Sergeant Elias Carter stepped into the torchlight, his hands tucked into the pockets of his overcoat. He was younger than Quinn by a decade, all sharp cheekbones and easy grins, the kind of cop who charmed witnesses into confessions before they realized they’d said too much. “Thought you’d appreciate the privacy.”

Quinn crouched beside the body, her leather watch creaking as she reached for the man’s wrist. No pulse, of course. Rigor had already set in. “You called it in as a suspicious death. Why?”

Carter shrugged. “No obvious cause. No signs of struggle. And look at his hands.”

Quinn did. The man’s fingers were clean, nails neatly trimmed. No defensive wounds. No dirt under them. Just… nothing. She shifted her torch to his face. His eyes were open, glassy, the pupils dilated. She leaned in, ignoring the way Carter exhaled sharply behind her.

“What?” he asked.

“No petechial hemorrhaging.” She tilted the man’s head gently. “No ligature marks. No bruising around the throat. If this was strangulation, there’d be *something*.”

“Maybe it was quick.”

“Quick enough to leave no trace?” Quinn stood, brushing her palms against her trousers. “You said there were no signs of struggle. What about the scene?”

Carter gestured to the platform. “No scuff marks. No overturned debris. Just him, lying here like he sat down for a nap and never woke up.”

Quinn’s torch flickered over the tiles. The beam caught something—a glint near the body’s outstretched hand. She stepped closer, crouching again. A small, circular object lay half-hidden beneath the man’s fingers. Brass, tarnished with age, etched with strange symbols.

“What the hell is that?” Carter muttered.

Quinn didn’t answer. She pulled a pen from her pocket and used it to nudge the object free. A compass. But not like any compass she’d seen before. The needle wasn’t pointing north. It wasn’t pointing at anything, in fact. It spun lazily, as if caught in a current only it could feel.

“That’s… not normal,” Carter said.

Quinn turned the compass over. The back was smooth, unmarked. No manufacturer’s stamp. No serial number. Just brass and those strange, looping sigils. She slipped it into an evidence bag, sealing it with a sharp *rip* of plastic.

Carter was watching her, his expression unreadable. “You’re not going to tell me what you’re thinking?”

Quinn stood, tucking the bag into her coat pocket. “I’m thinking this isn’t our first weird death this month.”

“You’re still on that?” Carter’s voice was light, but there was an edge to it. “The others were overdoses. Bad batches. You know how it is.”

“Three overdoses in two weeks, all in the same area, all with no prior history of drug use?” Quinn shook her head. “And now this. A man with no visible injuries, no signs of foul play, holding a compass that doesn’t work like a compass should.”

Carter exhaled, running a hand through his hair. “You’re reaching, Quinn. This is Camden. Weird shit happens here all the time.”

“Not like this.” Quinn’s torch beam swept over the platform again, lingering on the shadows clinging to the far wall. Something about them didn’t sit right. They were too *thick*, like ink spilled across the tiles. She stepped closer, her boots scuffing against the concrete.

Carter followed. “What are you looking at?”

“Those marks.” Quinn crouched again, running her fingers along the edge of the shadow. It wasn’t a stain. It wasn’t dirt. It was *cold*. Like touching the inside of a freezer. “This isn’t graffiti.”

“Then what is it?”

Quinn didn’t answer. She pulled out her phone, snapping a few photos before standing. “We need to canvass the area. See if anyone saw anything.”

Carter crossed his arms. “You’re not going to tell me what’s really going on in that head of yours, are you?”

Quinn met his gaze. “Not yet.”

Because the truth was, she didn’t know. Not entirely. But she knew this: the compass, the shadows, the way the air in the station felt *wrong*—like the moment before a storm breaks. And she knew that three years ago, in a flat not far from here, her partner had died in a way no one could explain. A way that had left the room colder than it should have been. A way that had made the paramedics whisper about *supernatural* causes before the official report had buried it under layers of bureaucratic bullshit.

Quinn had spent the last three years pretending she didn’t believe in any of that.

But the compass in her pocket was spinning again.

And this time, it was pointing at her.
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags: 3
adverbTagCount: 0
adverbTags: (empty)
58.5492%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount: 965
totalAiIsmAdverbs: 4
found: "gently" (1), "lazily" (1), "sharply" (1), "really" (1)
highlights: "gently", "lazily", "sharply", "really"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
48.1865%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount: 965
totalAiIsms: 5
found: "flickered" (1), "glint" (1), "unreadable" (1), "echoed" (1), "etched" (1)
highlights: "flickered", "glint", "unreadable", "echoed", "etched"
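The two frequency scores above (58.5492% for AI-ism adverbs, 48.1865% for AI-ism words) are consistent with a linear penalty measured against the 1% target. The sketch below reproduces both figures from the reported counts. The formula and the name `frequency_score` are assumptions inferred from the numbers on this page, not taken from the benchmark's own code.

```python
def frequency_score(hits: int, word_count: int, target_rate: float = 0.01) -> float:
    """Assumed rule: the score falls linearly from 1.0 to 0.0 as the hit rate
    rises from 0 to the target rate (1% here)."""
    rate = hits / word_count
    return max(0.0, 1.0 - rate / target_rate)


# Counts reported on this page: 4 tracked adverbs and 5 tracked words in 965 words.
adverb_score = frequency_score(4, 965)
word_score = frequency_score(5, 965)

print(f"{adverb_score:.4%}")
print(f"{word_score:.4%}")
```

Under this reading, a single extra flagged word in a text of this length costs roughly ten percentage points, which would explain why a handful of hits pulls both scores well below 100%.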
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches: 0
maxInWindow: 0
found: (empty)
highlights: (empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells: 0
narrationSentences: 79
matches: (empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount: 1
hedgeCount: 0
narrationSentences: 79
filterMatches: "watch"
hedgeMatches: (empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount: 2
totalSentences: 99
matches: "carried that edge", "is that?” Carter"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs: 34
mean: 28.24
std: 22.41
cv: 0.794
sampleLengths: 76, 72, 2, 41, 56, 34, 14, 45, 3, 24, 4, 26, 26, 39, 7, 53, 5, 38, 16, 19, 26, 46, 21, 45, 7, 34, 4, 25, 21, 6, 93, 15, 9, 8
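The coefficient of variation (CV) reported for paragraph lengths is the standard deviation divided by the mean. As a check, the sketch below recomputes it from the 34 paragraph word counts listed above. It assumes the population standard deviation, which matches the reported 22.41; the sample standard deviation would give about 22.74 instead.

```python
import statistics

# Paragraph word counts as listed under sampleLengths for this run.
paragraph_lengths = [
    76, 72, 2, 41, 56, 34, 14, 45, 3, 24, 4, 26, 26, 39, 7, 53, 5,
    38, 16, 19, 26, 46, 21, 45, 7, 34, 4, 25, 21, 6, 93, 15, 9, 8,
]

mean = statistics.mean(paragraph_lengths)
std = statistics.pstdev(paragraph_lengths)  # population std (assumed)
cv = std / mean

print(round(mean, 2), round(std, 2), round(cv, 3))
```

A CV of 0.794 clears the 0.5 target because single-line exchanges of 2 to 7 words sit alongside narrative paragraphs of 70 words or more.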
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount: 1
totalSentences: 79
matches: "was frozen"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount: 5
totalVerbs: 123
matches: "wasn’t pointing", "wasn’t pointing", "was watching", "was spinning", "was pointing"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount: 714
adjectiveStacks: 0
stackExamples: (empty)
adverbCount: 23
adverbRatio: 0.03221288515406162
lyAdverbCount: 7
lyAdverbRatio: 0.00980392156862745
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences: 99
echoCount: 0
echoWords: (empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences: 99
mean: 9.7
std: 7.12
cv: 0.734
sampleLengths (first 50): 28, 27, 21, 27, 11, 11, 2, 2, 19, 2, 3, 4, 21, 13, 21, 30, 5, 16, 4, 5, 9, 2, 12, 2, 8, 3, 4, 1, 1, 7, 8, 11, 3, 9, 15, 4, 14, 12, 5, 21, 6, 10, 5, 10, 8, 7, 3, 15, 2, 8
52.5253%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats: 8
diversityRatio: 0.36363636363636365
totalSentences: 99
uniqueOpeners: 36
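The reported diversityRatio is simply the unique opener count divided by the sentence count. The short check below reproduces it and compares it with the 60% target. How the evaluator extracts an "opener" from each sentence, and how it turns the ratio and the 8 consecutive repeats into the 52.5253% score, is not shown on this page and is not modeled here.

```python
# Counts reported on this page for the sentence opener test.
unique_openers = 36
total_sentences = 99
target_ratio = 0.60  # "Target: >=60% unique sentence openers"

diversity_ratio = unique_openers / total_sentences
meets_target = diversity_ratio >= target_ratio

print(f"{diversity_ratio:.4f}", meets_target)
```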
91.6667%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags: 3
fancyCount: 1
fancyTags: "muttered (mutter)"
91.2310%