NC Bench
Run: 2026-02-17-01KHMR95ZFJT474Y2S7HJSVYKF
Run Details
Model: GPT-4.1 Mini
Test: Bad Writing Habits
Scenario: Mystery: examining a crime scene
Duration: 17.5s
Cost: $0.002614
Provider: OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. 
She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn’s boots echoed sharply against the cold concrete platform of the abandoned Tube station. The flickering overhead lights cast long, uncertain shadows, twisting the angular shapes of rusting iron beams and cracked tiles into something unfamiliar—alien. This was supposed to be a crime scene, not a forgotten relic. Yet here she was, feet planted in the underbelly of Camden, staring down at a body sprawled awkwardly near a pillar smeared with years of grime and graffiti.

The victim, a man in his late thirties, wore a tailored suit stained with dark, coagulating blood. His face was obscured by a jagged gash running from temple to jaw, but something about the scene gnawed at Quinn. Around the victim lay a scatter of strange objects—shards of glass coated with a dull green film, a smudged playing card depicting a black jack, and, curiously, a small brass compass with a verdigris patina.

Detective Quinn crouched, her sharp brown eyes narrowing. The watch on her left wrist caught the flicker of the overhead bulb; its worn leather strap was familiar, a steady companion through years of investigations. She reached out to the compass, turning it over carefully. Its face was etched with complex sigils, and the needle trembled slightly, pointing not north but awkwardly—toward the tunnel leading deeper into the darkness she barely dared to glance at.

Her partner, DS Patel, stood back, arms folded, peering over her shoulder with a frown. “Looks straightforward,” he said, voice low. “Altercation gone wrong. Maybe a mugging.”

“Except the things don’t add up,” Quinn muttered, eyes scanning the surroundings once more. Her gaze settled on a faint shimmer in the air near the pillar—subtle, like heat rising off asphalt, but colder somehow. “Who leaves behind shards like these?” She picked up a piece cautiously. The glass was brittle yet warm to the touch, etched with unfamiliar symbols that pulsed faintly under the light.

Patel glanced at the glass, skepticism plain on his face. “Looks like junk. Maybe drug paraphernalia? Maybe some prankster’s mess.”

Quinn shook her head. “No ordinary junk. Look at the residue. That’s not chemical. It’s... something else.” She glanced over her shoulder and spotted a figure approaching, her silhouette framed by the dim light.

Eva Kowalski, with her curly red hair escaping in wild tendrils, adjusted her round glasses and hurried forward, her satchel bumping against her hip. “Detective Quinn,” she said, breathless but steady. “I got the message. What are we looking at here?”

“A mess. But not the usual kind,” Quinn replied curtly.

Eva dropped to her knees beside the victim, tugging a notebook from her satchel. “Those shards are alchemical,” she said, voice low and reverent. “I’ve seen them before in the British Museum’s restricted archives. They’re fragments of what’s called a Veil Shard—components used in portals or rift seals, but they're rare, dangerous.”

Quinn raised an eyebrow. “Portals?”

Eva nodded, eyes bright. “To places beyond normal reality. Other realms, dimensions. And that compass,” she reached in to touch the brass casing gently, “it’s a Veil Compass. Crafted by a Shade artisan. It points to supernatural rifts or portals nearby. It’s not on just any black market; you only get something like this at the Veil Market.”

Quinn’s jaw tightened. The Veil Market. She’d heard whispers—an underground black market that moved locations with the full moon, trading in goods outlawed by more than just the police. A place where magic and crime intertwined dangerously. What was a Veil Compass doing here?

“And the victim?” Patel asked, stepping closer. “Does this tell us anything about him?”

Eva glanced up sharply. “If he’s carrying a Veil Compass and a shattered Veil Shard—he was either chasing or protecting something supernatural. Maybe he was involved with the Market.”

Quinn’s mind raced. Her foot tapped swiftly, restless. She carefully lifted the playing card. The black jack’s eyes seemed to glint menacingly in the flickering light. “Or someone used him,” she said quietly.

Patel frowned. “Look, we’ve got a man dead in an abandoned Tube station. Best bet is to keep it simple. Mugging, drugs, gang stuff. Magical mumbo jumbo complicates things.”

Quinn pulled herself up, body taut, scanning the station once more. The shimmer near the pillar didn’t fade, almost as if the air itself was stretched thin, a veil between realities. And in that slight distortion lay answers others were blind to.

“No,” she said slowly, voice hushed. “This isn’t gang violence. This is a warning. The victim was caught between worlds. Someone wanted to send a message.”

Eva leaned in, lowering her voice further. “The Market moves every full moon, switching locations to evade detection. But signs like these... they’re traces. Residual energies from broken seals or failed portal attempts.”

Quinn’s eyes settled on the compass needle as it oscillated then swung firmly toward a bricked-up tunnel entrance at the far end of the platform. Her heart quickened. “There’s something behind that wall.”

Patel shrugged, unconvinced. “Bricked-up for a reason. Whoever put the body here wanted people to stay away.”

“Or wanted to lure us here,” Eva whispered, eyes fixed on the needle.

Quinn glanced at both of them, took a deep breath. “We need to reopen that tunnel. This...,” she held up the Veil Compass like a key, “is pointing us somewhere no one’s really looking.”

Patel’s skepticism didn’t waver. “This isn’t police procedure, Harlow.”

“Then it’s time we start following evidence, not protocol,” she replied, voice sharp like the edge of her jaw. The clockwork of her mind clicked forward, a puzzle reshaping itself. “I lost a partner to something like this—unexplainable, slipping through the cracks. We’re missing the bigger picture here. Get forensic. Search for fingerprints on the compass, the shards, the card. And I want a full report on missing persons linked to the Veil Market.”

Eva sat back on her heels, watching Quinn’s penetrating gaze with something like hope. “You’re one of the few who still looks.”

Quinn nodded once, decisively. “Because sometimes, what’s left unseen is the most dangerous.”

She stood, muscles coiled, listening as the faint echo of footsteps faded deeper into the tunnel’s bricked barrier. Somewhere beyond it, hidden in the darkness between worlds, the truth waited. And this time, she was ready to pull back the veil.
created_at: "2026-02-17T02:54:01.331339+00:00" (2/17/2026, 2:54:01 AM)
model: "openai/gpt-4.1-mini-2025-04-14"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 479
moderation_latency: (null)
generation_time: 17384
tokens_prompt: 923
tokens_completion: 1737
native_tokens_prompt: 836
native_tokens_completion: 1425
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.0026144
router: (null)
provider_responses:
0:
id: "resp_006f317eee390ef7016993d8375c7c8197bfe1a1766c9d81ec"
status: 200
is_byok: false
latency: 117
endpoint_id: "872eccb7-9c85-45fc-974a-ff7c8e2407e6"
provider_name: "OpenAI"
model_permaslug: "openai/gpt-4.1-mini-2025-04-14"
api_type: "completions"
id: "gen-1771296823-q0QNLpeZFYv9xro6EjOT"
upstream_id: "resp_006f317eee390ef7016993d8375c7c8197bfe1a1766c9d81ec"
total_cost: 0.0026144
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
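The reported cost can be cross-checked against the native token counts in the metadata. The sketch below assumes per-token rates of $0.40 per million input tokens and $1.60 per million output tokens; those rates and the helper name `estimate_cost` are assumptions, not values stated in this run record.

```python
# Assumed rates in USD per 1M tokens (not part of the run record).
INPUT_USD_PER_MTOK = 0.40
OUTPUT_USD_PER_MTOK = 1.60


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    total_micro_usd = (prompt_tokens * INPUT_USD_PER_MTOK
                       + completion_tokens * OUTPUT_USD_PER_MTOK)
    return total_micro_usd / 1_000_000


# native_tokens_prompt = 836, native_tokens_completion = 1425
cost = estimate_cost(836, 1425)
print(f"{cost:.7f}")
```

Under the assumed rates, the estimate agrees with the reported total_cost of 0.0026144.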
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags: 12
adverbTagCount: 5
adverbTags:
0: "Quinn muttered, eyes scanning the surroundings once more [once more]"
1: "Quinn replied curtly [curtly]"
2: "she reached in to touch the brass casing gently [gently]"
3: "she said quietly [quietly]"
4: "she said slowly, voice hushed [slowly]"
14.5299%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount: 1053
totalAiIsmAdverbs: 9
found:
0: adverb "carefully", count 2
1: adverb "cautiously", count 1
2: adverb "gently", count 1
3: adverb "sharply", count 2
4: adverb "slightly", count 1
5: adverb "slowly", count 1
6: adverb "really", count 1
highlights: "carefully", "cautiously", "gently", "sharply", "slightly", "slowly", "really"
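A tally like the one above could be produced with a whole-word count over a tracked list. This is a minimal sketch only: the `TRACKED` set is a hypothetical subset (the evaluator's full 57-word list is not shown), and the tokenization rule is an assumption that may not match the evaluator's word count.

```python
import re
from collections import Counter

# Hypothetical subset of the tracked adverbs; the full list is not shown here.
TRACKED = {"carefully", "cautiously", "gently", "sharply",
           "slightly", "slowly", "really"}


def count_tracked(text: str, tracked: set[str]) -> tuple[Counter, float]:
    """Return per-word hit counts and the share of all words that are tracked."""
    # Assumed tokenization: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    hits = Counter(w for w in words if w in tracked)
    share = sum(hits.values()) / len(words) if words else 0.0
    return hits, share


sample = "She carefully lifted the playing card. Eva glanced up sharply."
hits, share = count_tracked(sample, TRACKED)
print(dict(hits), round(share, 3))
```

For this run, 9 hits over 1053 words works out to roughly 0.85% of words.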
80.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions: (empty)
found: "Patel"
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
24.0266%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount: 1053
totalAiIsms: 8
found:
0: word "flicker", count 1
1: word "glint", count 1
2: word "trembled", count 1
3: word "footsteps", count 1
4: word "echo", count 1
5: word "echoed", count 1
6: word "etched", count 2
highlights: "flicker", "glint", "trembled", "footsteps", "echo", "echoed", "etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches: 0
maxInWindow: 0
found: (empty)
highlights: (empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells: 0
narrationSentences: 60
matches: (empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount: 0
hedgeCount: 1
narrationSentences: 60
filterMatches: (empty)
hedgeMatches: "seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount: 1
totalSentences: 76
matches: "reopen that tunnel"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs: 29
mean: 36
std: 21.05
cv: 0.585
sampleLengths: 78, 73, 74, 27, 66, 20, 34, 41, 10, 52, 5, 58, 44, 14, 29, 33, 29, 42, 26, 33, 33, 17, 13, 34, 9, 74, 22, 13, 41
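The mean, standard deviation, and coefficient of variation (CV) reported for paragraph lengths can be recomputed from the sample lengths listed above. This is a sketch that assumes the evaluator uses the population standard deviation (`statistics.pstdev`); that choice is inferred because it reproduces the reported figures, not confirmed by the report itself.

```python
from statistics import mean, pstdev

# Paragraph word counts as listed in sampleLengths (29 paragraphs).
paragraph_lengths = [
    78, 73, 74, 27, 66, 20, 34, 41, 10, 52, 5, 58, 44, 14, 29,
    33, 29, 42, 26, 33, 33, 17, 13, 34, 9, 74, 22, 13, 41,
]

avg = mean(paragraph_lengths)
std = pstdev(paragraph_lengths)  # population standard deviation (assumed)
cv = std / avg                   # coefficient of variation

print(avg, round(std, 2), round(cv, 3))
print("meets target CV >= 0.5:", cv >= 0.5)
```

With the sample standard deviation instead, the result would come out slightly higher, so the reported 21.05 is consistent with the population form.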
88.8889%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount: 4
totalSentences: 60
matches: "was supposed", "was obscured", "was etched", "was stretched"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount: 0
totalVerbs: 126
matches: (empty)
79.8803%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount: 682
adjectiveStacks: 0
stackExamples: (empty)
adverbCount: 35
adverbRatio: 0.051319648093841645
lyAdverbCount: 22
lyAdverbRatio: 0.03225806451612903
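The two ratios above follow directly from the counts and can be checked against the stated targets. The sketch below only reproduces the ratios; how the evaluator converts them into the 79.8803% score is not shown in this report and is not modeled here.

```python
# Counts as reported for this evaluator.
word_count = 682
adverb_count = 35
ly_adverb_count = 22

adverb_ratio = adverb_count / word_count
ly_adverb_ratio = ly_adverb_count / word_count

# Targets from the evaluator's target line: <4% adverbs, <2% -ly adverbs.
print(f"adverbs: {adverb_ratio:.4f} (target < 0.04) ->", adverb_ratio < 0.04)
print(f"-ly adverbs: {ly_adverb_ratio:.4f} (target < 0.02) ->", ly_adverb_ratio < 0.02)
```

Both ratios exceed their targets in this run (about 5.1% and 3.2%).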
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences: 76
echoCount: 0
echoWords: (empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences: 76
mean: 13.74
std: 10.09
cv: 0.735
sampleLengths (first 50): 16, 22, 12, 28, 17, 21, 35, 8, 26, 10, 30, 15, 6, 6, 14, 21, 12, 19, 10, 10, 4, 30, 24, 7, 10, 10, 14, 10, 28, 4, 1, 4, 54, 3, 3, 23, 8, 7, 7, 7, 4, 25, 3, 5, 6, 12, 7, 2, 27, 11
76.7544%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats: 0
diversityRatio: 0.4605263157894737
totalSentences: 76
uniqueOpeners: 35
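The reported diversityRatio equals uniqueOpeners divided by totalSentences (35 / 76). A minimal sketch of such a measure follows; treating the "opener" as the first word of each sentence, lowercased and stripped of quotes and punctuation, is an assumption, and the function name `opener_diversity` is hypothetical.

```python
# Characters stripped from the first token (assumed normalization).
PUNCT = "\"'“”‘’.,!?"


def opener_diversity(sentences: list[str]) -> float:
    """Share of distinct first words among all non-empty sentences."""
    openers = [s.split()[0].lower().strip(PUNCT) for s in sentences if s.split()]
    return len(set(openers)) / len(openers) if openers else 0.0


sample = [
    "Quinn shook her head.",
    "Quinn raised an eyebrow.",
    "Eva nodded, eyes bright.",
    "Patel frowned.",
]
print(opener_diversity(sample))  # 3 distinct openers over 4 sentences
print(35 / 76)                   # ratio from the reported counts
```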
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags: 12
fancyCount: 10
fancyTags:
0: "he said, voice low (say, voice)"
1: "Quinn muttered, eyes scanning the surroundings once more (mutter, scan)"
2: "Quinn replied curtly (reply curtly)"
3: "she said, voice low and reverent (say, voice)"
4: "she reached in to touch the brass casing gently (reach in touch case gently)"
5: "she said quietly (say quietly)"
6: "she said slowly, voice hushed (say slowly, hush)"
7: "Eva whispered, eyes fixed on the needle (whisper, fix)"
8: "she held up the Veil Compass like a key (hold up like)"
9: "she replied, voice sharp like the edge of her jaw (reply, voice like)"
74.3576%