NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMR64DKT48XBVBQAB1091DP
Run Details
Model:
GPT-4.1 Mini
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
19.1s
Cost:
$0.002814
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn stepped down the narrow concrete stairwell into the cold, dimly lit expanse of the abandoned Tube station beneath Camden. The faint
echo
of dripping water mingled with the distant rumble of above-ground trains, creating a dissonant rhythm in the chill air. She adjusted the collar of her coat and glanced at the battered police tape cordoning off the scene. A metallic tang of blood hung faintly, barely perceptible beneath the usual dampness of the underground platform. It was no ordinary crime scene—something about it refused to settle in her bones. Quinn had
been called
here on a whisper of murder, but the location itself was peculiar. This decrepit station hadn’t seen a passenger in decades, yet here they were: a body sprawled near the old turnstiles, police officers muttering theories that failed to convince her. Detective Sergeant Milo Grant was already at the perimeter, crouched beside the victim. He looked up as Quinn approached, pushing his glasses higher on his nose. “You’re finally here,” he said with a hesitant grin, the light from a flickering overhead bulb catching the worry lines
etched
deep into his face. Quinn didn’t reply immediately. Her sharp eyes scanned the scene—the victim, a young woman dressed in layers of uneven fabric and worn leather boots, lay face up beneath the arched concrete ceiling. Her features
were obscured
by shadows, but the absence of obvious trauma gave Quinn pause. No bloodied knife, no blunt force injury, no sign of struggle other than the faintest disturbance in the thin dust kicked up around her. “What’s the preliminary?” Quinn finally asked, her voice low, steady. Milo gestured toward the silence around them. “No witnesses, no CCTV obviously. Time of death seems about midnight, but the body’s in a state of... decomposition that doesn’t quite fit. And we found this.” He pulled something from a small evidence bag—a bone token, bleached white and
etched
with delicate runes Quinn recognized from a case file months ago. A marker of entrance into the Veil Market. Quinn’s brow
furrowed
. “You think this girl was trying to get in?” Milo nodded
cautiously
. “That’s the theory. But here’s the twist—there are no signs of entry through the usual channels. No known openings, no witnesses to her arrival or departure.” Quinn’s gaze drifted to the edges of the platform, where old advertisements flaked off the wall and a rusted train door sat half open like a broken mouth. She walked
slowly
around the perimeter, noting the subtle irregularities. The air near the far wall
shimmered
ever so faintly, as if the cold mist bent unnaturally against the bricks. “That’s not just decay,” she murmured, crouching to examine the floor. The dust
was disrupted
around a small brass cicada-shaped object—a Veil Compass. She pried it free and held it up, noting the verdigris patina, the face
etched
deep with sigils, and the needle twitching erratically. “Looks like the compass is active,” Milo whispered behind her. “Pointing to something off the wall there.” Quinn turned, watching the compass needle wobble before settling toward the distant wall. Her
eyes narrowed
. This was no ordinary crime scene. The girl’s death was the consequence of stepping where no human should. Milo’s voice broke through her concentration. “The forensic team didn’t find conventional toxins or anything after testing. It’s as if her body was... drained. Something supernatural.” Quinn’s mind flashed back to DS Morris—the partner she’d lost three years ago under circumstances just as inexplicable. She fought down the resurgence of grief and doubt. Supernatural or not, she had to trust her instincts first. Standing, Quinn’s sharp
jaw clenched
. “Milo, we’re not dealing with a simple homicide. Whoever—or whatever—did this, used methods designed to erase evidence, to keep us blind.” Milo
looked uneasy
but peered back at the compass. “So, what now? We call in the occult unit? The others will laugh us out of HQ for even suggesting that.” Quinn shook her head. “No one knows this place like I do.” She tapped the compass
lightly
. “I want a full sweep. See where the needle points, and watch the surroundings
carefully
. Every detail matters.” Milo gave a half-smile. “This is what you do best, isn’t it?” She gave a cold nod, sending a silent vow into the stale underground air. As they stood in watchful silence, a soft rustling came from the narrow corridor behind the victim. Quinn spun, drawing her service pistol with military precision trained from years on the force. A petite figure emerged, red curls bouncing in the shadows—Eva Kowalski, clutching her worn leather satchel close. “Eva?” Quinn’s surprise was genuine. “What are you doing here?” Eva’s green eyes
flickered
nervously
behind her round glasses as she tucked a wild curl behind her ear. “I heard about the body. Someone mentioned the location. I thought maybe I could help.” Quinn’s jaw tightened. She trusted Eva’s research instincts, though the young scholar’s involvement sometimes made things messier. “This isn’t a museum archive, Evie. What do you know about this?” Eva took a hesitant step forward, voice low. “The Veil Market. It isn’t just rumor—or a legend. It’s real. And it moves through these underground abandoned stations during the full moon.” “Why come here tonight, then?” Quinn asked, eyeing the clock on her wristwatch. “Full moon was two nights ago.” Eva swallowed, then pulled a set of notes from her bag. “Because the ley lines shift. The portal lingers a bit longer sometimes, especially if someone’s trying to force it open. I think someone wanted the compass to guide them here—and this girl was the key.” Quinn stared at her friend, weighing the words. “If the portal’s still active, that explains the shimmer you saw on the wall.” Eva nodded. “And if the girl died trying to cross over...” Her gaze faltered. Quinn’s mind reassembled the puzzle like a steel trap snapping shut. “This was no random killing. It’s part of a larger pattern—people getting caught in the crossover between realms.” A sudden motion caught Quinn’s eye—the compass needle
suddenly
spun wildly, then pointed directly at a seam in the tunnel wall. “Look,” Milo said
sharply
, stepping forward. Quinn crouched again, inspecting the aged brickwork. The mortar was loose, but beneath it—a faint pulsating glow hummed
softly
. She slipped on a pair of gloves and pressed her palm against the wall. An electric charge sifted through her skin. “Supernatural energy, definitely,” she muttered. Eva pulled a small vial from her satchel and pressed it against the wall in one spot. The liquid vanished instantly, as if swallowed by the surface. Quinn rose, lunging for her radio. “Send in tech with
spectral
scanners. We need to
map that wall
, and I’m going to get to the bottom of how this girl got here, and what killed her." She glanced down once more at the body, now bathed in the eerie glow seeping from the cracked wall. This wasn’t just the scene of a homicide. It was the edge of something far more dangerous—and Quinn wasn’t going to let it go unsolved. Not after what she’d already lost.
created_at
"2026-02-17T02:52:23.447141+00:00"
2/17/2026, 2:52:23 AM
model
"openai/gpt-4.1-mini-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
401
moderation_latency
(null)
generation_time
18963
tokens_prompt
923
tokens_completion
1924
native_tokens_prompt
836
native_tokens_completion
1550
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0028144
router
(null)
provider_responses
0
id
"resp_04add279cf3320a8016993d7d372148194a824fa4e040ee118"
status
200
is_byok
false
latency
117
endpoint_id
"872eccb7-9c85-45fc-974a-ff7c8e2407e6"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-mini-2025-04-14"
api_type
"completions"
id
"gen-1771296723-6vZ8NsywZ7nN1cvFZJ12"
upstream_id
"resp_04add279cf3320a8016993d7d372148194a824fa4e040ee118"
total_cost
0.0028144
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
77.7778%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
6
adverbTagCount
1
adverbTags
0
"Milo said sharply, stepping forward [sharply]"
33.2220%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1198
totalAiIsmAdverbs
8
found
0
adverb
"carefully"
count
1
1
adverb
"cautiously"
count
1
2
adverb
"lightly"
count
1
3
adverb
"nervously"
count
1
4
adverb
"sharply"
count
1
5
adverb
"slowly"
count
1
6
adverb
"softly"
count
1
7
adverb
"suddenly"
count
1
highlights
0
"carefully"
1
"cautiously"
2
"lightly"
3
"nervously"
4
"sharply"
5
"slowly"
6
"softly"
7
"suddenly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
24.8748%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1198
totalAiIsms
9
found
0
word
"flickered"
count
1
1
word
"shimmered"
count
1
2
word
"spectral"
count
1
3
word
"furrowed"
count
1
4
word
"jaw clenched"
count
1
5
word
"echo"
count
1
6
word
"etched"
count
3
highlights
0
"flickered"
1
"shimmered"
2
"spectral"
3
"furrowed"
4
"jaw clenched"
5
"echo"
6
"etched"
66.6667%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
2
maxInWindow
2
found
0
label
"eyes widened/narrowed"
count
1
1
label
"jaw/fists clenched"
count
1
highlights
0
"eyes narrowed"
1
"jaw clenched"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
1
narrationSentences
69
matches
0
"looked uneasy"
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
0
hedgeCount
0
narrationSentences
69
filterMatches
(empty)
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
86
matches
0
"map that wall"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
37
mean
31.89
std
18.48
cv
0.579
sampleLengths
0
79
1
59
2
51
3
71
4
10
5
67
6
12
7
29
8
58
9
46
10
17
11
34
12
26
13
37
14
26
15
30
16
35
17
12
18
14
19
49
20
10
21
33
22
29
23
31
24
19
25
46
26
22
27
14
28
29
29
21
30
6
31
19
32
21
33
5
34
27
35
36
36
50
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
69
matches
0
"been called"
1
"were obscured"
2
"was disrupted"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
144
matches
(empty)
97.0104%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
834
adjectiveStacks
0
stackExamples
(empty)
adverbCount
34
adverbRatio
0.0407673860911271
lyAdverbCount
19
lyAdverbRatio
0.022781774580335732
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
86
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
86
mean
13.72
std
8.35
cv
0.609
sampleLengths
0
22
1
22
2
18
3
17
4
14
5
16
6
29
7
13
8
13
9
25
10
4
11
28
12
15
13
24
14
10
15
7
16
52
17
8
18
3
19
9
20
3
21
26
22
28
23
10
24
20
25
11
26
12
27
23
28
10
29
7
30
13
31
3
32
6
33
12
34
6
35
20
36
18
37
9
38
10
39
5
40
21
41
9
42
21
43
4
44
13
45
18
46
4
47
8
48
14
49
17
57.7519%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
2
diversityRatio
0.36046511627906974
totalSentences
86
uniqueOpeners
31
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
6
fancyCount
5
fancyTags
0
"he said with a hesitant grin, the light from a flickering overhead bulb catching the worry lines etched deep into his face (say catch etch)"
1
"she murmured, crouching to examine the floor (murmur, crouch examine)"
2
"Milo whispered behind her (whisper)"
3
"Milo said sharply, stepping forward (say sharply, step)"
4
"she muttered (mutter)"
79.8414%