NC Bench
Run: 2026-02-17-01KHMPCED74FM6E0153V2RPNQY
Run Details
Model: GPT-4o, Aug. 6th (temp=1)
Test: Bad Writing Habits
Scenario: Mystery: examining a crime scene
Duration: 40.7s
Cost: $0.016020
Provider: OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. 
She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn arrived at the entrance of the abandoned Tube station beneath Camden, her breath visible in the chill of the underground air. A dull hum of distant traffic filtered through the layers of concrete above her, barely reaching the subdued tumult of the Market below. She wrapped her fingers around the worn leather strap of her watch, feeling its familiar texture against her wrist as if it might ground her in this place where reality seemed so thin.

The Veil Market sprawled beneath the station, an intricate web of stalls and shadowed nooks, its vendors hawking wares that should not exist. A chaotic symphony of haggling voices echoed through the corridors, blending with the scent of incense and alchemical herbs. Harlow had been here once before, on a different case. It was a place she’d rather not revisit but duty, ever relentless, called her here again.

She produced a bone token from her coat pocket, its polished surface smooth and cool against her fingertips. The gatekeeper, a figure shrouded in a cloak that whispered around his feet like a living shadow, inclined his head as she passed. Her presence here would not go unnoticed; her police badge, tucked discreetly into her jacket, felt heavier with the awareness that even this realm wasn’t beyond the law.

The scene of the crime was deeper inside, through the labyrinthine alleys of the Market where normalcy frayed at the edges. She found it in a corner backed by a crumbling brick wall, dimly lit by a flickering streetlamp that cast long, jittery shadows.

Eva Kowalski stood within the taped-off area, her curly red hair a defiant blaze of color in the gloom. Harlow had called in a favor; Eva’s expertise in matters of the occult might offer insights where traditional policing failed. As Harlow approached, Eva tucked a stray curl behind her left ear, her fingers lingering there as if uncertain of their next move.

“Harlow,” Eva greeted, her voice carrying a blend of wariness and relief. She adjusted her round glasses and glanced around the small enclosure where the victim lay, or what was left of him. “This is… odd, even for here.”

Harlow nodded, eyes narrowing as she examined the scene. The victim was sprawled on the cobblestone, limbs askew as if frozen in the midst of some desperate plea. His skin bore no marks of violence—an anomaly that set her instincts on edge. Instead, the flesh was tinted an unnatural gray, emanating a palpable aura of wrongness.

“What do you make of it?” Harlow asked, crouching down beside the victim. A faint smell of brimstone wafted up, mingling with the other scents.

Eva pondered, her gaze thoughtfully distant. “The discoloration… it resembles a curse, one that draws the life force without spilling blood. It’s rare, difficult to perform without leaving some kind of trace.”

Harlow nodded, pulling a small brass compass from her pocket—the Veil Compass she’d acquired on her last visit. Its face was etched with intricate sigils, needle shivering as though sensing the energy in the air. Harlow had once dismissed such tools as fanciful objects, but now, as it quivered towards a hidden rift just beyond the crime scene, she felt it might reveal what human eyes overlooked.

“Any suspects?” Eva asked, skimming through the notes she’d been making in a leather-bound book.

“Too many. Half the Market were at his stall when he went down. Witnesses say he collapsed, no obvious cause. But I’m starting to suspect…” Harlow hesitated, glancing up at Eva. “Have you heard of a shade artisan here, someone rumored to craft items beyond the ordinary?”

Eva’s eyebrows pinched together in concentration. “A shade… artisan,” she repeated, considering. “There’s a few who’ve claimed titles like that, but only one reputed for genuine work—Gideon. Hard to find, but his creations apparently work with supernatural precision.”

Their eyes met briefly, an understanding passing between them. If a shade artisan were involved… Harlow’s thoughts drifted back to the cryptic warning she’d heard on the wind when she first heard about this incident: the veil grows thin.

Harlow turned her attention back to the scene, trailing her fingers just above the surface of the stones. Each cobblestone seemed subtly different, one shimmering where the others lay dull—an anomaly that might speak if she listened closely enough. Slowly, she pried the stone loose, revealing a chalked sigil beneath, partially smudged but still charged with latent energy.

“Eva, what do you make of this?” Harlow asked, gesturing at the symbol.

Eva crouched beside her, eyes widening. “That’s… Harlow, that’s a sealing sigil, usually used to contain a powerful entity. But here… I think maybe it was altered to drain instead of contain.” She swallowed hard. “Someone used this man as a conduit.”

Harlow stood, her jaw setting with determination. “Someone wants the balance here disrupted, leveraging the Market’s chaos to cover their tracks.” Her gaze drifted beyond the crime scene, into the pulsing heart of the Market—it was a festering wound whose infection spread with every stall, every secret deal struck in its shadows.

“We need to find this Gideon,” Eva declared, rising to stand at Harlow’s shoulder. “If he’s responsible, he’s our best lead.”

Agreeing silently, Harlow nodded, a strategy forming in her mind. She’d have to walk this dangerous path carefully—one misstep and the Market could close its ranks around them, exhibiting the very supernatural hostility she’d only glimpsed.

As they moved, the weight of Harlow’s watch tugged reassuringly at her wrist, a constant reminder of time’s inescapable march. Yet here, beneath Camden in the Veil Market, time felt twisted, every tick a reminder that truth lay hidden, deeper than any darkness.

Together, they began their hunt through the Market’s labyrinthine passageways, past stalls piled high with improbable wonders and impossibilities that whispered of the unseen world. Through her years as a detective, Harlow had learned that truth often lay in anomalies—that the simple act of paying attention could reveal what others missed. Now, that skill was her compass in a sea of shadows where ordinary evidence was bent and fractured, demanding interpretation that challenged every rule she knew.

Onward, they ventured, deeper into the Market, the flickering lights above them casting shifting shadows that writhed and danced against the walls. Harlow felt each step resonate beneath her feet, felt the Market breathe around them, waiting patiently, as if it too had secrets to share, once the shadows were thick enough to listen.
created_at: "2026-02-17T02:21:14.180409+00:00" (2/17/2026, 2:21:14 AM)
model: "openai/gpt-4o-2024-08-06"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 401
moderation_latency: (null)
generation_time: 40437
tokens_prompt: 923
tokens_completion: 1736
native_tokens_prompt: 836
native_tokens_completion: 1393
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.01602
router: (null)
provider_responses:
0:
id: "chatcmpl-DA4zRqspKRm3GRG0FPoam7i7eZMak"
status: 200
is_byok: false
latency: 383
endpoint_id: "9d15935a-34e6-4a5e-a5bc-c7dda213e876"
provider_name: "OpenAI"
model_permaslug: "openai/gpt-4o-2024-08-06"
api_type: "completions"
id: "gen-1771294833-tVndm842XTYdHUKmXpuy"
upstream_id: "chatcmpl-DA4zRqspKRm3GRG0FPoam7i7eZMak"
total_cost: 0.01602
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
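The reported cost appears consistent with simple per-token billing on the native token counts. The sketch below is only a sanity check: the rates of $2.50 per million input tokens and $10.00 per million output tokens are assumptions (they are not stated on this page), and `estimate_cost` is a hypothetical helper, not part of any provider API.

```python
# Assumed per-token rates in USD per one million tokens.
# These rates are NOT given on this page; they are an assumption.
INPUT_RATE_PER_MILLION = 2.50
OUTPUT_RATE_PER_MILLION = 10.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate request cost in USD from native token counts."""
    total = (prompt_tokens * INPUT_RATE_PER_MILLION
             + completion_tokens * OUTPUT_RATE_PER_MILLION)
    return total / 1_000_000


# native_tokens_prompt and native_tokens_completion from the metadata above
cost = estimate_cost(836, 1393)
print(f"{cost:.6f}")  # prints 0.016020
```

Under those assumed rates the estimate agrees with the `total_cost` and `usage` fields (0.01602). The larger `tokens_prompt` and `tokens_completion` figures do not reproduce that value at the same rates.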
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags: 6
adverbTagCount: 0
adverbTags: (empty)
72.3757%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount: 1086
totalAiIsmAdverbs: 3
found: "carefully" (count 1), "slowly" (count 1), "very" (count 1)
highlights: "carefully", "slowly", "very"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
7.9190%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount: 1086
totalAiIsms: 10
found: "palpable" (count 1), "symphony" (count 1), "quivered" (count 1), "intricate" (count 2), "fractured" (count 1), "normalcy" (count 1), "weight" (count 1), "echoed" (count 1), "etched" (count 1)
highlights: "palpable", "symphony", "quivered", "intricate", "fractured", "normalcy", "weight", "echoed", "etched"
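A tracked-word frequency of this kind can be sketched as a whole-word count divided by the total word count. The sketch below is an illustration only: the evaluator's real tokenizer and its full 140-word list are not shown on this page, so `TRACKED_WORDS` holds just the nine words reported as found, and `count_tracked_words` is a hypothetical helper.

```python
import re
from collections import Counter

# Only the words reported as "found" above; the full tracked list
# is not published on this page.
TRACKED_WORDS = {
    "palpable", "symphony", "quivered", "intricate", "fractured",
    "normalcy", "weight", "echoed", "etched",
}


def count_tracked_words(text: str, tracked: set[str]) -> Counter:
    """Count whole-word, case-insensitive occurrences of tracked words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(token for token in tokens if token in tracked)


sample = ("Its face was etched with intricate sigils, an intricate web "
          "where every stall echoed.")
counts = count_tracked_words(sample, TRACKED_WORDS | {"very"})

# Frequency as reported on this page: 10 tracked hits in 1086 words.
frequency_percent = 10 / 1086 * 100
print(counts["intricate"], counts["very"], round(frequency_percent, 2))
```

Matching on whole tokens rather than substrings keeps "every" from being counted as "very". With the page's own figures, 10 hits in 1086 words comes to roughly 0.92%.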
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches: 0
maxInWindow: 0
found: (empty)
highlights: (empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells: 0
narrationSentences: 52
matches: (empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount: 2
hedgeCount: 0
narrationSentences: 52
filterMatches: "watch"
hedgeMatches: (empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount: 1
totalSentences: 58
matches: "learned that truth"
69.2134%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs: 23
mean: 46.83
std: 18.37
cv: 0.392
sampleLengths (indices 0-22): 80, 68, 69, 44, 62, 39, 56, 25, 32, 67, 15, 47, 38, 39, 58, 13, 42, 52, 21, 36, 43, 77, 54
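The mean, standard deviation, and coefficient of variation (CV) reported for paragraph lengths can be recomputed from the 23 sample lengths listed above. This is a minimal sketch, assuming the evaluator uses the population standard deviation (dividing by n); that assumption is not stated on the page, but it is the variant that reproduces the reported figures. `coefficient_of_variation` is a hypothetical helper name.

```python
from statistics import mean, pstdev

# Paragraph word counts copied from sampleLengths (indices 0-22).
paragraph_lengths = [
    80, 68, 69, 44, 62, 39, 56, 25, 32, 67, 15, 47,
    38, 39, 58, 13, 42, 52, 21, 36, 43, 77, 54,
]


def coefficient_of_variation(values: list[int]) -> float:
    """Population standard deviation divided by the mean."""
    return pstdev(values) / mean(values)


avg = mean(paragraph_lengths)
std = pstdev(paragraph_lengths)
cv = coefficient_of_variation(paragraph_lengths)
print(round(avg, 2), round(std, 2), round(cv, 3))  # 46.83 18.37 0.392
```

A CV of 0.392 falls below the 0.5 target, which fits the partial score for this evaluator. How the evaluator maps the CV onto its percentage score is not documented here.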
94.8718%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount: 3
totalSentences: 52
matches: "was sprawled", "was tinted", "was etched"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
143
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount: 907
adjectiveStacks: 0
stackExamples: (empty)
adverbCount: 36
adverbRatio: 0.03969128996692393
lyAdverbCount: 17
lyAdverbRatio: 0.018743109151047408
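Both ratios here follow from dividing the adverb counts by this evaluator's word count (907, which differs from the 1086 used by the AI-ism evaluators for reasons the page does not explain). The sketch below reproduces them and compares them with the stated targets. `modifier_ratios` is a hypothetical helper, and treating the targets as strict upper bounds is an assumption drawn from the "<4%" and "<2%" wording.

```python
def modifier_ratios(word_count: int, adverb_count: int,
                    ly_adverb_count: int) -> tuple[float, float]:
    """Return (adverb ratio, -ly adverb ratio) relative to word count."""
    return adverb_count / word_count, ly_adverb_count / word_count


# Counts reported by the purple prose evaluator.
adverb_ratio, ly_adverb_ratio = modifier_ratios(907, 36, 17)

# Targets from this page: <4% adverbs, <2% -ly adverbs, no adjective stacks.
adjective_stacks = 0
within_target = (adverb_ratio < 0.04
                 and ly_adverb_ratio < 0.02
                 and adjective_stacks == 0)
print(adverb_ratio, ly_adverb_ratio, within_target)
```

Both ratios sit just under their limits (about 3.97% and 1.87%), which fits the full score for this evaluator.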
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences: 58
echoCount: 0
echoWords: (empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences: 58
mean: 18.57
std: 8.38
cv: 0.451
sampleLengths (indices 0-49): 24, 23, 33, 23, 19, 10, 16, 18, 23, 28, 21, 23, 19, 20, 23, 12, 21, 6, 9, 19, 14, 14, 13, 12, 6, 26, 18, 17, 32, 15, 31, 16, 6, 6, 26, 9, 6, 24, 18, 21, 19, 13, 6, 29, 7, 7, 45, 14, 7, 10
85.6322%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats: 2
diversityRatio: 0.5344827586206896
totalSentences: 58
uniqueOpeners: 31
50.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags: 6
fancyCount: 3
fancyTags:
"Eva greeted, her voice carrying a blend of wariness and relief (greet, carry)"
"she repeated, considering (repeat, consider)"
"Eva declared, rising to stand at Harlow’s shoulder (declare, rise stand)"
87.0595%