NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQN84314748X1ZZKMJA4ZA
Run Details
Model:
o4 Mini High
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
39.1s
Cost:
$0.022325
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Quinn’s boots
echoed
against the concrete as she stepped off the rattling stairway and onto the platform. The abandoned station yawned before her, its arched ceiling pockmarked with peeling paint and spiderwebs of iron rebar. Flickering bulbs, strung from frayed cables, cast jaundiced circles of light over rails choked with rust. A thin layer of dust settled on every surface, disturbed only by the dark stain at the center of the platform. Quinn paused, letting the hush settle in her chest, feeling the
weight
of eighteen years in uniform tighten around her shoulders. She noted the faint hum of electricity in the cables overhead—a heartbeat out of sync with the city above. Her salt-and-pepper crew cut glinted in the weak glow; her sharp jaw shifted as she scanned the scene. No yellow tape flapped across the entrance. No squad cars idling. This was neither a routine homicide nor a daylight arrest. Not on this night, in this place. Quinn reached for her leather
watch
, its face scuffed, and pressed the button on her flashlight. The beam cut through shadows, illuminating a circle of white dust that surrounded a body slumped on the tracks. “Detective Quinn,” Eva Kowalski’s voice floated through the gloom, tight with nerves. Quinn turned to see green eyes framed by round glasses, curl-struck red hair escaping in chaotic spirals. Eva tucked a lock behind her left ear, then shrugged her satchel straps. The satchel, weathered leather bursting at the seams with books and notebooks, thumped against her hip as she knelt on the edge of the circle. “I flagged this as a ritual site,” she said, voice low. “See the pattern? It’s not chalk from the forensics kit.” Quinn dropped to one knee, palms dusting the jagged edge of the white ring. She frowned. The powder lay too evenly distributed—no brush strokes or breaks where someone had scrambled. “Looks like investigator chalk to me,” she said, dragging a latex-gloved finger through the outer rim. The circle held fast. Dust sprayed upward in a perfect arc, as if anchored by hidden force. She inhaled the sharp, metallic scent of ozone. “And yet it doesn’t move the way chalk should.” She straightened, blade of light cutting across the platform sign: “Camden Veil Station.” The letters were half-obscured by grime, but the name burned in her mind. Veil Market had been here last full moon. A bone token card had
opened that door
. Now it seemed something—someone—had reopened it without invitation. Eva slid a weathered tome from her satchel and flipped to a page streaked with coffee rings. “The Etruscan word for threshold,” she murmured, pointing to a symbol drawn in crimson ink. “Threshold. Nearly identical to the rune scratched into the victim’s wrist.” Quinn’s gaze snapped to the body: a man no older than forty, dressed in a dark overcoat that soaked into the tracks below his knees. His skin was chalk-white, marbled with veins of purple. His left wrist bore a rune carved too neatly, as if by a surgeon’s steady hand: an inverted triangle bisected by a single line. No entry wounds. No obvious trauma. Quinn’s pulse hammered beneath her uniform. She knelt beside him, fingertips grazing the coarse wool of his sleeve. “No defensive wounds. No fall injuries. No blood beyond that ring.” She shone her light along his chest. Beneath the overcoat, his shirt was
pristine
. No slit in the fabric. No splatter on the collar. It was as if his blood had never existed. “He bled out inside himself,” Quinn said. “But there’s no cavity, no puncture. Just… nothing.” She exhaled, jaw tightening. She’d seen bodies torn apart by knives, bullets, strangulation—never this. “Look at the tracks,” Eva said, voice quivering as she pointed along the rail ties. Two sets of footprints, pressed into the dust, led toward a ventilation shaft at the far end of the platform. One set was wide and heavy, likely a pair of work boots; the other was narrower, the print of a leather-soled shoe. Quinn knelt and rubbed a gloved fingertip across the edge of one footprint. Tiny crystals sparkled where dust had fallen into the tread. She coaxed a sliver of residue onto a strip of plastic. “This isn’t ordinary dust,” she said. “It’s almost granular… metallic.” The crystals caught her flashlight and glowed with an inner light, an ember-like shimmer. She placed the sample in an evidence bag and sealed it. Eva adjusted her glasses, voice
barely above a whisper
. “You’re saying… not chalk, not concrete dust.” Quinn shook her head, brow creased. She stood and reached for the brass compass lying beside the body. Verdigris mottled its casing, and protective sigils
were etched
around its face—familiar, handcrafted by a Shade artisan at the Market. She turned it over in her hand, the
weight
surprisingly light. The needle, still,
suddenly
quivered
, aligning itself toward the dark tunnel beyond the ventilation grate. “Compass is keyed to rifts,” Quinn said. “It shouldn’t react to chalk.” She pocketed it, feeling the lure of its pull in her palm. “Someone brought this here intentionally.” Eva’s satchel rustled as she skimmed pages. “The bone token he carried… it’s missing,” she said. Quinn’s gaze snapped to a small pile of bleached bird bones arranged in a semicircle behind the victim. They were too clean to be natural—eagle wing or raven femur, polished smooth. “They used him as a marker,” Quinn said. “Left his token behind as bait or—” She lifted a foot to inspect a blackened spot on the platform. A scorch mark, roughly circular, as if something had burned away the dust and grime. “—a focusing point. See this? He was the focal point for something.” A distant rumble
echoed
through the tunnels, low and resonant, though no train had run here in decades. Quinn’s heart thudded. She pressed her back against the station wall, scanning the ceiling. Water dripped from broken pipes above. Each drop sounded like a clock ticking down. She drew a sharp breath, tasting metal on her tongue. “We’re not looking at a homicide in the usual sense,” she said, voice steady despite the adrenaline. “This was staged to open—or widen—a rift.” Eva’s green
eyes widened
behind her glasses. Quinn pointed to the circle of dust. “That’s a ward, not a crime-scene outline. It keeps something contained or guides energy inward.” Eva closed her book and dropped it into the satchel. “Then the chalk ring is as much protection as it is ritual,” she said. Quinn nodded, then knelt to peer at the rune carved on the man’s wrist. “You
think that symbol
is the key to the doorway. But look at how clean the cut is,” Quinn said. She traced the line with her fingertip. “No blade traces. No chips. Whoever did this used… what tool? Something that slices flesh without scarring bone.” She glanced at Eva. “Arcane energy? I don’t yet believe in magic, but I’ve seen things I can’t explain.” Eva’s brow
furrowed
. She tapped her satchel, then produced a small glass vial containing a swirling blue powder. “Propanthine ash,” she said as if reciting a grocery list. “Derived from a demon’s funeral pyre. It burns hot, erases organic matter—perfect for softening bone. After you slice, you burn away everything you don’t need.” Quinn stared at the vial. The hairs on her forearms rose. “So whoever killed him removed his blood, his organs… then incinerated the proof.” She closed her eyes for a fraction of a second, picturing the scene. The body had
been deposited
here,
pristine
, to disguise the ritual. The scorch mark had cooled too
quickly
to be an accident. It was a byproduct. Above them, the faintest purple glow
flickered
behind grillework in the ventilation shaft. The compass in Quinn’s pocket
throbbed
, as though alive. She rose, gloved hand brushing dust from her trouser leg. “We
follow that,” she
said, nodding toward the shaft. “If there’s an active rift, it’s up there.” Her voice was calm, precise. Military precision, born of years chasing murderers and chasing ghosts. She
flicked
on a secondary flashlight and slipped from the platform, Eva at her heels, red hair brushing against her chin. Each step
echoed
, swallowed by the dark. Quinn’s mind raced through facts—no token, no token, no token. The bone token: small, carved from ivory, the entrance fee to the Veil Market. Whoever invited this man had stolen his means of passage. They’d used his body to draw energy, and then left the Market unguarded. A full moon rose above ground, marking tonight as the market’s move. In hours, the hidden stalls would appear here again—dealers of cursed blade, sellers of forbidden knowledge. At the grated hatch above, the purple glow pulsed through vents, a heartbeat in the wall. Quinn pressed her palm to the cool steel and shone her light through a gap. Vapors swirled beyond, drifting like smoke across underground arches. She slid her flashlight back into her holster, freeing both hands to push the hatch. It gave with a long groan, revealing a narrow passage. The scent of sulfur and old ink rose up. Eva clutched her satchel, eyes bright with fear and excitement. Quinn glanced at the compass in her coat pocket. The needle
trembled
, pointing up through the hatch. She slipped her hand inside and closed her fingers around the glow. Eva murmured, “Detective… this is bigger than anything we’ve seen.” Quinn didn’t reply. She kicked the hatch closed behind them, sealing the world outside. Then she led the way forward, each step measured, ears tuned to the hum of magic. Tonight, she would see what others had missed—and she would stop the shadow that had taken a life to open a door.
created_at
"2026-02-17T02:43:29.598122+00:00"
2/17/2026, 2:43:29 AM
model
"openai/o4-mini-high-2025-04-16"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
14840
moderation_latency
(null)
generation_time
38936
tokens_prompt
923
tokens_completion
2809
native_tokens_prompt
835
native_tokens_completion
4865
native_tokens_completion_images
0
native_tokens_reasoning
2688
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0223245
router
(null)
provider_responses
0
id
"resp_028a92bd39d07c61016993d5aa4ed48197990c3e340f54aabf"
status
200
is_byok
false
latency
273
endpoint_id
"60020533-2fb2-4aa1-9454-181029fd52de"
provider_name
"OpenAI"
model_permaslug
"openai/o4-mini-high-2025-04-16"
api_type
"completions"
id
"gen-1771296170-BTg59xZmEWEj4CRsnA6W"
upstream_id
"resp_028a92bd39d07c61016993d5aa4ed48197990c3e340f54aabf"
total_cost
0.0223245
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
17
adverbTagCount
0
adverbTags
(empty)
81.6514%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1635
totalAiIsmAdverbs
3
found
0
adverb
"quickly"
count
1
1
adverb
"suddenly"
count
1
2
adverb
"barely above a whisper"
count
1
highlights
0
"quickly"
1
"suddenly"
2
"barely above a whisper"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
14.3731%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1635
totalAiIsms
14
found
0
word
"throbbed"
count
1
1
word
"flickered"
count
1
2
word
"flicked"
count
1
3
word
"trembled"
count
1
4
word
"quivered"
count
1
5
word
"pristine"
count
2
6
word
"furrowed"
count
1
7
word
"weight"
count
2
8
word
"echoed"
count
3
9
word
"etched"
count
1
highlights
0
"throbbed"
1
"flickered"
2
"flicked"
3
"trembled"
4
"quivered"
5
"pristine"
6
"furrowed"
7
"weight"
8
"echoed"
9
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"eyes widened/narrowed"
count
1
highlights
0
"eyes widened"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
122
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
122
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
3
totalSentences
127
matches
0
"opened that door"
1
"think that symbol"
2
"follow that,” she"
0.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
16
mean
100.88
std
14.18
cv
0.141
sampleLengths
0
93
1
100
2
88
3
130
4
101
5
97
6
126
7
109
8
101
9
109
10
102
11
116
12
92
13
75
14
84
15
91
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
122
matches
0
"were etched"
1
"been deposited"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
238
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1334
adjectiveStacks
0
stackExamples
(empty)
adverbCount
24
adverbRatio
0.017991004497751123
lyAdverbCount
10
lyAdverbRatio
0.0074962518740629685
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
127
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
127
mean
12.71
std
6.08
cv
0.478
sampleLengths
0
17
1
18
2
16
3
21
4
21
5
19
6
18
7
7
8
4
9
10
10
7
11
16
12
19
13
12
14
17
15
13
16
25
17
11
18
10
19
14
20
2
21
14
22
16
23
4
24
13
25
8
26
35
27
8
28
8
29
8
30
17
31
15
32
36
33
9
34
24
35
3
36
3
37
6
38
12
39
18
40
7
41
5
42
5
43
9
44
7
45
12
46
10
47
15
48
20
49
22
60.6299%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
8
diversityRatio
0.4015748031496063
totalSentences
127
uniqueOpeners
51
27.9412%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
17
fancyCount
10
fancyTags
0
"Eva Kowalski’s voice floated through the gloom, tight with nerves (float)"
1
"she said, voice low (say, voice)"
2
"she said, dragging a latex-gloved finger through the outer rim (say, drag)"
3
"she murmured, pointing to a symbol drawn in crimson ink (murmur, point draw)"
4
"snapped (snap)"
5
"Eva said, voice quivering as she pointed along the rail ties (say, voice quivering point)"
6
"she said, voice steady despite the adrenaline (say, voice)"
7
"she said as if reciting a grocery list (say recite)"
8
"she said, nodding toward the shaft (say, nod)"
9
"Eva murmured, Quinn didn’t reply. (murmur, does not reply)"
81.4468%