NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQM5PY7ZZNRXAKJE5R05GB
Run Details
Model:
o4 Mini
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
20.8s
Cost:
$0.012271
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn stepped onto the cracked platform of Camden’s abandoned Tube station, her boots
echoing
against the silent rails. The tunnel lights
flickered
, revealing peeling posters and rust-stained walls. She brushed a lock of steel-gray hair from her forehead, eyes narrowing beneath the harsh glow. A thin sheet of mist curled along the tracks, low and insistent, as if reluctant to dissipate. Quinn checked her leather‐strapped
watch
—worn from years on duty—and noted the time: just past midnight. The station had
been sealed
months ago, but someone had broken in. She inhaled once, tasted damp stone and something coppery in the air. A body lay half on the tracks, half on the platform, legs splayed like a puppet whose strings had snapped. Detective Eva Kowalski crouched beside the corpse, her round glasses catching the pale halo of Quinn’s torch. Eva’s curly red hair framed her pale, freckled face, and she pushed a loose strand behind her left ear. In her satchel, Quinn suspected, lay tomes on occult runes and forbidden rituals. Tonight, that knowledge might come in handy. “Male, mid‐thirties,” Eva said, unzipping a chalk marker. “Head trauma—blunt force. Death likely instantaneous.” She drew a chalk outline around the victim’s boots. “No sign of struggle. Could be a drunk falling.” Quinn knelt a few feet away, boots scuffing the grit. She didn’t see shattered glass, didn’t hear a thrown punch. The victim’s forehead
was caved
inward, dark fluid crusted around the edges. But the indentation was too neat, too shallow for a heavy fall. Her jaw tightened. “Drunk?” Quinn’s voice was cool. “He’s wearing polished dress shoes, tie knotted tight. Not stumbling drunk.” She bent closer, scanning the victim’s collar. A small brass object lay half-hidden beneath his jawbone: a compact compass, face
etched
with sigils curling like vines. She lifted it with a gloved finger. The compass needle
quivered
, then snapped north—toward the far end of the platform—where the tunnel descended into shadow. The casing bore a patina of verdigris, polished around the edges from constant handling. Quinn’s heart tripped at the knowledge: Veil Compass. Crafted by a Shade artisan, sold only in the underground market. The black market that supposedly vanished. Eva’s gaze followed Quinn’s. “A novelty item,” she said, voice light. “Tourist junk. Maybe he picked it up from a collector.” Quinn held the compass level. It pointed rigidly down the track. “This isn’t a collector’s trinket. These sigils protect the bearer from—” She cut off, mindful of bystanders. “From things not of this world.” A cold draft brushed Quinn’s neck. She checked her left wrist, the leather strap of her
watch
snug against her skin. She remembered DS Morris, partner lost three years ago in a case that defied explanation. She felt the old ache of guilt: she had missed signs then. Not again. “The head wound,” she said
softly
, returning the compass to the platform. “It’s too clean. No bone shards. No blood spray on the platform. He didn’t fall. He was placed.” Eva frowned. “You think someone laid him here? To stage a scene?” “Someone—or something.” Quinn stood, letting the torch sweep the tunnel walls. Away from the weak station lights, the darkness
seemed to
pulse, alive. She noted the faint scratch marks on the rail ties, shallow grooves as if something dragged toe‐first. The marks stopped at the victim’s body. There were no marks past his feet. “Look.” Quinn crouched by the rail. “These grooves—they’re consistent with a boot toe scraping wood. But there’s no scuff on his soles.” She peeled back the man’s polished shoes. The leather was immaculate, not a single scrape. “He wasn’t walking here.” Eva knelt beside her, lips parted. She glanced up at Quinn. “But the station’s sealed. How did they move him?” She tucked another strand of hair behind her ear. “No entry sign, no footprints aside from his.” Quinn’s brown eyes
flicked
along the tunnel. The compass needle
trembled
again. “Through a rift.” She heard herself say it, felt the old surge of adrenaline: Portal. Supernatural breach. The veil between worlds thinning here. Eva swallowed. “A rift? Quinn, we don’t have proof of—” Quinn stood abruptly and moved to the edge of the platform. She set the torch on the ground, pointing its beam into the tunnel depths. She reached out, fingertips brushing the cool stone.
“Feel that?” she
asked. “The temperature drops here. Normally this station stays dank, but not cold. This is old stone, sealed for decades.” She passed her hand through the mist. It coalesced around her fingers, lingering like smoke. “Something opened.” A rumble
echoed
from the far end. Quinn’s pulse quickened. She retraced her steps, crouched beside the body again. The victim’s jacket sleeve rode up, revealing a pale wrist. Three parallel slashes—too straight, too precise—lay just above the bone. No blood oozed; the cuts
were cauterized
, crisp as if seared shut. Eva peered at the wounds. “He must have cut himself after falling. Maybe to get help.” Quinn’s gaze hardened. “No. These are ritual scars. They line up with glyphs I’ve seen in Shade texts. Protection wards, inverted.” She dabbed her glove on the platform; the dark smear glowed faintly under her torch’s UV setting. “This is residue—no ordinary blood. High copper content. Alchemical.” Eva’s breath caught. She fished in her satchel, produced a small vial. “I keep reagents for
spectral
analysis.” She knelt, uncorked it, touched a drop to the smear. The fluid flared blue then vanished. “Supernatural ichor,” she whispered. Quinn nodded, folding her arms. “The compass pointed toward that. I’ve
seen that glow
before. There was something down here three years ago when Morris vanished. I thought it was a gas leak.” A jagged memory pricked at her ribs: Morris’s last broadcast, voice trembling, shouting about a door in the wall. “This isn’t a drowning, accident, or a mugging gone wrong. He was delivered—beyond our realm—to be sacrificed.” Quinn swept the torch back toward Eva. “The stains on the wall over there—” She stepped over the chalk outline, knelt by a smear of dark crimson mottled with ash. “They’re not blood. Burnt residue, like flesh singed and scraped. Someone—or something—tore into him.” Eva’s round glasses fogged. “Then who would do this?” Quinn glanced toward the tunnel. The compass needle spun, then locked north. “Shades. They traffic bodies when the market moves. This station is the Veil Market’s current locus. Someone bought a ticket on a rift, paid with a life.” Eva’s voice
trembled
. “My God. The market still moves?” Quinn rose, shoulders squared. “Full moon was last night. It relocated here under Camden. Whoever owns this station opened the breach. We’ve only just found the aftermath.” She snapped on rubber gloves, slipped the compass into an evidence bag. The sigils gleamed once, like embers in the plastic. Eva helped dust for fingerprints on the compass casing. “Only one set—his. No others.” She frowned. “That means… someone wore gloves.” “Everyone here is careful.” Quinn scanned the station: no civilians, no other officers yet. Only cratered shadows and the hush of ancient tunnels. “They didn’t want to leave a trace. But they left magic.” She stood, chest tight. “Call forensics on the residue. Get me a
spectral
analyst from the Museum. There’s more I need to see.” She cocked her head toward the tunnel mouth. “Examine those grooves again. We need to map where the rift led. And pull station blueprints—this station wasn’t even on the line schedule. They never run trains here.” Eva blinked. “We’ll find it, Harlow. I’ll start cross‐referencing the occult archives.” Quinn nodded once, face set. She stepped onto the first rail tie, balanced there, scanning the darkness as if she could chase the portal down the tracks. Her
watch
ticked against her wrist. She breathed in. The mist curled around her ankles, a thin tendril of warning. The Veil had opened. Someone had paid with a life. Somewhere in the shadows, Quinn could almost hear Morris’s voice, terrified and pleading. She swallowed hard. This time, she would follow the signs. This time, she wouldn’t be too late.
created_at
"2026-02-17T02:42:36.303416+00:00"
2/17/2026, 2:42:36 AM
model
"openai/o4-mini-2025-04-16"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4798
moderation_latency
(null)
generation_time
20643
tokens_prompt
923
tokens_completion
2600
native_tokens_prompt
835
native_tokens_completion
2580
native_tokens_completion_images
0
native_tokens_reasoning
640
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0122705
router
(null)
provider_responses
0
id
"resp_01e51a5fe739a757016993d586f8948190a607e5f47ac34cab"
status
200
is_byok
false
latency
152
endpoint_id
"bd121898-b27c-4e2c-bc92-278627465a54"
provider_name
"OpenAI"
model_permaslug
"openai/o4-mini-2025-04-16"
api_type
"completions"
id
"gen-1771296134-rNGQoG9seFhZfMGFZhrI"
upstream_id
"resp_01e51a5fe739a757016993d586f8948190a607e5f47ac34cab"
total_cost
0.0122705
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
77.7778%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
6
adverbTagCount
1
adverbTags
0
"she said softly, returning the compass to the platform [softly]"
92.6525%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1361
totalAiIsmAdverbs
1
found
0
adverb
"softly"
count
1
highlights
0
"softly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
26.5246%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1361
totalAiIsms
10
found
0
word
"flickered"
count
1
1
word
"flicked"
count
1
2
word
"trembled"
count
2
3
word
"quivered"
count
1
4
word
"spectral"
count
2
5
word
"echoed"
count
1
6
word
"echoing"
count
1
7
word
"etched"
count
1
highlights
0
"flickered"
1
"flicked"
2
"trembled"
3
"quivered"
4
"spectral"
5
"echoed"
6
"echoing"
7
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
106
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
1
narrationSentences
106
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
2
totalSentences
125
matches
0
"“Feel that?” she"
1
"seen that glow"
85.3678%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
34
mean
39.5
std
17.73
cv
0.449
sampleLengths
0
63
1
59
2
56
3
32
4
47
5
42
6
64
7
21
8
34
9
50
10
30
11
12
12
54
13
41
14
37
15
35
16
10
17
73
18
51
19
16
20
47
21
38
22
52
23
61
24
9
25
39
26
9
27
48
28
21
29
34
30
59
31
12
32
57
33
30
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
106
matches
0
"been sealed"
1
"was caved"
2
"were cauterized"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
170
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
933
adjectiveStacks
0
stackExamples
(empty)
adverbCount
36
adverbRatio
0.03858520900321544
lyAdverbCount
8
lyAdverbRatio
0.00857449088960343
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
125
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
125
mean
10.74
std
7.53
cv
0.701
sampleLengths
0
20
1
10
2
16
3
17
4
15
5
12
6
12
7
20
8
17
9
19
10
13
11
7
12
8
13
15
14
9
15
10
16
10
17
12
18
12
19
3
20
5
21
18
22
19
23
7
24
18
25
14
26
8
27
11
28
6
29
4
30
7
31
10
32
5
33
6
34
17
35
6
36
6
37
15
38
15
39
12
40
2
41
12
42
18
43
2
44
10
45
11
46
12
47
17
48
7
49
7
55.2000%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
6
diversityRatio
0.36
totalSentences
125
uniqueOpeners
45
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
6
fancyCount
5
fancyTags
0
"Eva said, unzipping a chalk marker (say, unzip)"
1
"she said, voice light (say, voice)"
2
"she said softly, returning the compass to the platform (say softly, return)"
3
"she whispered (whisper)"
4
"snapped (snap)"
84.5602%