NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQQPVYYND0T74GRA7WF8RZ
Run Details
Model:
o4 Mini
Test:
Bad Writing Habits
Scenario:
Thriller: chase through city streets
Duration:
25.8s
Cost:
$0.015811
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support"> <eyeColor>warm brown</eyeColor> <hair>short curly dark brown</hair> <height>5'10"</height> <age>29</age> <skinTone>olive skin</skinTone> <scar>scar running along his left forearm from a knife attack</scar> <medallion>Saint Christopher medallion around his neck</medallion> <birthplace>Seville, Spain</birthplace> <description>A former paramedic who provides off-the-books medical care for the clique.</description> <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory> </character> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> </codex> Write a fully fleshed out scene of at least 1000 words in the thriller genre. A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Rain sluiced off Harlow Quinn’s shoulders as she barreled across Greek Street, boot heels splashing through ankle-deep puddles. Neon lights from the Raven’s Nest painted the slick cobbles sickly green; umbrellas bobbed above tourists and late-night revellers. She kept her gaze fixed on the collapsing form of her suspect: a lean man in a dark jacket, sprinting toward an alley just beyond the bar’s glowing sign. His shoes slapped wet stone, a staccato she matched stride for stride. Quinn pumped her arms, leather
watch
clapping against her wrist, breath rasping in her chest. Three years of Metropolitan Police, eighteen decorated years of service—it all narrowed to this chase. Behind her, the muffled pings of car tires and a lone siren faded under the rain’s roar. Her partner
was gone
; DS Morris lost to a case that had turned inexplicable. Tonight, she told herself, she would not lose another lead. The suspect vanished between stacks of broken crates and dumpsters at the alley’s mouth. The walls reeked of mold and stale beer. Quinn’s boots crumbed rust from crumbling brick as she pitched forward, vaulting over a spindly scaffold. A broken streetlamp
flickered
overhead, its bulb fighting the downpour. She
flicked
her flashlight on, beam slicing the darkness. A door stood ajar where an advertising poster for a lost-and-found washed against the threshold. She hesitated a heartbeat—her instincts screamed this was no ordinary doorway—but the man’s retreat gave her no choice. She shoulder-rolled through the gap, nearly toppling into a narrow staircase. Metal grates, slick with rainwater, led downward. She hunched, one hand on her pistol, the other brushing against the damp wall. Each stair groaned under her
weight
. Drip-drip
echo
es followed her descent. The smell changed to oil and rust, metallic tang stinging her nose. She counted five steps past the door, then seven, then nearly felt the breeze of a platform opening. At the bottom, the staircase spit her onto a cracked platform littered with dead leaves and discarded newspapers. An old tiled sign, half-scraped by vandalism, still read “Camden Road.” Quarters for a ghost station. She crept forward, flashlight bobbing. A faint hum—electric wires overhead—thrummed in rhythm with her heartbeat. The distant scrape of
footsteps
pulled her deeper, through a breach in a corrugated-iron wall. Beyond lay the Veil Market, invisible to the uninvited. The narrow tunnel widened into a cavern of strange commerce. Lanterns strung from overhead beams cast an amber glow over stalls heaped with curious trinkets. Bone tokens carved into minuscule skulls
glint
ed on one table; jars of charcoal-black powder sat beside rusted vials of ink that
shimmered
violet in the lamplight. A stall wafted incense so thick that Quinn could taste it on her tongue. Hooded figures moved like ghosts between stalls: one exchanged a scrap of parchment for a pewter dagger; another examined a cage containing a huddled creature that looked half-bat, half-rat. Quinn’s pulse climbed. She crept between a rack of talismans and a display of carved wooden masks, following the
echo
of hurried
footsteps
. The suspect’s silhouette
flickered
ahead, hood drawn low. Rainwater dripped from his shoulders as though he’d carried the storm downstairs with him. With a sudden burst, she lunged. Her hand clamped on the man’s sleeve; he spun, forcing a stall-cart over with a crash. Glass vials shattered, sending sparks of colored liquid across bones and coins. He yanked free, shoulder-driving her into a narrow aisle. Shelves rattled; a group of hooded buyers froze in mid-haggle, faces hidden in shadow. Quinn leveled her pistol as she braced against the stall’s edge. The air smelled of damp earth and something sickly sweet—like decay. “Stop!” she barked. Her voice
echo
ed off the corrugated walls. The suspect bolted again, weaving through a tangle of crates and curtained alcoves. Quinn pursued, flashlight beam illuminating vendors who parted silently before her, their eyes
glint
ing with metallic curiosity. She rounded a corner and nearly collided with a broad-shouldered man in a leather satchel slung across his chest. He stepped back, eyes widening as Quinn’s boot kicked loose a bone token from a table. It skittered across the stones and clinked underfoot. She stooped, snagged it by the carved edge. The token grew warm, pulsing against her glove. She felt a faint tug at her wrist—almost a pulse
echo
ing through the token’s ridges. Quinn froze, scanning the crowd. The man with the satchel approached, hands raised in a placating gesture. “Detective,” he said, voice low, accent subtle. Spanish? She frowned. “You’re bleeding.” He
flicked
a quick glance at her left hand, knuckles bruised and bleeding from the push into the stall. She straightened, pressing her injured wrist against her side. “Who are you?” she demanded. He dipped his head. “Tomás Herrera. I work down here. Medical.” He touched the satchel strap. Inside, she caught the
glint
of bandages and vials. “You need a proper dressing.” She
flicked
her flashlight to the satchel then back to his face. Warm brown eyes stared up at her, concern flickering. Her second gut punch since DS Morris
died: that he’d
be here, tending secrets. She’d read file notes: ex-paramedic, off-the-books treatments, lost his license. Could this be—no, she was here for a suspect. The suspect’s hooded shape vanished down another corridor. Quinn’s jaw tightened. She glanced at the bone token. No badge, no uniform, no warrants, but if she turned back, he’d escape underground—and with him, whatever connection he had to her partner’s death. She met Herrera’s gaze. “Help me.” He nodded once, swift and silent. He reached into the satchel, withdrew a small antiseptic spray and folded gauze. He elevated her wrist; she winced as he sprayed and pressed the gauze around the wound. Clean, efficient. He worked in seconds, securing a neat wrap. The hum of conversation resumed around them, vendors masking intrigue behind murmurs. Herrera stood, pocketing the remainder of his kit. “They call this place the Veil Market,” he said. “It moves with the moon. It’s not meant for police.” Quinn flexed her hand. The token burned against her glove as though alive. “That’s a bone token, isn’t it?” He glanced at her hand—pause, then a faint exhale. “Yes. It’s your entry ticket. And your one-way pass if you stay.” She stared, heart lodged in her throat. “What am I in for?” He shook his head. “Dangerous. Not always human danger.” Her mind blasted back to Morris’s unexplained disappearance, the whispered rumors of creatures that defied death. Her military-bred bearing bristled. She’d always believed in facts, evidence. But as the antiseptic stung her injury, she realized tonight her case had slipped off every rule she’d ever known. Footsteps approached. Quinn looked past Herrera; the suspect returned, a furtive shape gliding between stalls. He clutched something—a small leather pouch stamped with unknowable runes. He tensed, ready to strike. Quinn sprang forward. “No more running!” she shouted. He froze. Herrera’s gaze darted back. Around them, the market stilled. Clocks in the bars above ground might have ticked obliviously, but down here time bent to other rules. The suspect made a split-second decision. He turned and bolted for a grated doorway in the wall—arch painted with half-erased graffiti. Quinn sprinted, drew on every ounce of her training. Her wet coat slapped her legs, her breath burned. She reached the grate, yanked it open. Beyond lay a narrow shaft, plunged deeper into the void. Quinn hesitated only a wound’s second. She had no backup. No radio; the storm above blocked signals. The
echo
of
footsteps
above warned that more might be coming. Locke-jaws of fear scraped at her mind. But this suspect held answers—answers about stolen goods, suspicious deaths, and maybe the truth behind Morris. She pressed a finger to her lips, a silent signal to Herrera. His shoulders squared. He gave her a small nod, the faintest of warnings and encouragements all at once. Quinn stepped forward, flashlight carving a path into the black shaft. Rain still dripped from her hair, but the air here was dryer, heavier with promise and peril. She glanced once at Herrera, then cast her gaze down the metal rungs. She exhaled and climbed, each rung rattling beneath her
weight
, determination anchoring her to purpose. If she reached the bottom, the market would fold around her like an untamed beast. She’d chase this lead into the underworld itself. The grate clattered shut above her. Darkness enveloped. Snowmelt of streetlights and neon faded behind her, leaving her alone with the pulse of her own resolve—and the man skulking somewhere ahead in the shadows. She tightened her grip, steadied her breathing, and moved on.
created_at
"2026-02-17T02:44:36.973064+00:00"
2/17/2026, 2:44:36 AM
model
"openai/o4-mini-2025-04-16"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6944
moderation_latency
(null)
generation_time
25664
tokens_prompt
914
tokens_completion
2894
native_tokens_prompt
810
native_tokens_completion
3391
native_tokens_completion_images
0
native_tokens_reasoning
1408
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0158114
router
(null)
provider_responses
0
id
"resp_0e84c310fe437017016993d5fad74c8193b4028cd999fe64ce"
status
200
is_byok
false
latency
137
endpoint_id
"bd121898-b27c-4e2c-bc92-278627465a54"
provider_name
"OpenAI"
model_permaslug
"openai/o4-mini-2025-04-16"
api_type
"completions"
id
"gen-1771296250-z1ZfY4pTTNRU9IGUEB69"
upstream_id
"resp_0e84c310fe437017016993d5fad74c8193b4028cd999fe64ce"
total_cost
0.0158114
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
4
adverbTagCount
0
adverbTags
(empty)
100.0000%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1463
totalAiIsmAdverbs
0
found
(empty)
highlights
(empty)
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
0.0000%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1463
totalAiIsms
19
found
0
word
"flickered"
count
2
1
word
"flicked"
count
3
2
word
"shimmered"
count
1
3
word
"glint"
count
1
4
word
"glinting"
count
1
5
word
"weight"
count
2
6
word
"footsteps"
count
4
7
word
"echo"
count
2
8
word
"echoed"
count
1
9
word
"echoes"
count
1
10
word
"echoing"
count
1
highlights
0
"flickered"
1
"flicked"
2
"shimmered"
3
"glint"
4
"glinting"
5
"weight"
6
"footsteps"
7
"echo"
8
"echoed"
9
"echoes"
10
"echoing"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
128
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
128
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
135
matches
0
"died: that he’d"
89.9068%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
33
mean
43.27
std
20.11
cv
0.465
sampleLengths
0
78
1
71
2
57
3
65
4
75
5
49
6
84
7
45
8
57
9
32
10
73
11
48
12
31
13
14
14
30
15
54
16
47
17
45
18
39
19
19
20
21
21
12
22
9
23
46
24
33
25
5
26
29
27
56
28
51
29
30
30
41
31
38
32
44
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
1
totalSentences
128
matches
0
"was gone"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
260
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1391
adjectiveStacks
0
stackExamples
(empty)
adverbCount
34
adverbRatio
0.024442846872753415
lyAdverbCount
9
lyAdverbRatio
0.006470165348670022
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
135
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
135
mean
10.58
std
5.77
cv
0.545
sampleLengths
0
18
1
19
2
29
3
12
4
15
5
15
6
17
7
14
8
10
9
14
10
8
11
16
12
10
13
9
14
15
15
18
16
11
17
7
18
14
19
6
20
5
21
12
22
18
23
18
24
16
25
5
26
10
27
15
28
9
29
10
30
15
31
26
32
14
33
29
34
3
35
20
36
8
37
14
38
6
39
16
40
12
41
9
42
14
43
11
44
11
45
3
46
7
47
13
48
17
49
19
59.0123%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
7
diversityRatio
0.3851851851851852
totalSentences
135
uniqueOpeners
52
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
4
fancyCount
3
fancyTags
0
"he said, voice low, accent subtle (say, voice)"
1
"demanded (demand)"
2
"shouted (shout)"
85.2305%