NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQE6PAATBGGXGBGQE2MM3Y
Run Details
Model:
o4 Mini
Test:
Bad Writing Habits
Scenario:
Thriller: chase through city streets
Duration:
20.7s
Cost:
$0.013391
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support"> <eyeColor>warm brown</eyeColor> <hair>short curly dark brown</hair> <height>5'10"</height> <age>29</age> <skinTone>olive skin</skinTone> <scar>scar running along his left forearm from a knife attack</scar> <medallion>Saint Christopher medallion around his neck</medallion> <birthplace>Seville, Spain</birthplace> <description>A former paramedic who provides off-the-books medical care for the clique.</description> <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory> </character> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> </codex> Write a fully fleshed out scene of at least 1000 words in the thriller genre. A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Rain sluices down the neon-slick sidewalk as Detective Harlow Quinn presses her shoulder against the wall of a closed café, chest heaving. Her brown eyes—sharp as flint—track the
flicker
of movement at the end of the block. The suspect, a lean figure in a soaked black hoodie, sprints toward a green neon glow that ripples off puddles in ruby and emerald. Quinn pads forward, leather-soled boots splashing through dirty rainwater. She checks her wrist
watch
—worn leather strap—its face smeared with droplets. 02:17. No time to wait for backup. “Stop right there!” she shouts. Her voice ricochets between brick walls. The suspect glances back, eyes wild, and ducks into the haze of steam rising from a storm drain. Quinn lunges after him, heart hammering in a rhythm she recognizes from eighteen years of stakeouts and raids. She rounds the corner and finds herself at the entrance of the Raven’s Nest—its distinctive green neon sign buzzing overhead. She halves the distance, adrenaline sharpening her senses: the tang of wet asphalt, the musk of old wood, and something metallic undercurrent, like ozone before lightning. Inside the bar, the light dims to near-dark. Old maps line the walls, black-and-white photographs of Soho in eras past. A pair of drunks lean on the bar, jostling silently as Quinn barrels past. The suspect ducks between them and disappears behind a tall bookshelf lined with weathered classics. Quinn vaults over a row of stools, shoulder slamming into the shelf. It swings inward, revealing a narrow corridor lit by a single bare bulb. Water drips from the ceiling onto cracked tile. A musty draft carries the faint murmur of voices and something that smells like burned sage. Quinn hesitates only a breath, then follows. Her fingertips graze damp brick as she negotiates a low, arched doorway. Stairs descend in uneven stone, slick beneath her boots. She counts silently: one… two… three… each step
echoing
in the claustrophobic shaft. Three years ago, they would have ascended these stairs in triumph. Back then, her partner DS Morris had animated her confidence with a grin and a crack about finding coffee at the bottom of hell. Morris never made it back up—vanished under circumstances she’s never understood. Tonight, that memory is a blade against her ribs, urging her forward. At the foot of the stairs, she emerges into a cavernous chamber reeking of incense and iron. Flickering lanterns reveal rows of stalls heaped with glittering talismans, dried herbs in burlap sacks, and brass chalices
etched
with runes. A low hum of haggling voices carries through the vaulted space. This is the Veil Market, an underground bazaar that only surfaces at the full moon. Bone tokens clatter on tables, half-buried beneath folded maps of
otherworldly
realms. Quinn’s breath comes fast. She has no token. She is in enemy territory. The suspect weaves through the crowds, hooded head bobbing as he slips past a vendor offering silver-coated blades. Quinn forces her way forward, arms out to ward off startled buyers and cloaked merchants. A man in a
velvet
tunic ducks beneath her arm, curses in a deep baritone. Lantern light catches on the serrated edge of a blade—too close. Quinn’s hand hovers near the grip of her service pistol, concealed beneath her raincoat. The crowd parts like wet curtains, and she spots her quarry at the far end of the market, exchanging a bone token with a pale-skinned guard whose eyes glow amber. The guard nods, steps aside, and the suspect vanishes through an arched portal carved from aged stone. Quinn slings her coat aside and races after them. Splintered wood underfoot makes her boots skid. She nearly loses her balance before regaining her stride. The lanterns
flicker
overhead, each flame
dancing
as if urged by an unseen breath. Shadows twist across the walls in shapes that steal her breath. For a heartbeat, she thinks she sees Morris standing at the edge of the crowd,
watch
ing. His face is obscured, but the tilt of his head is unmistakable. She blinks. He’s gone. The portal leads to a narrower tunnel lined with ancient tiles carved with serpentine patterns. A low rumble vibrates through the floor. Water drips from overhead pipes. In the dim light, Quinn can barely make out the suspect’s retreating silhouette. She edges forward. The walls close in, coated in damp moss that glistens like black glass. She pauses, listening. The hum of the market recedes. Now it’s just her boots and the steady thrum of her own pulse. Seconds stretch into minutes. Quinn checks her
watch
again—02:24. Back-up is still north of the surface, clueless. She allows herself a cold, meticulous calculation. She could turn back, radio for reinforcements, regroup above ground. But the suspect is minutes from disappearing into whatever
clandestine
chamber lies ahead—taking evidence, perhaps even the key to the case she’s vowed to solve. The loss of Morris taught her that hesitation can be fatal. She steps forward, hand brushing a wet tile. A faint glyph glows beneath her palm, pulsing like a heartbeat. The air shivers. She’s crossed into a place where the rules she knows no longer apply. Her service pistol feels too mundane here; smoke and the hiss of unseen energies fill the tunnel. A memory of Morris
flicker
s—a shared coffee in the station house, the way he glowed when he cracked a case. She swallows, straightens her shoulders. If she can’t trust her own instincts, she’s not a detective—she’s prey. A metallic clang
echoes
through the tunnel. She raises her pistol and rounds a bend. The suspect stands before an iron door reinforced with studs. He slams a heavy fist against a carved symbol: a half-moon cradling a star. The door shudders, then swings open inward. Behind it lies a chamber flooded with red torchlight and the scent of old incense. Figures in hooded robes stand in a circle, heads bowed. The suspect stumbles forward, handing over a small leather satchel to a robed figure whose face
is hidden
beneath a deep cowl. The robes shift, revealing glints of metal—daggers, amulets, vials of glowing liquid. Quinn inches forward, every sense alert. She draws a silent breath and steps into the chamber. The robed figures glance up, eyes shining like coals. They part as one, and the suspect straightens, hands raised in surrender. “Detective Quinn,” he says, voice shaking. “You shouldn’t have come here.” Rainwater drips from Quinn’s hair onto the crimson-stained floor. Her pistol
is trained
on the suspect’s chest, but she can feel the eyes of a dozen strangers on her back. The guard from the Veil Market steps into the doorway, amber eyes locked on her. The guard’s lips peel back in a slow, feline grin. Quinn’s grip tightens on the pistol. Every muscle in her body tenses for the moment when she might need to fire. She glances at the satchel in the suspect’s trembling hands. Inside are vials of shimmering liquid—untainted blood, enchanted for some ritual. She thinks of Morris, of that night she found his abandoned badge at a ritual site, the tears frozen on the ground. “Why?” she rasps. “What are you planning?” The suspect swallows. The robed figure he handed the satchel to steps forward. His voice is low but carries across the chamber like rolling thunder. “This market thrives on balance,” he intones. “You have
upset that balance
. We offer you a choice.” Detective Quinn feels a cold wind slither down her collar. She glances at her
watch
—02:27. Rain still pounds above. Backup will arrive soon—but not soon enough. She raises her pistol, breathing shallow. Honor, duty, vengeance for Morris. She can hear the last chant from the robed circle, rising in a cadence that scrapes at her bones. The detective’s jaw clenches. She steps forward into the torchlight, every fiber
calibrated
for control. “I choose justice,” she says. Her voice
echoes
off stone. “I choose to bring the truth to the light—no matter how dark it is.” The robed men freeze. The suspect exhales, shoulders sagging. The guard flicks a wrist, and the iron door thuds shut behind them with a reverberation that quakes the walls. Quinn’s pulse rockets. With her next step, she crosses the threshold between law and the unknown. The candlelight flares, dust motes
dancing
like spirits. She steadies herself, careful and deliberate, and moves deeper into the chamber. The decision is made. Whatever waits below will not break her resolve. She will uncover the conspiracy that cost her the only partner she ever trusted. And tonight, she will not walk away.
created_at
"2026-02-17T02:39:20.60586+00:00"
2/17/2026, 2:39:20 AM
model
"openai/o4-mini-2025-04-16"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1972
moderation_latency
(null)
generation_time
20128
tokens_prompt
914
tokens_completion
2693
native_tokens_prompt
810
native_tokens_completion
2841
native_tokens_completion_images
0
native_tokens_reasoning
896
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0133914
router
(null)
provider_responses
0
id
"resp_0a04fdb1ef3afeed016993d4c3d7f481968dd86de6571ad57e"
status
200
is_byok
false
latency
118
endpoint_id
"bd121898-b27c-4e2c-bc92-278627465a54"
provider_name
"OpenAI"
model_permaslug
"openai/o4-mini-2025-04-16"
api_type
"completions"
id
"gen-1771295939-Lblbmalk9F4dyU3PifXY"
upstream_id
"resp_0a04fdb1ef3afeed016993d4c3d7f481968dd86de6571ad57e"
total_cost
0.0133914
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
4
adverbTagCount
0
adverbTags
(empty)
100.0000%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1446
totalAiIsmAdverbs
0
found
(empty)
highlights
(empty)
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
17.0124%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1446
totalAiIsms
12
found
0
word
"flicker"
count
2
1
word
"otherworldly"
count
1
2
word
"clandestine"
count
1
3
word
"velvet"
count
1
4
word
"calibrated"
count
1
5
word
"dancing"
count
2
6
word
"echoes"
count
2
7
word
"echoing"
count
1
8
word
"etched"
count
1
highlights
0
"flicker"
1
"otherworldly"
2
"clandestine"
3
"velvet"
4
"calibrated"
5
"dancing"
6
"echoes"
7
"echoing"
8
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
130
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
0
narrationSentences
130
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
134
matches
0
"upset that balance"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
22
mean
64.5
std
32.4
cv
0.502
sampleLengths
0
87
1
93
2
98
3
99
4
89
5
120
6
82
7
78
8
70
9
89
10
105
11
37
12
11
13
55
14
64
15
7
16
25
17
17
18
71
19
24
20
53
21
45
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
130
matches
0
"is hidden"
1
"is trained"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
222
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1398
adjectiveStacks
0
stackExamples
(empty)
adverbCount
30
adverbRatio
0.02145922746781116
lyAdverbCount
9
lyAdverbRatio
0.006437768240343348
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
134
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
134
mean
10.59
std
5.87
cv
0.555
sampleLengths
0
22
1
15
2
24
3
9
4
10
5
1
6
6
7
5
8
6
9
18
10
18
11
20
12
26
13
8
14
12
15
14
16
15
17
12
18
13
19
8
20
16
21
7
22
12
23
9
24
4
25
1
26
1
27
7
28
11
29
24
30
11
31
12
32
17
33
21
34
11
35
15
36
12
37
4
38
4
39
5
40
18
41
15
42
15
43
11
44
14
45
30
46
17
47
9
48
7
49
9
54.6366%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
9
diversityRatio
0.3684210526315789
totalSentences
133
uniqueOpeners
49
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
4
fancyCount
3
fancyTags
0
"shouts (shout)"
1
"he says, voice shaking (say, voice shaking)"
2
"he intones (intone)"
86.5676%