NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQYNS59BHW4JD08K4J98KE
Run Details
Model:
o4 Mini
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
23.6s
Cost:
$0.013740
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn stepped off the creaking service ladder into the cavernous hush of the abandoned Camden Tube station. Above her head, flickering sodium lamps sputtered against riveted girders. The stale scent of damp concrete and rusty rails settled in her lungs. She adjusted the collar of her dark overcoat, eyes sweeping the platform’s edge. Her leather-strap
watch
glinted under the half-dead light—6:43 a.m. A single chalk outline,
stark
white against grime-dark tile, traced the shape of a body that no longer lay there. Quinn’s jaw tightened. She lowered her head to examine the smudge of crimson seeping at the circumference. The smear wasn’t fresh; it had crusted into a brittle residue, like coagulated oil. She tapped it with a gloved finger. It flaked, revealing pale stone beneath. “Blood would soak in,” she murmured. “This hauls on top.” Beside her, Eva Kowalski twisted a strand of curly red hair behind her ear. Her round glasses magnified green eyes alive with restless energy. A worn leather satchel hung at her side, heavy with books. She knelt by the chalk line and laid a palm flat inside the outline. The corner of her mouth dipped, curious. “It’s alchemical, not biological. Hemoglobin reacts differently under certain catalysts. This—” she tugged at the flaked residue “—is bound in a lead matrix. Someone’s masking what’s left of the body.” Quinn raised an eyebrow. She preferred facts grounded in fingerprints and fiber analysis, not arcane chemistry. Yet she’d learned to listen to Eva’s quiet insistence. Three years of off-case consultations had taught her intelligence like this demanded respect. “Where’s the rest of the body?” Eva shrugged. “Removed. Clean sweep.” She rose, brushing dust from faded denim. “No drag marks. Whoever took the corpse must’ve used a proper lifting technique—fasten straps, hooks, something hydraulic.” Quinn crouched, tracing the chalk curve with her fingertip. She
flicked
away loose grit and peered closer. A near-invisible groove scored the tile, as though a slender blade scraped at the edge. She stood, brushing gravel from her palms. “A hook, maybe. Deep, but narrow.” Up the platform, uniformed officers clustered under the sputtering lamp. One of them, PC Dana Reeves, approached. Her breath plumed in the cold air. “Detective Quinn,” she called, voice
echo
ing against brick vaults. “We found this down by the rails.” She held out a gloved hand, revealing a tarnished brass compass no larger than a matchbox. The casing
was speckled
with a green patina. On its face, tiny sigils
shimmered
like runic scars. The needle
trembled
, settling at an impossible angle ninety degrees off true north. Quinn’s breath hitched. She took it, weighed its
weight
in her palm. It felt lighter than brass should be, as if gravity bent around it. “Where’d you get it?” she asked. “Beside the third rail. Stuck in the ballast.” Reeves shivered. “No one else touched it. I—” She hesitated. “It’s not police issue.” Quinn nodded. “Good.” She closed her fist around the compass, slipping it into the pocket of her coat. The needle spun once, twice, then jerked toward the tunnel opening. Her pulse pulsed against the leather case. Eva glanced from Quinn’s pocket to the tunnel yawning ahead. “It’s a Veil Compass,” she said. “Crafted by a Shade artisan. Points to the nearest supernatural rift or portal. I’ve only heard of them in…discussions.” Quinn inhaled. Shades were legends—beings of shadow who trafficked in secrets. A compass forged by one of them should have remained in the whispering confines of the Veil Market. She swallowed a pulse of disbelief. “Someone brought it here.” She
flicked
on her handheld torch, the beam carving a path through layered dust motes. The tunnel mouth str
etched
before them, steel rails vanishing into darkness. “Reeves, secure this platform. No one in or out without my say-so.” The constable bowed. Quinn exchanged a look with Eva—quiet urgency in the younger woman’s eyes. Eva’s satchel bulged, its leather worn smooth at the strap. Quinn had warned her not to carry forbidden tomes, but here they were, beneath London’s streets, on the edge of something that didn’t fit any standard case file. They moved down the rails,
footsteps
muffled by an ancient carpet of grit. Quinn’s torch painted water-stains on tunnel walls; iron rings hung at intervals, silent
sentinels
. Every few paces, the compass
trembled
in her breast pocket. She slowed, pressed a finger against a rivet. It hummed faintly—no static shock, but a vibration she couldn’t place. “There’s something electric running through these walls,” she said. “Power or…something else.” Eva bent to examine the ballast. “Residual
echo
. Ley lines,” she whispered. “This station sits on a crossroads of subterranean energy. It’s why the Veil Market moves here on full moons.” Quinn’s gaze sharpened. “You knew this place was due for a market night three days ago.” She felt the
weight
of a pattern clicking into place. A crime staged for the cloak of darkness. “Someone’s clearing out witnesses—maybe traders—before the next gathering.” Eva nodded, fingers trailing over a fracture in the stone. “Or someone’s hijacking the rift itself.” They reached a service alcove before a sealed maintenance door. The compass on Quinn’s wrist—the one she’d retrieved from her coat pocket—
began to
pulse against her skin. The leather casing had turned verdigris. The sigils glowed faintly. She held it up. The needle swung toward the door’s hinge. Quinn inhaled, feeling the familiar hum of adrenaline. Three years ago, DS Morris had vanished on a case that led her to similar anomalies—a hidden shrine, a bloodless sacrifice. She’d buried the supernatural under a mountain of procedure. Now the edges of her world
were fraying
. Eva slid a slender chisel from her satchel. She knelt at the door’s seam, tracing her finger along a line of hidden screws. “Let me,” she murmured. The chisel glinted as it caught lamp glow. Quinn nodded. “Be quick, Evie.” The sealed panel shuddered open to reveal a chamber. Inside, polished granite walls formed a rough circle. At its center, a collapsed ring of iron, scorched at the base. Scattered around the rim were shards of bone—too small to identify. A copper brazier, empty but for a residue of ashen dust, sat at the center. The smell of burnt ozone clung to the air. Eva pushed back curls from her face. “It’s a hastily abandoned ritual site.” She bent to scoop a fragment of bone onto gloved fingertips. “Bone token, maybe. But not one for entry—probably from the ritual itself.” Quinn stepped inside the circle, careful not to disturb any residue. She scanned the walls, saw faint drips of dried ichor. They retraced a triangular web of lines, faint grooves
etched
deep, as though carved by a trembling hand. The lines converged on a single point above the brazier. A half-erased symbol—two spirals unspooling into each other—marred the granite. Quinn’s chest tightened. That sigil matched the ones
etched
on the Veil Compass. She traced its arc with her finger, feeling a hollow in the stone. “They tried to open something here,” she whispered. “A portal, maybe. Something they couldn’t control.” Eva’s breath caught. “The bone dust—they were feeding the rite. Raising energy to tear open a gateway.” She straightened, eyes fixed on the brazier. “Someone aborted the ceremony. Pulled the…subject out.” Quinn kept her gaze on the brazier, noting the heat-blackened iron. No flesh remained, no charred muscle. It was as though a person had been here, undone and extracted whole. She remembered DS Morris’s final report—no body, just a smear of crimson run through a filter of stone. She turned to Eva. “We’re dealing with whatever did this to Morris.” Eva’s lips parted. “You don’t think—” Quinn shook her head, eyes hard. “I know.” The compass in her pocket pulsed once more. She drew it out and
watch
ed the needle lock onto the center of the brazier. It pointed not to north, but down—through the floor. A hidden ramp? A tunnel? Quinn’s fingers curled around the compass. She met Eva’s gaze. “Call Reeves, seal this off. We need heavy gear and a permit for subterranean entry. And, Evie…throw in everything you’ve got on rifts and gateways. We’re going deeper than any squad’s report.” Eva nodded, voice steady. “I’ll pack the Grimoire and the runic index. But be careful, Harlow…” Quinn’s stern face softened for a moment. “I will.” As they backed out of the chamber, the compass’s sigils dimmed, the needle stilled. The hush of the abandoned station closed in around them. Quinn pocketed the device, squared her shoulders, and set her jaw. The portal—whatever it
was—was
still active. And she
was determined
to uncover what lurked beyond, no matter how thin the line between the living world and shadows had become.
created_at
"2026-02-17T02:48:23.086027+00:00"
2/17/2026, 2:48:23 AM
model
"openai/o4-mini-2025-04-16"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
7478
moderation_latency
(null)
generation_time
23433
tokens_prompt
923
tokens_completion
2398
native_tokens_prompt
835
native_tokens_completion
2914
native_tokens_completion_images
0
native_tokens_reasoning
832
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0137401
router
(null)
provider_responses
0
id
"resp_0a6345925d3b0a21016993d6df2b1c8194977e73a9834f743e"
status
200
is_byok
false
latency
230
endpoint_id
"bd121898-b27c-4e2c-bc92-278627465a54"
provider_name
"OpenAI"
model_permaslug
"openai/o4-mini-2025-04-16"
api_type
"completions"
id
"gen-1771296479-mLS0jgr0Gq9XC6hLmtVo"
upstream_id
"resp_0a6345925d3b0a21016993d6df2b1c8194977e73a9834f743e"
total_cost
0.0137401
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
8
adverbTagCount
0
adverbTags
(empty)
100.0000%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1474
totalAiIsmAdverbs
0
found
(empty)
highlights
(empty)
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
5.0204%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1474
totalAiIsms
14
found
0
word
"sentinels"
count
1
1
word
"flicked"
count
2
2
word
"shimmered"
count
1
3
word
"trembled"
count
2
4
word
"stark"
count
1
5
word
"weight"
count
2
6
word
"footsteps"
count
1
7
word
"echo"
count
1
8
word
"echoing"
count
1
9
word
"etched"
count
2
highlights
0
"sentinels"
1
"flicked"
2
"shimmered"
3
"trembled"
4
"stark"
5
"weight"
6
"footsteps"
7
"echo"
8
"echoing"
9
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
125
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
1
narrationSentences
125
filterMatches
0
"watch"
hedgeMatches
0
"began to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
147
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
37
mean
39.08
std
20.74
cv
0.531
sampleLengths
0
64
1
64
2
10
3
86
4
38
5
35
6
45
7
86
8
31
9
22
10
36
11
35
12
39
13
38
14
53
15
68
16
31
17
42
18
16
19
48
20
46
21
35
22
5
23
64
24
36
25
59
26
41
27
31
28
48
29
12
30
6
31
8
32
37
33
42
34
16
35
9
36
64
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
125
matches
0
"was speckled"
1
"was—was"
2
"was determined"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
202
matches
0
"were fraying"
94.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1157
adjectiveStacks
1
stackExamples
0
"stark white against grime-"
adverbCount
21
adverbRatio
0.018150388936905792
lyAdverbCount
3
lyAdverbRatio
0.0025929127052722557
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
147
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
147
mean
9.84
std
5.68
cv
0.578
sampleLengths
0
19
1
10
2
13
3
13
4
9
5
20
6
3
7
14
8
14
9
7
10
6
11
6
12
4
13
14
14
10
15
11
16
14
17
7
18
30
19
4
20
12
21
9
22
13
23
6
24
2
25
10
26
17
27
9
28
8
29
15
30
7
31
6
32
10
33
7
34
7
35
9
36
23
37
8
38
9
39
13
40
3
41
9
42
13
43
6
44
10
45
8
46
4
47
2
48
16
49
11
48.2993%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
4
diversityRatio
0.30612244897959184
totalSentences
147
uniqueOpeners
45
18.7500%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
8
fancyCount
5
fancyTags
0
"she murmured (murmur)"
1
"she called, voice echoing against brick vaults (call, voice echoing)"
2
"she whispered (whisper)"
3
"she murmured (murmur)"
4
"she whispered (whisper)"
86.2394%