NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMR3492DE7N4M672TTFE4EA
Run Details
Model:
o4 Mini High
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
47.0s
Cost:
$0.029285
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn’s boot scuffed the last step, the thudding
echo
swallowed by rancid air. The platform lay deserted, its
mosaic
tiles pocked with grime. Flickering sodium bulbs cast jaundiced streaks across overturned benches. She flipped on her torch—its beam cut through the gloom like a scalpel. At the far end, Eva Kowalski knelt beside a pale figure. The researcher’s green eyes, magnified behind round glasses,
were fixed
on an odd chalk circle scrawled across the platform. Eva’s curly red hair flared under the torchlight; she tucked a tendril behind her left ear, the gesture betraying a nervous energy despite her calm focus. Quinn’s hand hovered near her holster. She moved forward, each step measured, military precision ingrained by eighteen years of service. No patrol car, no CCTV; this abandoned Tube station beneath Camden shouldn’t exist on any map. And now, it had a body. “Dr. Kowalski,” Quinn said, voice low. “Report.” Eva shifted, revealing a cluster of faint burn marks on the victim’s chest. “No obvious wounds,” she said. “No entry, no exit. Skin’s mottled pale, then these concentric burns.” She produced a loupe and a wax crayon from her leather satchel, traced one ring. “Matches a pattern in an 18th-century grimoire on portal invocations.” Quinn crouched, shining her torch across the tiles. The victim lay supine, arms splayed, mouth frozen in a silent scream. The burns were eerily symmetrical. Quinn wiped at one with a gloved thumb—it crumbled to gray ash. “Cause of death?” “Undetermined.” Eva tapped the chalk circle, a perfect ring
etched
with archaic runes. Dust motes drifted through the beam, settling on lines as precise as a stencil. “This isn’t your usual homicide. It looks…ritualistic.” “No fingerprints.” Quinn swept the beam over the floor. She unfolded a small brush from her coat, dipped it into powder, feathered it over tile. Nothing surfaced but a ghost of dust. The victim’s sneaker prints led into the circle, and nowhere else. “They didn’t carry him. They didn’t drag him. He walked in himself.” Eva bit her lip. “Or was drawn.” A metallic
glint
near the victim’s hand caught Quinn’s eye. Eva brushed away dust to reveal a small brass compass, patina freckling its casing with green. Protective sigils
were etched
around its face. Quinn picked it up. It felt almost warm against her palm. The moment she held it, the needle spun wildly, then jerked toward a service tunnel at the platform’s west end. Quinn’s sharp
jaw clenched
. “That’s not
magnetic
north.” She trained her torch on the yawning passage. The compass
quivered
, eager. “You know what this is?” Eva rose, sliding the compass into her satchel. “A Veil Compass. From the Veil Market. It points to the nearest supernatural rift.” Quinn stared at the chalk circle, the compass’s verdict, the absence of normal evidence. “Someone set this up,” she said. “Lured a subject here, activated a portal—” “Or tried to contain something,” Eva offered. Quinn stood. “Containment? With a live body? That ring’s perfect. No breaks, no hesitation. Whoever drew it knows exactly what those runes do.” Eva nodded, voice tight. “I collected chalk, burn residue, tissue samples. And I scraped off the violet sealant on the victim’s eyelids.” Quinn’s torch caught the victim’s face: eyelids sealed with a purple residue, as if to keep eyes closed for reasons beyond fear. “They didn’t want him to see.” She pocketed the compass in an evidence pouch, sealed it with care. Her
watch
—leather band worn thin—
glint
ed 03:21. “Perimeter sweep. No one in, no one out. Drone footage inside the tunnels. And bring in K9 for chemical detection—phosphorus, mercury, any trace of alchemical byproducts.” Eva adjusted her satchel. “I’ll label it an occult incident. Forensics won’t know what hit them.” Quinn turned back to the circle, counting angles. “The Veil Market moves every full moon. They pick spots no one knows exist. This station’s been buried for fifty years. I want every scrap of station history—odd power surges, maintenance logs, trespass reports.” Eva’s freckles speckled pale under the torch. “You think this is connected to your partner’s case?” Quinn’s gaze
flicked
to the compass bag. The memory of DS Morris’s disappearance tightened her chest. “Might be. He vanished after chasing rumors of underground markets and ritual sites. I dismissed it. Now…” She shook off the doubt. “We follow the evidence.” She rose, boots
echo
ing. “Light
up that tunnel
—UV, infrared. Anyone tampering with energy fields leaves a trace.” Eva hesitated. “Detective, this isn’t standard protocol.” Quinn met her eyes. “Not standard, not straightforward.” Eva nodded, slipping her glasses into place. She clicked open her satchel, withdrew vials. “Chalk, residue, tissue, sealant. I’ll fast-track them to the museum’s restricted archives. They’ve got the atmospheric analyzers.” Quinn unsealed the compass bag one last time. The needle spun, pointed deeper into darkness. She turned back to Eva. “We follow this.” A distant scuff
echo
ed from the tunnel. A single footstep, too deliberate to be
echo
. Quinn stiffened, fingers brushing the holster. Eva caught her arm. “Detective—” Quinn signaled downward. “Stay behind me.” The tunnel air pressed in, heavy with stale breath and a hum of energy Quinn could almost hear. Her torch highlighted web-coated tracks that webbed toward a grated door set in the wall. The needle
throbbed
, swinging until its arrow aligned with a
fractured
tile at the door’s base. Quinn knelt, traced the compass’s direction. She slipped a slim pry bar from her belt, wedged it in the tile’s crack. It popped loose, revealing a handful of bone tokens carved into crescent shapes. Dust-covered, brittle. Eva crouched beside her. “Entry requirement for the Market. Whoever used these wasn’t an amateur.” Quinn pocketed two tokens and pressed her ear to the door. Faint voices murmured beyond. She stood, torch trained on Eva. “Secure the scene. I’m going in.” Eva’s eyes found Quinn’s sharp jaw and rigid shoulders. “Be careful.” Quinn gave a curt nod, then slipped through the door, leaving Eva to lock down the platform. The corridor beyond was narrower, plaster chipped in flaking curls. Sigils
were smeared
on the walls, half-washed by damp. The compass
throbbed
in Quinn’s pocket, a living heartbeat guiding her forward. She paused at the end where the corridor opened onto a vaulted chamber lit by a single exposed bulb. A low hum
resonated
through the stone. And in the center,
etched
in chalk, lay another circle—larger, incomplete. A man knelt beside it, chanting in a voice that rattled the spine. His hand hovered over the bones of an animal’s pelvis, arranged like an altar. Quinn stepped forward, torch high. The chant stopped mid-word. The man turned, eyes wild. Behind him, a swirling haze coalesced—a rift, pulsing. Through it, she glimpsed starlit skies that couldn’t belong to this world. The compass flared in her hand. She raised it, the beam catching the man’s face. Recognition struck her like a blow. He was her partner’s last suspect. Just as her training kicked in—draw weapon, secure perimeter—but she stood, breath caught between disbelief and resolve. Quinn’s voice was steady, cold. “Detective Quinn, Metropolitan Police. Drop your altar. Stand clear of the rift.” The man’s lips curled. “You can’t stop it now.” Quinn squared her shoulders. The edges of the circle
began to
glow. She lifted the compass higher, its needle quivering toward the center of the rift. She had no idea whether she
was sealing
a portal or opening one fully. But she knew one thing: she would not let another person vanish on her
watch
. And so, with everything she had learned—physical evidence, occult insight, every hard-won ounce of instinct—Detective Harlow Quinn took her stance, a lone figure in a hidden market beneath the city, determined to bend the rules of nature and reason until the truth would bleed through the Veil.
created_at
"2026-02-17T02:51:12.513564+00:00"
2/17/2026, 2:51:12 AM
model
"openai/o4-mini-high-2025-04-16"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5381
moderation_latency
(null)
generation_time
46901
tokens_prompt
923
tokens_completion
2751
native_tokens_prompt
835
native_tokens_completion
6447
native_tokens_completion_images
0
native_tokens_reasoning
4608
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0292853
router
(null)
provider_responses
0
id
"resp_054424593cf27145016993d7710444819395b6e940104be809"
status
200
is_byok
false
latency
178
endpoint_id
"60020533-2fb2-4aa1-9454-181029fd52de"
provider_name
"OpenAI"
model_permaslug
"openai/o4-mini-high-2025-04-16"
api_type
"completions"
id
"gen-1771296624-2nJiDs0UMKAAlfRAdZCz"
upstream_id
"resp_054424593cf27145016993d7710444819395b6e940104be809"
total_cost
0.0292853
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
4
adverbTagCount
0
adverbTags
(empty)
100.0000%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1313
totalAiIsmAdverbs
0
found
(empty)
highlights
(empty)
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
0.0000%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1313
totalAiIsms
17
found
0
word
"mosaic"
count
1
1
word
"throbbed"
count
2
2
word
"resonated"
count
1
3
word
"flicked"
count
1
4
word
"glint"
count
1
5
word
"quivered"
count
1
6
word
"magnetic"
count
1
7
word
"fractured"
count
1
8
word
"jaw clenched"
count
1
9
word
"echo"
count
2
10
word
"echoed"
count
1
11
word
"echoing"
count
1
12
word
"etched"
count
3
highlights
0
"mosaic"
1
"throbbed"
2
"resonated"
3
"flicked"
4
"glint"
5
"quivered"
6
"magnetic"
7
"fractured"
8
"jaw clenched"
9
"echo"
10
"echoed"
11
"echoing"
12
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"jaw/fists clenched"
count
1
highlights
0
"jaw clenched"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
102
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
1
narrationSentences
102
filterMatches
0
"watch"
hedgeMatches
0
"began to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
132
matches
0
"up that tunnel"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
44
mean
29.41
std
16.1
cv
0.547
sampleLengths
0
47
1
56
2
42
3
7
4
54
5
40
6
34
7
55
8
7
9
44
10
28
11
17
12
22
13
27
14
7
15
23
16
22
17
28
18
44
19
16
20
42
21
16
22
42
23
17
24
7
25
8
26
31
27
23
28
26
29
6
30
49
31
36
32
15
33
27
34
11
35
48
36
64
37
34
38
21
39
23
40
17
41
9
42
55
43
47
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
102
matches
0
"were fixed"
1
"were etched"
2
"were smeared"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
185
matches
0
"was sealing"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
986
adjectiveStacks
0
stackExamples
(empty)
adverbCount
17
adverbRatio
0.017241379310344827
lyAdverbCount
4
lyAdverbRatio
0.004056795131845842
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
132
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
132
mean
9.8
std
6.72
cv
0.686
sampleLengths
0
15
1
10
2
9
3
13
4
11
5
19
6
26
7
6
8
14
9
16
10
6
11
6
12
1
13
13
14
5
15
26
16
10
17
8
18
12
19
5
20
12
21
3
22
13
23
14
24
7
25
9
26
16
27
7
28
11
29
12
30
4
31
3
32
10
33
16
34
7
35
4
36
7
37
20
38
4
39
4
40
8
41
4
42
5
43
8
44
14
45
14
46
6
47
7
48
7
49
2
56.8182%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
5
diversityRatio
0.36363636363636365
totalSentences
132
uniqueOpeners
48
50.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
4
fancyCount
2
fancyTags
0
"Quinn said, voice low (say, voice)"
1
"Eva offered (offer)"
88.6364%