NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQPXFD4XY63CQT4NY596CD
Run Details
Model:
GPT-4.1 Nano
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
12.6s
Cost:
$0.000651
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The entrance to the abandoned Tube station was darker than usual, shadows pooling like ink beneath the
flicker
ing neon sign that barely clung to its rusted frame. Detective Harlow Quinn approached briskly, her sharp jaw set beneath closely cropped salt-and-pepper hair.
The air was thick with
that peculiar scent of decay—metal, damp concrete, something faintly
otherworldly
that she hadn’t yet been able to identify. The kind of darkness that made her
feel
like she
was walking
into a wound no one dared to acknowledge. Her leather
watch
caught the faint glow of the failing light as she checked her time. Five twenty-seven. She’d arrived just before the local patrol had sealed the site, a procedural move prompted, perhaps, by the uneven nature of the evidence—or the lack of it. Behind her, Officer Elmira lingered, shifting
nervously
from foot to foot in her standard uniform. Quinn turned, her eyes narrowing
slightly
, sensing something off about the scene. “I don’t see any blood,” Elmira began, voice tentative. “No sign of struggle either. Just... this.” She gestured to the scattered remnants of what looked like a ritual circle, faint chalk lines barely visible on the uneven, scarred tiles. The edges blurred into grime; almost as if it had been hurriedly drawn, then erased or swallowed by time. No fresh blood. No bodies—at least, none visible from where Quinn stood. “Odd,” Quinn murmured, crouching to inspect the faint symbols. They looked hastily
etched
, almost as if someone
were testing
their own limits. She ran a gloved finger over the faint lines,
feel
ing for any residual energy—nothing. Just broken chalk and dirt. “Could’ve been a staging,” Elmira suggested
softly
. “Someone trying to send us on a wild goose chase. Maybe an occult prank.” Quinn straightened and studied the scene more
carefully
. Occult prank? Maybe. But her instincts—a decade and a half of policing, always looking beyond the surface—told her this was different. The evidence didn’t add up. The silence was
oppressive
, save for a faint, distant hum from somewhere deeper within the station. Quinn’s gaze
flicked
to a narrow tunnel entrance, its dark maw yawning. Something about it made her skin crawl, but she kept her expression
unreadable
. She touched her leather
watch
, glancing at the time again. Still minutes before midnight—the full moon, as it always was when a new chapter in this game began. “Did you find anything unusual?” she asked Elmira, voice low. The officer shook her head. “Just the usual mess. No fingerprints, no footprints—at least not fresh. But I did notice this.” She reached into her pocket and pulled out a small, weathered bone token, worn smooth from handling. “Found it near the circle. No idea what it’s doing here.” Quinn took it
gently
, inspecting the symbol carved into its surface. It looked almost like a miniature talisman. She weighed it in her palm, her mind cataloging possibilities. It was uncommon, but not unheard of, for occult practitioners to leave tokens at sites—often as curses or protections. She frowned. “Could be a warning or a piece of a larger puzzle.” Elmira looked uncertain, but nodded. “That’s what I thought, too. But the weird part? No signs of anyone actually performing a ritual. No candles, no strange objects—nothing that screams active magic.” Quinn’s gaze drifted to the faint chalk lines again. Were they genuine? Or just crude doodles? A mockery? She had learned to trust her gut after years of fights and failed leads. Then a glimmer caught her eye—a small
glint
deeper within the tunnel entrance, barely catching the stray light. She stepped forward, her breath misting in the cool air. Her boots
echoed
softly
against the tile, the sound swallowed by the
oppressive
darkness. At the mouth of the tunnel, she crouched again, this time noticing an irregularity in the floor—something embedded into the crack between tiles. She brushed aside loose dirt, revealing what
appeared to
be a fragment of smooth, dark glass—no, more like obsidian, polished and
pristine
. A deliberate placement. Someone had left it here. She pocketed the piece, then studied the tunnel more closely. A sudden thought pushed in—an intuitive jolt. What if none of this
was meant
to be understood literally? What if it was a distraction? Or a sign? She reached into her satchel, pulling out a small tool, a compact forensics kit even though she didn’t think she’d need it. As she scanned the darkness with her eyes, she spotted something unusual—a faint shimmer to the left wall, almost like a reflection, but there was no light source close by. She eased closer, her boots silent on the cold, slick surface. Her fingers found a subtle change in texture—something scraped or chiseled into the wall, a series of tiny, uneven etchings. Not a symbol, but something more chaotic—a hurried signature, or maybe a warning. She pressed her fingertips against it,
feel
ing the rough ridges. A warning? A message? She considered her options. The scene
was staged
, yet undeniably meaningful. A sudden movement—a
flicker
to the corner of her eye. She spun, instinct sharpened by years of
watch
ing for threats. Nothing. “Someone’s
watch
ing,” she said
softly
, voice
echoing
against the empty walls. Elmira
looked nervous
but
tried to
maintain composure. “You think this connects to the clique? The supernatural stuff? Or...?” Quinn hesitated but kept her tone steady. “I think it’s all connected. Someone’s trying to tell us something, whether they’re here or not. We just need to listen.” She pulled out the Veil Compass from her coat pocket, its brass casing cold and familiar in her grip. The face
etched
with protective sigils was faint, but she knew it was sensitive to supernatural energy. She held it steady, the needle twitching
slightly
before pointing toward the darker tunnel deeper inside. “Looks like we’ve got a direction,” she said quietly. “You see that?” Elmira nodded, eyes wide. “Yeah. Toward the ritual, maybe?” “Or whatever they’re hiding,” Quinn replied, voice low and determined. “Come on. We’re not done here yet.” As they moved forward, the darkness
seemed to
tighten around them, almost eager to hide its secrets. Quinn’s mind spun with questions—about the bone token, the etchings, the glimmer in the wall—and all the ways the pieces might fit together in a puzzle she didn’t yet understand. But she knew one thing for certain: whatever this was, it was bigger than a prank or a simple crime. And she
was starting
to
see that the
evidence—what was visible—was only a fragment of what the darkness concealed.
created_at
"2026-02-17T02:43:57.742089+00:00"
2/17/2026, 2:43:57 AM
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
392
moderation_latency
(null)
generation_time
12409
tokens_prompt
923
tokens_completion
1754
native_tokens_prompt
836
native_tokens_completion
1418
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0006508
router
(null)
provider_responses
0
id
"resp_02fc8dc2e0e8f6c4016993d5e0d4d08197b0c3bc3c0ed2d257"
status
200
is_byok
false
latency
120
endpoint_id
"9251cee5-5503-4be9-9439-7ae21ff062a3"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-nano-2025-04-14"
api_type
"completions"
id
"gen-1771296224-NN2jA88Y5nkEdVSkKqWy"
upstream_id
"resp_02fc8dc2e0e8f6c4016993d5e0d4d08197b0c3bc3c0ed2d257"
total_cost
0.0006508
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
7
adverbTagCount
3
adverbTags
0
"Elmira suggested softly [softly]"
1
"she said softly, voice echoing against the empty walls [softly]"
2
"she said quietly [quietly]"
27.6673%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1106
totalAiIsmAdverbs
8
found
0
adverb
"carefully"
count
1
1
adverb
"gently"
count
1
2
adverb
"nervously"
count
1
3
adverb
"slightly"
count
2
4
adverb
"softly"
count
3
highlights
0
"carefully"
1
"gently"
2
"nervously"
3
"slightly"
4
"softly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
0.0000%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1106
totalAiIsms
12
found
0
word
"flicker"
count
1
1
word
"flicked"
count
1
2
word
"glint"
count
1
3
word
"oppressive"
count
2
4
word
"otherworldly"
count
1
5
word
"pristine"
count
1
6
word
"unreadable"
count
1
7
word
"echoed"
count
1
8
word
"echoing"
count
1
9
word
"etched"
count
2
highlights
0
"flicker"
1
"flicked"
2
"glint"
3
"oppressive"
4
"otherworldly"
5
"pristine"
6
"unreadable"
7
"echoed"
8
"echoing"
9
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"air was thick with"
count
1
highlights
0
"The air was thick with"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
1
narrationSentences
81
matches
0
"looked nervous"
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
3
hedgeCount
3
narrationSentences
81
filterMatches
0
"feel"
1
"watch"
hedgeMatches
0
"appeared to"
1
"tried to"
2
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
91
matches
0
"see that the"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
33
mean
32.76
std
16.81
cv
0.513
sampleLengths
0
84
1
45
2
27
3
16
4
54
5
41
6
21
7
34
8
41
9
28
10
10
11
49
12
47
13
13
14
31
15
32
16
42
17
45
18
8
19
37
20
52
21
44
22
25
23
21
24
11
25
19
26
28
27
52
28
12
29
9
30
17
31
47
32
39
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
81
matches
0
"was meant"
1
"was staged"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
3
totalVerbs
162
matches
0
"was walking"
1
"were testing"
2
"was starting"
84.1182%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
945
adjectiveStacks
0
stackExamples
(empty)
adverbCount
52
adverbRatio
0.05502645502645503
lyAdverbCount
22
lyAdverbRatio
0.02328042328042328
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
91
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
91
mean
11.88
std
7.46
cv
0.628
sampleLengths
0
27
1
14
2
23
3
20
4
16
5
2
6
27
7
15
8
12
9
9
10
7
11
23
12
19
13
3
14
9
15
9
16
13
17
14
18
5
19
7
20
14
21
8
22
2
23
1
24
18
25
5
26
16
27
12
28
13
29
10
30
18
31
10
32
5
33
33
34
11
35
11
36
7
37
10
38
19
39
2
40
11
41
5
42
26
43
9
44
3
45
4
46
2
47
14
48
18
49
10
64.1026%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
5
diversityRatio
0.4175824175824176
totalSentences
91
uniqueOpeners
38
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
7
fancyCount
6
fancyTags
0
"Elmira began, voice tentative (begin, voice)"
1
"Quinn murmured, crouching to inspect the faint symbols (murmur, crouch inspect)"
2
"Elmira suggested softly (suggest softly)"
3
"she said softly, voice echoing against the empty walls (say softly, voice echoing)"
4
"she said quietly (say quietly)"
5
"Quinn replied, voice low and determined (reply, voice)"
75.0522%