NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQMPZZYKY3W40EJXBZNG45
Run Details
Model:
GPT-4.1 Nano
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
10.2s
Cost:
$0.000631
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The metallic hum of the underground station
echoed
faintly through the shadowed corridors, a ghostly drone that
seemed to
breathe with the station’s hidden heartbeat. Detective Harlow Quinn pushed through a sealed metal door, the faint scrape of her worn leather boots against the cracked tile floor
echoing
her deliberate effort to keep her presence low. She
flicked
her gaze around the dimly lit space — an abandoned Tube station beneath Camden, now disturbingly out of place in the quiet dawn. The air felt different here—thicker somehow,
tinged
with a strange electric pulse that prickled at her skin even beneath the cold
weight
of her coat. Her hand hovered near her side, close to her worn leather
watch
as if it might counteract the odd sensation. The scent was off, too—a faint coppery tang mixed with something foul, like spoiled herbs left to ferment in a forgotten cellar. In the middle of the deserted platform, a figure bent over something on the ground. Harlow’s sharp eyes identified it as her colleague, Detective
Marcus
Lane, a grumpy veteran with a nose for trouble. His lanky frame moved with eagerness, flashlight in one hand, a small plastic evidence bag in the other. “Lane,” she called
softly
, voice
echoing
in the cavernous space. She paused just behind him, arms crossed,
watch
ing him intently. “What have you got?” He looked up, blinking around the gloom. “You’re late. Already started workin’ the scene. Finds don’t quite add up, Quinn. Not like a typical murder.” Harlow stepped forward, her gaze narrowing as her eyes adjusted to the gloom. The victim lay sprawled on the cracked tiles—an adult male, roughly in his late forties, face frozen in a look of terror. Blood pooled beneath his head, dark and thick, spreading in a slow, uneven radius. But the odd thing was the absence of typical trauma. No bite marks, no gunshot, no stab wounds. Just a fresh, unnatural bruise spreading across his face, like a dark cloud had settled there, cold and unyielding. “You see what I see?” Lane half-rose, tilting his head. “Looks like he was asleep, maybe fainted—” “No,” Harlow interrupted, her brow
furrowing
. “This ain’t a fainting spell. Look at the placement of the bruise. Not hitting his face—more like something pressed against him. From the inside, maybe?” Lane shrugged, but she caught the
flicker
of uncertainty behind his eyes. She knelt, gloved fingers brushing the man's coat, scanning for clues. Her mind sifted through recent reports, crime statistics, her gut telling her something was off. Evidence usually told a story—often straightforward, sometimes layered with lies. Not this. Her gaze
flicked
to the edge of the platform, where a faint white dusting clung to the cracked concrete. Dust that shouldn’t be here, yet fit into no normal scene. Metaphysical perhaps? A residue? She reached into her coat pocket, pulling out the Veil Compass they’d acquired from the market—a tiny brass tool,
etched
with protective sigils, its needle trembling
slightly
as she wielded it above the body. The compass’s needle spun, jittering violently toward a corner. “Over here,” she said, voice barely a whisper. Lane followed, his brow creased. As she moved toward the faintly illuminated corner, the hairs on her arms prickled—an instinct that had served her during darker cases, and certainly more than once since that unexplainable incident with her partner. A supernatural energy hummed just beneath the surface, threatening to breach the mundane. There was something strange about this scene. The victim’s coat was oddly clean around the collar, no sign of struggle or dirt—yet evidence suggested a violent event. Her eyes
flicked
to the dust again—what if this dust was recent? Or... not dust at all? She crouched, scanning the floor
carefully
, noting a faint outline on the tiles, almost as if something had
been pressed
into them, then shifted away before it could leave a clear imprint. “What are you looking for?” Lane asked, voice
softly
skeptical. “An angle,” Harlow said. She drew her gaze across the scene,
slowly
, thinking. Her eye caught a faint
glint
near her feet—a small, near-invisible crack in the ceramic along the edge of the platform. As if something had
been pressed
or embedded there. She reached out, fingers tentative. The crack widened just enough for her to see a fragment hidden inside: a tiny piece of blackened metal, almost charred. It looked out of place against the surrounding dust, an artifact buried just beneath the surface. “Here,” she said
softly
, pulling the fragment free. Lane, now leaning in, muttered, “Looks like scrap from somethin’ burnt or corroded. Doesn’t match the scene. No sign of any fire, no burn marks on the victim.” Harlow shook her head
slowly
. “No. But what if—” Her voice faltered. “There’s nothing usual about this. It’s not just a murder. It’s something else.” Lane frowned, glancing around. “You think someone staged this?” “I think the whole scene’s been tampered with. The bruise, the dust, the fragment—none of it quite fits. This place isn’t just abandoned; it's been *changed.*” She pulled out her phone, dialing a number she knew by heart. When the line picked up, she kept her voice steady. “Quinn here. I need a scan for supernatural residue — the kind that leaves traces on metal or earth. Something odd about this scene, and I don’t think it’s human.” A tense pause, then a faint crackle of static. “Rely on the compass if you can. It’ll guide you to what’s hidden.” She lowered the phone and turned her gaze back to the scene. Something unseen lurked just beyond perception, fracturing the boundaries of what she knew. Her eyes
flicked
once more to the dust, the fragment, and the faint, trembling aim of the compass needle. Behind her, Lane hesitated, then
softly
questioned, “You think it’s supernatural?” Harlow kicked at a stray piece of debris, her mind racing. “I don’t know yet. But whatever it is, it’s not supposed to be here.” She stepped back, eyes fixed on the shadows stretching from the platform’s edge, feeling the
weight
of the unseen lurking just beneath the surface—and knowing, deep down, that she was closer than ever to something far darker than she’d ever faced.
created_at
"2026-02-17T02:42:43.362274+00:00"
2/17/2026, 2:42:43 AM
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
384
moderation_latency
(null)
generation_time
9907
tokens_prompt
923
tokens_completion
1658
native_tokens_prompt
836
native_tokens_completion
1369
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0006312
router
(null)
provider_responses
0
id
"resp_068c20d10c130d6c016993d598aa648197a80e4202b04c447b"
status
200
is_byok
false
latency
142
endpoint_id
"9251cee5-5503-4be9-9439-7ae21ff062a3"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-nano-2025-04-14"
api_type
"completions"
id
"gen-1771296152-NtsO7x2kXLrAQY0Rbs9p"
upstream_id
"resp_068c20d10c130d6c016993d598aa648197a80e4202b04c447b"
total_cost
0.0006312
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
8
adverbTagCount
4
adverbTags
0
"she called softly, voice echoing in the cavernous space [softly]"
1
"she said, voice barely a whisper [barely]"
2
"she said softly, pulling the fragment free [softly]"
3
"Behind her, Lane hesitated, then softly questioned, [then softly]"
23.3716%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1044
totalAiIsmAdverbs
8
found
0
adverb
"carefully"
count
1
1
adverb
"slightly"
count
1
2
adverb
"slowly"
count
2
3
adverb
"softly"
count
4
highlights
0
"carefully"
1
"slightly"
2
"slowly"
3
"softly"
80.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
0
"Marcus"
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
0.0000%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1044
totalAiIsms
14
found
0
word
"tinged"
count
1
1
word
"flicker"
count
1
2
word
"flicked"
count
4
3
word
"glint"
count
1
4
word
"furrowing"
count
1
5
word
"weight"
count
2
6
word
"echoed"
count
1
7
word
"echoing"
count
2
8
word
"etched"
count
1
highlights
0
"tinged"
1
"flicker"
2
"flicked"
3
"glint"
4
"furrowing"
5
"weight"
6
"echoed"
7
"echoing"
8
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
63
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
1
narrationSentences
63
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
73
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
29
mean
35.59
std
19.66
cv
0.553
sampleLengths
0
81
1
67
2
52
3
24
4
25
5
86
6
17
7
31
8
50
9
34
10
43
11
13
12
47
13
44
14
32
15
10
16
43
17
42
18
8
19
28
20
25
21
9
22
26
23
52
24
22
25
44
26
11
27
25
28
41
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
63
matches
0
"been pressed"
1
"been pressed"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
136
matches
(empty)
80.7837%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
840
adjectiveStacks
0
stackExamples
(empty)
adverbCount
49
adverbRatio
0.058333333333333334
lyAdverbCount
20
lyAdverbRatio
0.023809523809523808
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
73
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
73
mean
14.14
std
8.77
cv
0.621
sampleLengths
0
25
1
31
2
25
3
25
4
20
5
22
6
15
7
19
8
18
9
10
10
10
11
4
12
7
13
18
14
13
15
22
16
14
17
10
18
8
19
19
20
10
21
7
22
6
23
25
24
12
25
11
26
15
27
10
28
2
29
19
30
11
31
2
32
2
33
34
34
9
35
8
36
5
37
34
38
13
39
7
40
20
41
12
42
1
43
4
44
32
45
10
46
4
47
9
48
21
49
9
79.9087%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
0
diversityRatio
0.4794520547945205
totalSentences
73
uniqueOpeners
35
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
8
fancyCount
6
fancyTags
0
"she called softly, voice echoing in the cavernous space (call softly, voice echoing)"
1
"Harlow interrupted, her brow furrowing (interrupt, furrow)"
2
"she said, voice barely a whisper (say, voice barely)"
3
"she said softly, pulling the fragment free (say softly, pull)"
4
"Lane, now leaning in, muttered, (lean in, mutter)"
5
"Behind her, Lane hesitated, then softly questioned, (hesitate, softly question)"
74.3567%