NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQGKZAWRKJVPQ3E3N5A27B
Run Details
Model:
GPT-4.1 Nano
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
8.4s
Cost:
$0.000566
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The battered black cab pulled up outside the shadowed entrance of the abandoned Tube station beneath Camden. Detective Harlow Quinn stepped out, her shoulders squared, eyes sharp beneath a braid of salt-and-pepper close-cropped hair. The damp night air clung to her skin as she studied the scene with military precision, her worn leather
watch
glinting
faintly under the haze of streetlights. Behind her, Sergeant Ellis fidgeted with a trembling hand, gesturing toward the glowing boundary of yellow police tape. “It’s definitely here, Harlow. Same spot, right beneath the old market. Not a fragment of debris out of place, no sign of forced entry. Just... this.” He nodded toward the slick, dark doorway that yawned into darkness, casting shadows like spilled ink. Harlow’s gaze lingered on the entrance, muscles tense. “Nothing looks disturbed from the outside,” she said, voice measured, eyes narrowing. “No signs of a scuffle, no discarded items. It’s like someone just vanished into the ground.” Ellis hesitated, scratching his head. “We found her empty. No body, no sign she was even here.” Harlow flexed her fingers around the handle of her notebook, then looked past the police tape. The air pulsated with that faint, unnatural hum she could never quite explain — the kind that made her skin crawl. She had learned to trust her instincts, especially when they
whispered that something
was wrong. “Tell me again,” she said, voice low but steady. “What details do we have?” Ellis pulled out his phone, showing her the photo of the victim—a woman in her late twenties, dressed in a neat blazer and jeans, lying face down on the cracked concrete floor, eyes wide open in shock. No visible trauma, no signs of a struggle. Harlow crouched beside the body, noting the position, the slight sheen on her fingertips. The woman’s face was turned toward the ground, but her eyes stared upward, unblinking. She made an unsettling, unnatural impression—more like a doll than a person. “Her name’s Lena Moore,” Ellis added quietly. “Local artist, worked the markets during the day, lived just a few blocks away.” Harlow stood, scanning the area again,
sensing that the
details didn’t quite fit together. The corpse was
pristine
, — no debris on her clothes, no signs of dirt or blood—yet she had clearly fallen or
been placed
here. The ground beneath her was smooth, almost polished in some spots, as if the surface had been
deliberately
altered. She turned her gaze toward the dark opening of the station’s entrance, then to the nearby alley where faint, cryptic symbols
were etched
into the bricks—symbols she recognized from her occult research, remnants of underground dealings or something darker. “Anyone see her go inside?” she asked. Ellis shook his head, frowning. “No witnesses, no CCTV footage that shows anyone approaching her here.” Harlow’s eyes
flicked
to a small, smoking object near Lena’s outstr
etched
hand—a bone token, definitely a part of the underground market’s strange barter system. She knelt again, mind racing. Everything about this was off. Then she saw it—the faint shimmer on the ground, barely perceptible in the flickering lamplight. Her eyes traced a circular pattern just centimeters from Lena’s body, as if some invisible boundary had
been touched
or crossed. She stepped back, her sharp jaw tightening. “This isn’t just a fall,” she murmured. “Something’s influencing the scene.” “Maybe she was pushed,” Ellis suggested, voice tentative. Harlow shook her head. “No. Look closer.” Her gaze moved to Lena’s fingers. No signs of a struggle—no torn fabric, no scrapes—yet her hand rested near that bone token. A clue, perhaps, or a distraction. She moved her eyes to the nearby symbols, then to a faint tremor in the air—a whisper of something unnatural, almost like a ripple in a quiet pond. Her hand instinctively went to her side, where her signature detective tool, the Veil Compass, rested inside her coat pocket—an odd, small brass instrument with face
etched
in sigils, crafted by a Shade artisan. She hesitated briefly, then pulled it out and held it aloft. The needle twitched erratically, then steadied, pointing unwaveringly toward the dark opening beyond the boundary. The supernatural energy nearby was unmistakable. “Ellis,” she said
softly
, “we’re dealing with something more than a simple fall. This isn’t just environment. It’s a portal—probably linked to the Veil Market, or whatever’s beneath the surface.” His
eyes widened
. “You think she was taken?” “Or she went somewhere she shouldn’t have.” Harlow's tone carried the
weight
of someone who’d hunted shadows for nearly two decades. “We’re not just looking at a crime scene—this is a crossroads. And we're going to need more than just the usual forensics.” She crouched again, eyeing Lena’s still, unblinking face. Her gaze drifted to the
etched
symbols on the bricks, then to the bone token, the faint glow of supernatural energy rippling out from the opening. Her jaw tightened. “Keep everyone back,” she ordered. “We’re getting a specialist down here. And I
want that compass
with me. If it points toward something supernatural, we follow it. No matter what.” Ellis nodded, already dialing on his radio, but Harlow’s attention
was fixed
on the dark opening, her mind racing through what she knew—for now, the scene refused to give up its secrets. Because the evidence didn’t add up, and someone or something
was waiting
in the shadows, unseen but felt. And this case was only just beginning to tear open the veil hiding something far darker than a simple disappearance.
created_at
"2026-02-17T02:40:27.545744+00:00"
2/17/2026, 2:40:27 AM
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
367
moderation_latency
(null)
generation_time
8250
tokens_prompt
923
tokens_completion
1499
native_tokens_prompt
836
native_tokens_completion
1207
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0005664
router
(null)
provider_responses
0
id
"resp_0cfc51904af92b96016993d5127a9c8190b9e76b709121f488"
status
200
is_byok
false
latency
110
endpoint_id
"9251cee5-5503-4be9-9439-7ae21ff062a3"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-nano-2025-04-14"
api_type
"completions"
id
"gen-1771296018-pDjzJ1BBTc6cIC63xW9d"
upstream_id
"resp_0cfc51904af92b96016993d5127a9c8190b9e76b709121f488"
total_cost
0.0005664
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
50.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
8
adverbTagCount
2
adverbTags
0
"Ellis added quietly [quietly]"
1
"she said softly [softly]"
78.3080%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
922
totalAiIsmAdverbs
2
found
0
adverb
"deliberately"
count
1
1
adverb
"softly"
count
1
highlights
0
"deliberately"
1
"softly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
24.0781%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
922
totalAiIsms
7
found
0
word
"flicked"
count
1
1
word
"glinting"
count
1
2
word
"pristine"
count
1
3
word
"weight"
count
1
4
word
"etched"
count
3
highlights
0
"flicked"
1
"glinting"
2
"pristine"
3
"weight"
4
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"eyes widened/narrowed"
count
1
highlights
0
"eyes widened"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
51
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
51
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
3
totalSentences
60
matches
0
"whispered that something"
1
"sensing that the"
2
"want that compass"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
27
mean
33.63
std
16.99
cv
0.505
sampleLengths
0
61
1
60
2
36
3
17
4
52
5
14
6
45
7
40
8
21
9
57
10
39
11
7
12
16
13
34
14
36
15
18
16
8
17
35
18
73
19
21
20
30
21
8
22
43
23
37
24
30
25
32
26
38
81.0458%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
4
totalSentences
51
matches
0
"been placed"
1
"were etched"
2
"been touched"
3
"was fixed"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
117
matches
0
"was waiting"
86.0477%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
713
adjectiveStacks
1
stackExamples
0
"pepper close-cropped hair."
adverbCount
35
adverbRatio
0.04908835904628331
lyAdverbCount
14
lyAdverbRatio
0.019635343618513323
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
60
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
60
mean
15.13
std
9.63
cv
0.636
sampleLengths
0
17
1
17
2
27
3
18
4
42
5
8
6
12
7
16
8
5
9
12
10
16
11
21
12
15
13
9
14
5
15
37
16
8
17
14
18
14
19
12
20
7
21
14
22
14
23
24
24
19
25
39
26
7
27
5
28
11
29
24
30
5
31
5
32
15
33
21
34
7
35
7
36
4
37
8
38
4
39
9
40
16
41
6
42
28
43
34
44
11
45
15
46
6
47
30
48
3
49
5
68.3333%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
4
diversityRatio
0.45
totalSentences
60
uniqueOpeners
27
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
8
fancyCount
7
fancyTags
0
"she said, voice measured, eyes narrowing (say, voice narrow)"
1
"she said, voice low but steady (say, voice)"
2
"Ellis added quietly (add quietly)"
3
"she murmured (murmur)"
4
"Ellis suggested, voice tentative (suggest)"
5
"she said softly (say softly)"
6
"she ordered (order)"
81.6361%