NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQSEBDSJ8QGNVRX7CFZV2P
Run Details
Model:
GPT-4.1 Nano
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
11.5s
Cost:
$0.000602
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain was relentless, hammering down in stiff sheets that blurred the city’s flickering neon and dim streetlamps into smeared streaks. Detective Harlow Quinn parked her battered Ford just outside what looked like an abandoned tube station, its dark entrance yawning like a maw into the earth. A security barrier, flimsy but functional, blocked the door, and an uneven glow of flickering candles seeped from beneath the rusted grate. The scent of damp concrete and something sharper—perhaps old blood or something else—
hung in the air
. Harlow stepped out into the drizzle, her sharp
jaw clenched
tight. She adjusted her worn leather
watch
on her wrist as she approached, her boots splashing on the uneven pavement. Peering through the fence, her eyes locked onto the scene inside: a small perimeter had
been established
by uniformed officers, but their hurried, distracted presence suggested they didn’t quite know what they
were looking
at. Behind her, a voice called out: “Quinn. Over here.” She turned
sharply
, her brown eyes narrowing as she found her colleague, Detective Mark Sullivan, leaning against a battered metal railing, flipping through a battered notepad. “You’re late,” Sullivan said, voice low, with that familiar tone of nervous excitement. “No cameras, no witnesses. Looks like some kind of... ritual.” Harlow’s gaze swept the scene again. A makeshift altar lay in the center—a toppled crate covered in black cloth, candles scattered across the floor, some still smoldering. Near the altar, a circle of strange symbols, hastily drawn, had
been etched
into the concrete with what looked like chalk, or maybe ash. There was no sign of forced entry, no obvious signs of violence apart from the streaks of dried blood that spattered the ground. She crouched
slightly
, inspecting the symbols. They didn’t match anything she’d seen in the file cabinets or her past cases. Not simple graffiti, not gang markings. Still, they had a certain… energy. A pulse that made her skin crawl. She glanced at Sullivan. “No sign of a struggle? No personal effects?” He shook his head. “Nothing. No wallet, no phone, no obvious identification. Just this stuff. It’s almost as if… someone wanted us to find it, or maybe not even us.” Harlow’s gaze
flicked
over the scene once more. Her sharp eyes caught the faint imprint of something unusual. There, beneath the edge of the cloth covering the altar, a faint indentation, almost like a circle pressed into the concrete, too clean for the surrounding chaos. She reached out, kneeling
carefully
, her gloves brushing cold stone. The mark was too precise, not natural, almost like it had
been carved
after the fact. The rain intensifies, pattering against her coat and pooling in the uneven cracks. She could smell the metallic odor of blood, but it was faint, almost like a smell manufactured, not fresh enough for real murder. She looked up at the ceiling, where the tunnel’s arching concrete faded into darkness beyond the candlelight’s reach. “What’s your take?” she asked Sullivan, standing again. “This looks staged. Like someone was trying to make us see what they want us to see.” He hesitated, nervous. “Yeah, but I think there’s more here. I checked the perimeter identifications—they’re all locals, no outsiders. Still, I kept thinking, why hide here? Why not somewhere more... accessible?” Her brow
furrowed
. “Maybe because it’s protected, or tucked away in a place no one would bother to look. Or maybe they’re hiding something here.” She moved, slow and deliberate, scanning the walls, the ceiling. Her eyes caught faint, almost obscured symbols scratched into the concrete—too faint to make out clearly. She pulled out her phone, tapping a quick picture, knowing she’d have to analyze later, once she had a good lead. But then her gaze snagged on something else: a small brass compass lying at the edge of the altar, half-buried beneath a scattering of broken candles. She knelt again, brushing rain from her fingers and
carefully
retrieving it. The casing was tarnished, patina of verdigris creeping across its surface, the face
etched
with tiny protective sigils that
shimmered
faintly in the candlelight. Her eyes
flicked
to the needle—pointing,
unwavering
, toward the darkness beyond the tunnel, as if pulled by an unseen force. She frowned, eyes narrowing. “That’s new.” Sullivan looked over her shoulder, brow raised. “What, the compass?” “Yeah. It’s also locked onto something supernatural. I don’t think it’s a coincidence.” She pocketed the compass, feeling the faint warmth of its metal under her fingertips. “This is no ordinary setup. Someone’s trying to tell us something — or hide something.” He hesitated, returning to the perimeter. “But if it’s supernatural… why here? Why under Camden?” Harlow’s gaze sharpened. “Because whoever staged this knew exactly what they were doing. They want us to look in certain directions—not just at the symbols or the scene, but at what’s beyond.” She pulled out her handheld, switching to her city map, marking the nearby exits and tunnels. Then, her eyes landed on a detail Sullivan hadn’t noticed: a faint smear of ash trailing toward one side of the tunnel, as if someone had moved through, hurried or purposeful. She followed the line with her eyes, a fleeting suspicion gnawing at her. “We’re missing something,” she said quietly. “Something bigger. Whoever did this wants us to think it’s just some ritual, some ritualistic killing. But the signs don’t add up. Not with the way it’s staged.” She looked back at the symbols, then at the compass, feeling that familiar prickling along her skin, the sense that unseen eyes
were watching
her. Shadowed corners
seemed to
whisper secrets in the flickering candlelight, secrets she was only beginning to grasp. The rain
seemed to
loosen, finally, as if veiling the truth beneath its heavy curtain. But Harlow Quinn knew better: beneath the surface of something so simple as a crime scene, something dark and ancient slumbered—and she’d find it, tonight or in the depths to come.
created_at
"2026-02-17T02:45:19.83737+00:00"
2/17/2026, 2:45:19 AM
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
929
moderation_latency
(null)
generation_time
11406
tokens_prompt
923
tokens_completion
1611
native_tokens_prompt
836
native_tokens_completion
1296
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.000602
router
(null)
provider_responses
0
id
"resp_03f94944edd8f81f016993d633a8b48193a95f706f579163e4"
status
200
is_byok
false
latency
132
endpoint_id
"9251cee5-5503-4be9-9439-7ae21ff062a3"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-nano-2025-04-14"
api_type
"completions"
id
"gen-1771296307-tpd0M3jFOPsDfnZEsOYS"
upstream_id
"resp_03f94944edd8f81f016993d633a8b48193a95f706f579163e4"
total_cost
0.000602
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
50.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
4
adverbTagCount
1
adverbTags
0
"she said quietly [quietly]"
59.7180%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
993
totalAiIsmAdverbs
4
found
0
adverb
"carefully"
count
2
1
adverb
"sharply"
count
1
2
adverb
"slightly"
count
1
highlights
0
"carefully"
1
"sharply"
2
"slightly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
19.4361%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
993
totalAiIsms
8
found
0
word
"flicked"
count
2
1
word
"shimmered"
count
1
2
word
"unwavering"
count
1
3
word
"furrowed"
count
1
4
word
"jaw clenched"
count
1
5
word
"etched"
count
2
highlights
0
"flicked"
1
"shimmered"
2
"unwavering"
3
"furrowed"
4
"jaw clenched"
5
"etched"
66.6667%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
2
maxInWindow
2
found
0
label
"jaw/fists clenched"
count
1
1
label
"hung in the air"
count
1
highlights
0
"jaw clenched"
1
"hung in the air"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
53
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
2
narrationSentences
53
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
65
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
26
mean
37.88
std
21.36
cv
0.564
sampleLengths
0
85
1
65
2
9
3
26
4
23
5
74
6
39
7
12
8
30
9
71
10
54
11
25
12
31
13
25
14
47
15
62
16
20
17
6
18
10
19
42
20
15
21
32
22
60
23
34
24
42
25
46
95.5975%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
53
matches
0
"been established"
1
"been etched"
2
"been carved"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
2
totalVerbs
138
matches
0
"were looking"
1
"were watching"
94.1440%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
771
adjectiveStacks
0
stackExamples
(empty)
adverbCount
36
adverbRatio
0.04669260700389105
lyAdverbCount
10
lyAdverbRatio
0.01297016861219196
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
65
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
65
mean
15.15
std
8.92
cv
0.588
sampleLengths
0
21
1
26
2
22
3
16
4
11
5
19
6
35
7
9
8
26
9
13
10
10
11
6
12
21
13
24
14
23
15
6
16
14
17
6
18
5
19
1
20
7
21
4
22
8
23
4
24
26
25
8
26
10
27
27
28
10
29
16
30
13
31
23
32
18
33
8
34
17
35
3
36
28
37
3
38
22
39
10
40
16
41
21
42
26
43
12
44
24
45
20
46
4
47
2
48
7
49
3
73.8462%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
2
diversityRatio
0.46153846153846156
totalSentences
65
uniqueOpeners
30
50.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
4
fancyCount
2
fancyTags
0
"Sullivan said, voice low, with that familiar tone of nervous excitement (say, voice)"
1
"she said quietly (say quietly)"
82.9064%