NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQG1SYX9QBCZM32RD3S036
Run Details
Model:
GPT-4.1 Mini
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
13.5s
Cost:
$0.002627
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The underground chamber reeked of damp stone and something metallic—blood, Detective Harlow Quinn thought, the scent growing stronger as she descended the cracked concrete steps. The flickering glow of harsh fluorescent bulbs revealed a narrow platform long abandoned by the London Tube. Here, beneath Camden’s cluttered streets, the Veil Market once thrived in shadows—an unregistered labyrinth of ghosts and secrets. Harlow adjusted the collar of her trench coat and scanned the grim tableau before her. The body lay sprawled across the grimy tile, a young man in his thirties with dark eyes frozen wide in surprise. A deep gash carved its way from his temple to the base of his jaw. Forensics were still bagging evidence, but the crime scene was sterile overall—too sterile. Nothing crushed, no sign of scuffle, no weapon discarded, just this man, collapsed as if dropped from the sky. “Looks like a clean hit,” said DS Marshall, standing nearby with his notebook. His tone betrayed a hint of satisfaction, the kind that came when cases fitted neatly into established molds. “Single stab wound. No sign of resistance. Could be a contracted hit, or some type of professional execution.” Harlow narrowed her eyes. “No weapon? Nothing found near the body? That’s odd.” Her gaze drifted over the platform’s grime-streaked walls and the rusted iron rails below, rustling the stale air. Marshall shrugged. “Security cameras long since disabled in these tunnels. No witnesses, no footprints—someone knew their way around here like the back of their hand.” He flipped a page, eyes scanning his notes like a rehearsed script. Harlow’s brow
furrowed
. She crouched beside the corpse, studying the position of the wound. Something
wasn’t adding
up. The trajectory of the cut looked off—too clean, too precise. And close. A blade that deep would have required a violent lunge or the victim’s own movement. But his arms
were relaxed
, no defensive wounds. It was almost surgical. She stood, her sharp jaw tense. “Where’s the blood spatter? Nothing on the walls or the floor?” An uneasy silence settled. Marshall glanced sideways. “What do you mean? There’s pooled blood beneath him, and some droplets nearby.” She pointed to the dark pool soaking into the dirt by the man’s side. “No spray or scatter pattern. A violent stabbing always leaves some—” Her words died. Something caught the corner of her vision—a slight shimmer, subtle as a breath of smoke but out of place in this dim, stale space. She squinted, heart picking up a rhythm of caution. “Marshall.” She pulled an old brass object from her coat pocket, cradling it like a talisman. The Veil Compass, small and tarnished with verdigris, its
etched
face catching the flickering light. Its needle spun erratically before settling—pointing insistently toward the chipped tunnel wall behind the victim. “Out with it, Quinn. What’ve you got?” His skepticism was barely masked. “Look at this.” She stepped close to the wall, the compass needle twitching as if disturbed. Running her fingers over the chipped plaster, she found no crack or crevice. Then, her hand caught faint impressions—delicately carved sigils fading beneath layers of grime. Protective wards often hidden in unnoticed places. But this felt new, almost fresh. “That’s a marker,” Harlow said quietly. “Someone
etched
it recently. Made by Shade artisans—arcane signatures to cloak or open portals. This whole place smells of something deeper... supernatural.” Marshall scoffed, eyes darting around. “You’re chasing shadows, Quinn. This is a routine homicide. Somebody just wanted this guy dead.” She shook her head, pulling out a small flashlight and shining it along the wall, revealing a barely perceptible outline. A hollow—just wide enough for a person. The compass needle pointed dead center. “Someone here stepped through a door we don’t have access to. This victim? He didn’t just get stabbed—he might have been caught between worlds.” Marshall’s face paled
slightly
, but he masked it well. “Spare me the ghost stories. We need concrete evidence.” “Concrete evidence,” Harlow repeated, crouching to examine the man’s satchel. Inside lay a cluster of old books, their spines cracked and pages dog-eared. Ancient texts on ley lines, portal magic, and the occult. She recognized the handwriting scrawled on a notebook margin—the same as Eva Kowalski’s. Eva’s voice came
softly
from behind, tentative yet steady. “Detective. I knew you’d come.” Harlow spun around. Eva stood just inside the platform’s entrance, her fiery curls barely contained beneath a woolen scarf, face pale but resolute behind round glasses. The satchel at her side was worn but well cared for. “You shouldn’t be here,” Harlow said, voice low. “It’s dangerous.” Eva sniffed, her hand instinctively tucking a loose curl behind her ear. “I have to understand what’s happening. This man… he was tracking disturbances at the Market. He believed a rift was opening beneath Camden, one that could lead to something worse than just black market dealings.” Eva pulled a folded parchment from her satchel and handed it over. It was the same symbol carved faintly in the tunnel wall—an ancient Seal of Binding, used to contain entities that slipped through realm fractures. Harlow’s eyes traced the lines. “If that’s true…” She glanced back at the body. “This isn’t a normal kill. He might have been silenced for stumbling into things no human should.” Marshall’s jaw tightened. “All right. So maybe there’s more to this than we thought. But how do we prove it?” Harlow stood, steady now. “First, we confirm if that hollow’s a portal. And if something slipped through.” Eva nodded vigorously. “I’ve been tracking similar sigils in the Market’s vendors. Some of their wares aren’t just enchanted—they’re dangerous. I think the Market itself is shifting, unstable.” Harlow’s
watch
caught the light—a worn leather band wrapping her wrist, the face scratched but reliable. Time was never on their side. “We’ll return here after dark,” she said. “And we bring equipment.” Marshall looked between them, still unconvinced yet unwilling to leave the questions hanging. “Fine. But if this portal doesn’t exist, if we’re walking into another wild goose chase—” “We’ll deal with whatever’s waiting,” Harlow cut in, voice steel. “Because someone’s already paying the price.” Outside, the
echoes
of London’s bustling streets felt a world away. Here beneath Camden, the dead whispered secrets, and the line between the natural and unnatural blurred into shadows. And Harlow Quinn, with years of grit and scars both seen and hidden,
was prepared
to step across it all.
created_at
"2026-02-17T02:40:13.93796+00:00"
2/17/2026, 2:40:13 AM
model
"openai/gpt-4.1-mini-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
440
moderation_latency
(null)
generation_time
13408
tokens_prompt
923
tokens_completion
1803
native_tokens_prompt
836
native_tokens_completion
1433
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0026272
router
(null)
provider_responses
0
id
"resp_0a8d5a472c89e9c6016993d4ffdfcc8195896e9d7659cc466b"
status
200
is_byok
false
latency
107
endpoint_id
"872eccb7-9c85-45fc-974a-ff7c8e2407e6"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-mini-2025-04-14"
api_type
"completions"
id
"gen-1771295999-OVGD1m03oxb5car4Nttx"
upstream_id
"resp_0a8d5a472c89e9c6016993d4ffdfcc8195896e9d7659cc466b"
total_cost
0.0026272
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
77.7778%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
6
adverbTagCount
1
adverbTags
0
"Harlow said quietly [quietly]"
81.4815%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1080
totalAiIsmAdverbs
2
found
0
adverb
"slightly"
count
1
1
adverb
"softly"
count
1
highlights
0
"slightly"
1
"softly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
62.9630%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1080
totalAiIsms
4
found
0
word
"furrowed"
count
1
1
word
"echoes"
count
1
2
word
"etched"
count
2
highlights
0
"furrowed"
1
"echoes"
2
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
69
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
69
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
87
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
33
mean
32.21
std
16.57
cv
0.514
sampleLengths
0
60
1
83
2
49
3
31
4
37
5
57
6
17
7
20
8
25
9
36
10
46
11
12
12
55
13
28
14
20
15
33
16
24
17
18
18
46
19
14
20
37
21
10
22
47
23
36
24
31
25
20
26
17
27
28
28
22
29
11
30
28
31
16
32
49
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
69
matches
0
"were relaxed"
1
"was prepared"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
127
matches
0
"wasn’t adding"
87.8374%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
744
adjectiveStacks
1
stackExamples
0
"resolute behind round glasses."
adverbCount
35
adverbRatio
0.04704301075268817
lyAdverbCount
13
lyAdverbRatio
0.01747311827956989
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
87
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
87
mean
12.22
std
7.25
cv
0.594
sampleLengths
0
25
1
17
2
18
3
15
4
21
5
15
6
13
7
19
8
13
9
18
10
18
11
4
12
27
13
2
14
35
15
3
16
11
17
4
18
10
19
2
20
15
21
8
22
4
23
6
24
11
25
4
26
3
27
13
28
14
29
11
30
3
31
24
32
9
33
16
34
15
35
15
36
12
37
16
38
13
39
13
40
7
41
6
42
6
43
22
44
5
45
15
46
20
47
7
48
6
49
24
86.9732%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
1
diversityRatio
0.5287356321839081
totalSentences
87
uniqueOpeners
46
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
6
fancyCount
5
fancyTags
0
"said DS Marshall, standing nearby with his notebook (say stand)"
1
"Harlow said quietly (say quietly)"
2
"Harlow repeated, crouching to examine the man’s satchel (repeat, crouch examine)"
3
"Harlow said, voice low (say, voice)"
4
"Harlow cut in, voice steel (harlow cut in voice steel)"
88.0608%