NC Bench
Run: 2026-02-17-01KHMPCK1Y9S5SP9YC20XQ5FH9
Run Details
Model: GPT-4o, May 13th (temp=1)
Test: Bad Writing Habits
Scenario: Mystery: examining a crime scene
Duration: 16.5s
Cost: $0.031045
Provider: Azure
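The Cost figure can be cross-checked against the native token counts reported in the run metadata (836 prompt tokens, 1791 completion tokens). The sketch below assumes prices of $5 per million input tokens and $15 per million output tokens for this model; those prices are not stated anywhere on this page, and the helper name `cost_micro_usd` is hypothetical.

```python
# Assumed prices in USD per 1M tokens. These are NOT taken from this page.
INPUT_USD_PER_M = 5
OUTPUT_USD_PER_M = 15


def cost_micro_usd(prompt_tokens: int, completion_tokens: int) -> int:
    """Return the request cost in micro-dollars (1e-6 USD).

    tokens * (USD per 1M tokens) is exactly the cost in micro-dollars,
    so integer arithmetic avoids floating-point rounding.
    """
    return prompt_tokens * INPUT_USD_PER_M + completion_tokens * OUTPUT_USD_PER_M


# native_tokens_prompt and native_tokens_completion from the run metadata
micro = cost_micro_usd(836, 1791)
print(f"${micro / 1_000_000:.6f}")
```

Under these assumed prices the result agrees with the reported cost of $0.031045, which suggests billing is based on the native token counts rather than the normalized `tokens_prompt` and `tokens_completion` values.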
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. 
She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Detective Harlow Quinn stepped off the last rusting step and into the cool, stale air of the abandoned Tube station beneath Camden. Her footsteps echoed around her, bouncing off the tiled walls and metal railings with a hollow, lonely sound that matched the hollow ache in her chest. Military precision colored every inch of her bearing, from her sharp jaw set in determination to her eyes scanning the station's filthy corners for any sign of the evidence she needed.

Just beyond the platform edge, where the tracks disappeared into a yawning tunnel, a labyrinth of stalls and cobbled-together shops sprawled out. The Veil Market. Row upon row of rickety tables overflowing with trinkets and oddities that would make a banshee cackle with delight—or scream in terror. Harlow had been here before, under less dire circumstances. Tonight, it felt different. Charged. The air was filled with an acrid, almost burnt scent that turned her already steely gut into knots.

DS Eva Kowalski stood near an improvised barricade made of rotting barrels and old, graffitied signs. The younger detective adjusted her round glasses with one hand, the other resting nervously on the worn leather strap of her satchel. Locks of curly red hair slipped from behind her ear as she stroked it back repeatedly—a nervous habit Harlow had noticed many times before.

“Detective Quinn,” Eva called out, voice tight. “You’d better see this.”

Harlow's eyes narrowed as she approached, her boots crunching on the scattered detritus. She focused on the scene Eva pointed to. Sprawled on the ground, amid a scattering of alchemical tools and half-used candles, was what remained of Gregor Saadi. Gregor, one of the Market’s many peddlers of enchanted baubles and whisperer of hidden knowledge, lay lifeless and twisted at an unnatural angle.

His mouth was open in what should have been a scream, but no sound emerged when he died; there was only silence. Blood pooled around him, the dark stains marred with odd symbols that Harlow did not recognize. Her brown eyes flickered to Eva, who was chewing her lower lip.

“What happened?” Harlow’s voice was a controlled burn of urgency, masked by her usual calm demeanor.

“Witnesses say there was no scream, just a burst of light,” Eva replied, tucking another strand of hair behind her ear. “They took off running before anyone saw who—or what—did this.”

Harlow crouched down beside Gregor's body and examined the scene more closely. Gregor’s right hand clenched tightly around an object—a brass compass with a face etched in protective sigils. The verdigris patina on the casing caught the dim light as she pried it from his grasp, careful not to disturb the scene more than necessary. The needle, she noticed, was spinning wildly as though the entire Station was awash in supernatural energy.

“The Veil Compass,” she murmured, mostly to herself. “This thing points to supernatural rifts or portals.”

Eva nodded, her expression clouded. “Doesn’t make sense, though. The Compass is supposed to provide clarity, a direction. But this—” she gestured vaguely, “it’s as if everything here is deliberately obscured.”

Harlow rose, clutching the Compass. “You think it has something to do with the Market’s schedule? That it moves every full moon?”

“Maybe,” Eva said, chewing her lip again. “Or it could mean the veil between our world and another is breaking down.”

The Market was bustling as usual despite Gregor's lifeless form. Harlow looked around, scanning for anything that seemed out of place. The eclectic mix of supernatural color and mundane detritus made it difficult to pinpoint a disturbance when everything already seemed chaotic.

She noted the stall adjacent to Gregor’s, a shady setup with dusty trinkets and ancient scrolls. She moved closer and inspected something gleaming faintly in the low light. It was another compass, identical to the one Gregor clutched except this one had a crack running through its face. She glanced up at the stall owner, a gaunt figure with hollow eyes hidden beneath a hood.

“You carry these?” Harlow asked, holding up the cracked compass.

“Rarely,” the man croaked. “Crafted by Shadewalkers. Valuable. Dangerous.”

“Why dangerous?” Harlow pressed.

“Draws attention. They can pull unwanted things through if not used carefully,” he said, voice gravelly and barely above a whisper.

Harlow exchanged a glance with Eva, who looked no less uneasy. She kept the cracked compass with her as they carefully retraced their steps to the crime scene. Something nagged at her—the odd charring around the blood; the missing scream. It wasn't the Veil Compass alone at fault here.

“Eva, get a hold of Forensics,” Harlow instructed, beckoning the younger detective to step back from the perimeter. “We’re missing something here. The symbols—” she moved her hand over the dark stains, “they’re part of a sealing ritual, I think. Or a botched one.”

Eva nodded, already talking into her radio. “Do you think Gregor was trying to close a portal? Or open one?”

“Both,” Harlow muttered, squinting at the symbols. “But the crux of it is, he thought he had control. These sigils, however, are off.”

Eva returned after a brief exchange. “They’re sending a team down, but it’ll take at least twenty minutes. Traffic’s hell out there.”

Harlow glanced at the brass compass again, calculating. Something about this entire scene made her skin crawl, more than the abandoned Station and the Veil Market’s inherent creep would explain. She activated her tablet and took quick photographs, sending them to the department’s occult evaluator, hoping for a break.

“Do you trust anyone here?” Harlow asked Eva, her tone cold and cutting.

“Not particularly,” Eva replied, finally shouldering her satchel and tucking her hair back.

“Keep your eyes peeled. Someone erased their tracks well, which means they knew what they were doing. This place isn’t just a black market and carnival show—it's a dangerous game.”

Eva nodded, her demeanor changing, becoming less the nervous academic and more the vigilant officer. They continued to scan the surroundings, suddenly aware of the veiled eyes watching them from shadowy alcoves and behind fluttering stall curtains.

Minutes ticked by in tense silence before the Forensics team arrived, cutting through what had morphed from a market buzz to a nearly still, eerie quiet. The scene buzzed to life with activity, but Harlow's mind stayed on the cold, professional edge.

She moved among the stalls with a sense of urgency, speaking to some of the other Market peddlers while keeping an eye on Eva, who’d found an alcove full of spectral scrolls and archaic books. Finally, a clue came to fruition—a vendor selling arcane tomes mentioned seeing Gregor in a heated argument with a figure cloaked in shadow moments before the light burst.

Harlow pieced it together—Gregor had tried to use the Veil Compass to seal a rift but was interrupted. An interplanar breach occurred, one far beyond any simple market squabble. It hinted at a larger conspiracy—one involving advanced knowledge of the Veil and the supernatural energies bound within the Market.

As they wrapped up their immediate findings, Eva moved closer, her hand nervously tucking her hair back one final time for the night. “What’s the call now?” she asked in a tone that mingled with trepidation and expectation.

“We dig deeper,” Harlow replied, her jaw tightening. “Whoever killed Gregor had more at stake. We’re returning to the office, going over every scrap of evidence. Gregor might have been a bystander—or maybe he knew too much.”

Eva nodded, eyes reflecting green determination behind her glasses. “And the Market?”

“We watch. We listen. Full moon’s close, meaning it’ll move. We need to find out where, and we need to understand exactly what power players are involved,” Harlow said, her voice an ironclad promise. “No more unexplained deaths on my watch.”

As they stepped away, leaving the Forensics team to their work, Harlow felt the weight of DS Morris’s absence. This was the kind of puzzle that had taken him three years ago, left questions in its wake. But this time, she wouldn’t be blindsided.

The brass compass rested cold in her pocket, its needle finally still.
created_at: "2026-02-17T02:20:54.807212+00:00" (2/17/2026, 2:20:54 AM)
model: "openai/gpt-4o-2024-05-13"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 646
moderation_latency: (null)
generation_time: 16277
tokens_prompt: 923
tokens_completion: 2192
native_tokens_prompt: 836
native_tokens_completion: 1791
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.031045
router: (null)
provider_responses:
  0:
    id: "chatcmpl-DA4zW6UPUTdmiQ6vNHdnRN7O9xduo"
    status: 200
    is_byok: false
    latency: 335
    endpoint_id: "ef6b0afa-fd92-4db6-96e0-b8f0e5e4734b"
    provider_name: "Azure"
    model_permaslug: "openai/gpt-4o-2024-05-13"
api_type: "completions"
id: "gen-1771294837-xrV1uM4oTAjICkdWupt9"
upstream_id: "chatcmpl-DA4zW6UPUTdmiQ6vNHdnRN7O9xduo"
total_cost: 0.031045
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Azure"
Evaluation details
Result
Evaluator
Details
Meta Data
15.6863%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags: 17
adverbTagCount: 6
adverbTags:
  0: "she murmured, mostly to herself [mostly]"
  1: "she gestured vaguely [vaguely]"
  2: "Eva said, chewing her lip again [again]"
  3: "he said, voice gravelly and barely above a whisper [gravelly barely]"
  4: "Harlow instructed, beckoning the younger detective to step back from the perimeter [back]"
  5: "Eva replied, finally shouldering her satchel and tucking her hair back [finally]"
40.9594%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount: 1355
totalAiIsmAdverbs: 8
found:
  0: adverb "carefully", count 2
  1: adverb "deliberately", count 1
  2: adverb "nervously", count 2
  3: adverb "suddenly", count 1
  4: adverb "tightly", count 1
  5: adverb "barely above a whisper", count 1
highlights: "carefully", "deliberately", "nervously", "suddenly", "tightly", "barely above a whisper"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
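The two AI-ism frequency results in this run (40.9594% for adverbs with 8 hits, 33.5793% for words with 9 hits, both over 1355 words) stay under the 1% target yet score well below 100%. The reported numbers are consistent with a score that falls linearly from 100% at zero hits to 0% at the 1% limit. The sketch below illustrates that reading; the rule is inferred from the figures on this page, not taken from the benchmark's documentation, and `frequency_score` is a hypothetical name.

```python
def frequency_score(hits: int, word_count: int, limit: float = 0.01) -> float:
    """Linear score: 1.0 at zero hits, 0.0 once hits/word_count reaches `limit`.

    Assumption: this rule is inferred from the reported results only.
    """
    rate = hits / word_count
    return max(0.0, 1.0 - rate / limit)


print(f"{frequency_score(8, 1355):.4%}")  # AI-ism adverbs: 8 hits
print(f"{frequency_score(9, 1355):.4%}")  # AI-ism words: 9 hits
```

If this reading is right, each additional tracked word in a 1355-word response costs roughly 7.4 percentage points, so a handful of hits is enough to halve the score.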
33.5793%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount: 1355
totalAiIsms: 9
found:
  0: word "flickered", count 1
  1: word "gleaming", count 1
  2: word "spectral", count 1
  3: word "calculating", count 1
  4: word "trepidation", count 1
  5: word "weight", count 1
  6: word "footsteps", count 1
  7: word "echoed", count 1
  8: word "etched", count 1
highlights: "flickered", "gleaming", "spectral", "calculating", "trepidation", "weight", "footsteps", "echoed", "etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches: 1
maxInWindow: 1
found:
  0: label "eyes widened/narrowed", count 1
highlights: "eyes narrowed"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells: 0
narrationSentences: 75
matches: (empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount: 0
hedgeCount: 1
narrationSentences: 75
filterMatches: (empty)
hedgeMatches:
  0: "tried to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount: 0
totalSentences: 88
matches: (empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs: 38
mean: 35.32
std: 20.71
cv: 0.586
sampleLengths: 79, 79, 62, 11, 63, 50, 16, 31, 72, 16, 31, 22, 21, 42, 65, 10, 9, 4, 21, 49, 44, 20, 23, 22, 49, 13, 13, 30, 37, 42, 63, 49, 38, 37, 12, 41, 44, 12
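The paragraph-length statistics above can be reproduced from the 38 listed sample lengths. The sketch below computes the coefficient of variation (CV = standard deviation / mean) and assumes the evaluator uses the population standard deviation, since that is the variant that matches the reported 20.71; the variable names are illustrative.

```python
from statistics import mean, pstdev

# Paragraph word counts as listed under sampleLengths (38 values).
paragraph_lengths = [
    79, 79, 62, 11, 63, 50, 16, 31, 72, 16, 31, 22, 21, 42, 65, 10, 9, 4, 21,
    49, 44, 20, 23, 22, 49, 13, 13, 30, 37, 42, 63, 49, 38, 37, 12, 41, 44, 12,
]

m = mean(paragraph_lengths)    # arithmetic mean of the word counts
s = pstdev(paragraph_lengths)  # population standard deviation (divides by N)
cv = s / m                     # coefficient of variation

print(round(m, 2), round(s, 2), round(cv, 3))
print("meets target (CV >= 0.5):", cv >= 0.5)
```

Using the sample standard deviation (`statistics.stdev`, which divides by N - 1) would give about 20.99 instead, so the population form appears to be the one in use here.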
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount: 2
totalSentences: 75
matches:
  0: "was filled"
  1: "was interrupted"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount: 1
totalVerbs: 173
matches:
  0: "was spinning"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount: 1053
adjectiveStacks: 0
stackExamples: (empty)
adverbCount: 38
adverbRatio: 0.03608736942070275
lyAdverbCount: 21
lyAdverbRatio: 0.019943019943019943
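The two ratios above are plain quotients of the listed counts, and the target combines three conditions. A minimal sketch of that check follows; the variable names are illustrative, and treating the three conditions as a simple conjunction is an assumption about how the evaluator decides a pass.

```python
word_count = 1053       # wordCount reported for this evaluator
adverb_count = 38       # adverbCount
ly_adverb_count = 21    # lyAdverbCount
adjective_stacks = 0    # adjectiveStacks

adverb_ratio = adverb_count / word_count
ly_adverb_ratio = ly_adverb_count / word_count

# Target: <4% adverbs, <2% -ly adverbs, no adjective stacking.
passes = adverb_ratio < 0.04 and ly_adverb_ratio < 0.02 and adjective_stacks == 0

print(f"{adverb_ratio:.4%} {ly_adverb_ratio:.4%} {passes}")
```

Both ratios sit just under their limits (about 3.61% against 4%, and about 1.99% against 2%), so one more -ly adverb at this word count would push the second ratio past 2%. Note that this evaluator reports a wordCount of 1053, not the 1355 used by the AI-ism frequency evaluators; the page does not explain the difference.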
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences: 88
echoCount: 0
echoWords: (empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences: 88
mean: 15.25
std: 7.76
cv: 0.509
sampleLengths (indices 0-49): 22, 26, 31, 22, 3, 22, 9, 4, 1, 18, 16, 22, 24, 7, 4, 13, 8, 19, 23, 22, 16, 12, 16, 21, 10, 12, 17, 26, 17, 8, 8, 5, 26, 5, 17, 7, 14, 10, 11, 21, 16, 12, 20, 17, 10, 4, 5, 4, 21, 11
79.9242%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats: 3
diversityRatio: 0.5
totalSentences: 88
uniqueOpeners: 44
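Two results in this run are driven by a rate of dialogue tags measured against a ceiling: 15.6863% for adverbs in dialogue tags (6 of 17 tags, target ≤10%) and 0.0000% for dialogue tag variety (13 of 17 tags, target ≤30%). The first figure is consistent with a full score up to the target followed by a linear drop to zero over the next 30 percentage points. The sketch below illustrates that reading; the rule and the 0.30 ramp width are inferred from the reported numbers, not documented behavior, and `linear_falloff_score` is a hypothetical name.

```python
def linear_falloff_score(count: int, total: int, target: float, ramp: float = 0.30) -> float:
    """Score 1.0 while count/total <= target, then fall linearly to 0.0 over `ramp`.

    Assumption: both the shape and the ramp width are inferred from the
    results shown on this page.
    """
    rate = count / total
    excess = max(0.0, rate - target)
    return max(0.0, 1.0 - excess / ramp)


print(f"{linear_falloff_score(6, 17, 0.10):.4%}")   # adverbs in dialogue tags
print(f"{linear_falloff_score(13, 17, 0.30):.4%}")  # fancy dialogue tags
```

The second call returns zero because 13 of 17 tags (about 76%) exceeds the 30% target by more than the ramp. That agrees with the reported 0.0000%, although any ramp narrower than about 46 points would produce the same clamped result, so only the first figure actually pins the ramp width down.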
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags: 17
fancyCount: 13
fancyTags:
  0: "Eva called out, voice tight (call out, voice)"
  1: "Eva replied, tucking another strand of hair behind her ear (reply, tuck)"
  2: "she murmured, mostly to herself (murmur)"
  3: "she gestured vaguely (gesture vaguely)"
  4: "the man croaked (croak)"
  5: "pressed (pressed)"
  6: "he said, voice gravelly and barely above a whisper (say, voice gravelly)"
  7: "Harlow instructed, beckoning the younger detective to step back from the perimeter (instruct, beckon step)"
  8: "she moved her hand over the dark stains (move over stain)"
  9: "Harlow muttered, squinting at the symbols (mutter, squint)"
  10: "Eva replied, finally shouldering her satchel and tucking her hair back (reply, finally shoulder tuck)"
  11: "Harlow replied, her jaw tightening (reply, tighten)"
  12: "Harlow said, her voice an ironclad promise (say, voice)"
80.5970%