NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMPADW045CE1HHHJ0NP4TRZ
Run Details
Model:
GPT-4o, Aug. 6th (temp=0)
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
28.0s
Cost:
$0.016190
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn stepped off the last rung of the rusted ladder, her boots landing with a soft thud on the damp concrete floor of the abandoned Tube station.
The air was thick with
the scent of mildew and something else—something metallic and sharp that set her instincts on edge. She paused, taking in the scene with a practiced eye, her sharp jawline set in a determined line. The Veil Market, usually bustling with the
clandestine
energy of traders and buyers, was eerily silent. The stalls, draped in shadow, stood empty, their wares hidden beneath tarps and cloths. The only sound was the distant drip of water
echoing
through the tunnels. Harlow's brown eyes, keen and observant, swept over the scene, noting the details that others might overlook. A few yards away, Eva Kowalski stood with her back to Harlow, her curly red hair a bright contrast against the gloom. She
was speaking
in low tones to a uniformed officer, her hands gesturing animatedly. Harlow approached, her movements precise and deliberate, the worn leather
watch
on her wrist a comforting
weight
. "Detective Quinn," Eva greeted, turning to face her. Her green eyes, magnified by her round glasses, were wide with concern. She tucked a stray curl behind her left ear, a nervous habit Harlow had come to recognize. "This is... unusual, even for the Market." Harlow nodded, her gaze shifting to the officer, who
looked relieved
to see her. "What have we got?" she asked, her voice steady, betraying none of the unease she felt. The officer cleared his throat. "Body was found about an hour ago, ma'am. No ID, but the circumstances are... strange." "Strange how?" Harlow pressed, her mind already working through possibilities. Eva interjected, her voice
tinged
with the excitement of discovery. "There are signs of a ritual, Harlow. Symbols drawn in chalk, candles—it's like something out of one of my books." Harlow frowned, her skepticism warring with the evidence. "Show me." They led her to the center of the Market, where the body lay sprawled on the cold ground. A man, mid-thirties, his clothes disheveled and stained with something dark. Around him, a circle of symbols had
been drawn
, their lines precise and deliberate. Candles, now extinguished,
were placed
at intervals, their wax dripped and hardened. Harlow crouched beside the body, her eyes narrowing as she examined the scene. The symbols were unfamiliar, but there was a pattern to them, a logic that spoke of intent. She reached out, her fingers brushing the edge of one of the symbols, feeling the grit of the chalk beneath her skin. "These aren't just random markings," she murmured, more to herself than to the others. "There's a purpose here." Eva nodded, her excitement tempered by the gravity of the situation. "I've seen similar symbols in the archives. They're used to channel energy, to open... pathways." "Pathways?" Harlow
echoed
, her mind racing. "You mean portals?" Eva hesitated, then nodded. "It's possible. The Veil Market is known for its... connections to the supernatural." Harlow stood, her gaze sweeping the scene once more. The evidence was there, but it didn't add up. The man had no visible injuries, no signs of struggle. It was as if he had simply... stopped. "There's something missing," she said, her voice firm. "Something we're not seeing." Eva looked at her, a question in her eyes. "What do you mean?" Harlow gestured to the body, the symbols, the candles. "This is too clean. Too precise. Whoever did this knew exactly what they were doing." The officer shifted uncomfortably, his eyes darting between the two women. "You think this was planned?" Harlow nodded, her mind piecing together the fragments of the puzzle. "Yes. And I think it has something to do with the Market itself." Eva's
eyes widened
, her mind racing to keep up. "You think someone used the Market to... what? Open a portal?" "It's possible," Harlow said, her voice thoughtful. "But why? And why here, in the middle of the Market?" Eva glanced around, her gaze lingering on the shadows that clung to the edges of the station. "The Market moves every full moon. Maybe they needed a place that was... transient. Hard to trace." Harlow considered this, her mind turning over the possibilities. It made sense, in a way. The Market was a place of secrets, of hidden transactions and whispered deals. It was the perfect cover for something like this. But there was still something missing, something that nagged at the edges of her consciousness. She turned back to the body, her eyes narrowing as she studied the man's face. There was something familiar about him, something she couldn't quite place. "Eva," she said, her voice low. "Do you recognize him?" Eva shook her head, her expression puzzled. "No. Should I?" Harlow frowned, her mind working through the possibilities. "I don't know. But I have a feeling we've seen him before." The officer cleared his throat, drawing their attention. "Ma'am, we found this on him." He held out a small brass compass, its casing
etched
with sigils, the needle quivering
slightly
. Harlow took it, her fingers brushing the verdigris patina. "A Veil Compass," she murmured, recognition dawning. "It points to the nearest supernatural rift." Eva's
eyes widened
, understanding dawning. "He was trying to find a portal." Harlow nodded, her mind racing. "And he found one. But something went wrong." Eva's gaze shifted to the symbols, the candles, the body. "You think he was trying to open it himself?" "Maybe," Harlow said, her voice thoughtful. "Or maybe someone else was using him to do it." The officer shifted, his expression uneasy. "What do we do now, ma'am?" Harlow slipped the compass into her pocket, her mind already working through the next steps. "We find out who he is. And we find out who
wanted that portal
opened." Eva nodded, her expression determined. "And we find out why." Harlow met her gaze, a silent understanding passing between them. They were on the trail of something big, something that went beyond the usual boundaries of their world. And they wouldn't stop until they had answers. As they turned to leave, Harlow cast one last glance at the scene, her mind already piecing together the fragments of the mystery. The Veil Market held its secrets close, but she
was determined
to uncover them, one way or another.
created_at
"2026-02-17T02:19:55.579828+00:00"
2/17/2026, 2:19:55 AM
model
"openai/gpt-4o-2024-08-06"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
485
moderation_latency
(null)
generation_time
27460
tokens_prompt
923
tokens_completion
1651
native_tokens_prompt
836
native_tokens_completion
1410
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01619
router
(null)
provider_responses
0
id
"chatcmpl-DA4yNajr13gOEepV5arcxisLHPuM4"
status
200
is_byok
false
latency
421
endpoint_id
"9d15935a-34e6-4a5e-a5bc-c7dda213e876"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4o-2024-08-06"
api_type
"completions"
id
"gen-1771294767-rRj2fqzyWssfYWmV5st8"
upstream_id
"chatcmpl-DA4yNajr13gOEepV5arcxisLHPuM4"
total_cost
0.01619
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
96.2963%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
9
adverbTagCount
1
adverbTags
0
"she murmured, more to herself than to the others [more]"
90.5838%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1062
totalAiIsmAdverbs
1
found
0
adverb
"slightly"
count
1
highlights
0
"slightly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
43.5028%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1062
totalAiIsms
6
found
0
word
"tinged"
count
1
1
word
"clandestine"
count
1
2
word
"weight"
count
1
3
word
"echoed"
count
1
4
word
"echoing"
count
1
5
word
"etched"
count
1
highlights
0
"tinged"
1
"clandestine"
2
"weight"
3
"echoed"
4
"echoing"
5
"etched"
66.6667%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
3
maxInWindow
2
found
0
label
"eyes widened/narrowed"
count
2
1
label
"air was thick with"
count
1
highlights
0
"eyes widened"
1
"The air was thick with"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
1
narrationSentences
69
matches
0
"looked relieved"
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
69
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
96
matches
0
"wanted that portal"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
40
mean
26.48
std
15.38
cv
0.581
sampleLengths
0
68
1
60
2
53
3
44
4
30
5
20
6
10
7
30
8
10
9
55
10
52
11
18
12
26
13
9
14
17
15
36
16
12
17
13
18
24
19
16
20
24
21
20
22
18
23
34
24
37
25
41
26
10
27
10
28
20
29
30
30
23
31
12
32
13
33
19
34
16
35
12
36
30
37
10
38
36
39
41
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
69
matches
0
"been drawn"
1
"were placed"
2
"was determined"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
146
matches
0
"was speaking"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
786
adjectiveStacks
0
stackExamples
(empty)
adverbCount
19
adverbRatio
0.024173027989821884
lyAdverbCount
8
lyAdverbRatio
0.010178117048346057
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
96
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
96
mean
11.03
std
5.39
cv
0.489
sampleLengths
0
29
1
21
2
18
3
16
4
14
5
13
6
17
7
22
8
14
9
17
10
8
11
12
12
17
13
7
14
14
15
16
16
5
17
15
18
10
19
10
20
20
21
8
22
2
23
18
24
11
25
14
26
12
27
13
28
17
29
22
30
14
31
4
32
11
33
15
34
6
35
3
36
4
37
13
38
9
39
9
40
10
41
7
42
1
43
8
44
4
45
9
46
4
47
9
48
15
49
11
55.9028%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
3
diversityRatio
0.3541666666666667
totalSentences
96
uniqueOpeners
34
63.8889%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
9
fancyCount
4
fancyTags
0
"Eva greeted, turning to face her (greet, turn face)"
1
"pressed (pressed)"
2
"she murmured, more to herself than to the others (murmur)"
3
"she murmured, recognition dawning (murmur, dawn)"
89.2260%