NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMR2QRYN12Q6WQBNTBJR3NK
Run Details
Model:
GPT-4.1
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
59.5s
Cost:
$0.017896
Provider:
OpenAI
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn ducked beneath a drooping length of yellow police tape, mindful of rust flakes clinging to the archway overhead. Her boots landed with a hollow
echo
on the stone platform—old rails and ancient shadows folding in close like conspirators. She paused, knuckles brushing the face of her worn leather
watch
out of habit. The time didn't match the mood here, where sunlight had never touched the cracked tiling and every shiver of air tasted faintly of copper. Behind her, a handful of uniforms milled about in the dusk-hued gloom, high-visibility jackets
gleaming
like fish in a hidden cave. The abandoned Tube station beneath Camden—condemned, officially sealed, yet buzzing with illicit rumor—was not the typical theatre for a murder scene. Quinn’s silhouette, tall and crisp with military precision, seemed almost to rebuke the untidiness of the place. The scene itself unraveled farther down the platform. Light pooled from a pair of halogen work lamps, illuminating a body sprawled beside a toppled wooden crate. The corpse, male, mid-twenties at best, his face fixed in an expression of surprise. Blood, too little for the wound in the side of his neck, had dried in a Rorschach blot on the stone. A slender woman, red curls frizzing under the strain of humidity and stress, crouched beside the corpse. Round glasses slipped down her freckled nose as she rifled gloved fingers through a soggy sheaf of receipts. Eva Kowalski—aurora’s childhood friend, but tonight just another witness with too much to say. She glanced up, anxiety flickering in green eyes. "Detective Quinn, the—uh—I was… I mean, I found him. Like this." Quinn took in the nervous tuck as Eva slid a curl behind her left ear. She nodded but did not offer comfort. "You said you were here for research. For the British Museum?" "Yes." Eva’s voice cracked. "I was supposed to meet… someone, to collect a folio for the archives." She
was clutching
her satchel to her chest, leather edges nibbled by use. "He didn't show. Then I heard—well, I found him." A stocky constable, Choudhury, sidled over, pad in hand. "Detective, ME’s on her way—though God knows how she’ll get through that rat-warren. Found no ID. Just this." He dumped a small brass object into a sterile evidence bag. The compass’s casing glimmered dull green-blue under the lights; its face,
etched
with strange sigils, glinted. Quinn studied it, her gaze settling on the ornate protective runes. She felt the urge to turn it over, but resisted. "Where exactly did you find this?" "On the body. Clutched in his right hand." Quinn narrowed her eyes, gaze tracking details. The corpse’s fingers
were curled
, stained brass marking the palm. His left sleeve, though, retained a faint crescent-shaped patch—as though he often wore a
watch
, now missing. "Gloves when moving him, Choudhury," she said. "I want a print lift from both hands. Don’t assume the hand holding the compass is the one that usually does." "Already done, ma’am." Eva hovered, shifting her
weight
. "Should I… leave?" "You’ll stay until uniform takes your statement." Quinn’s tone left no room for argument. She stepped closer to the body, lowering herself on her haunches, ignoring the protest in her knees. At this angle, the wound opened before her—slit cleanly, just below the jaw, but not deep enough to kill outright unless something else intervened. There were no defensive wounds on the hands. His pockets had
been turned
out, but not ransacked. A phone, cheap burner, lay near the boot, screen spiderwebbed. Quinn reached for it with a gloved hand and thumbed the power. No signal. Choudhury nodded toward the far end of the platform, where a blur of motion indicated officers inspecting a battered iron gate. "Sergeant thinks perp escaped through there." Quinn followed his line of sight. The gate led not out, but deeper underground. Her lips thinned. "Record says this station’s been sealed for three decades. Who keeps a lock that new on a door that old?" She stepped past him, scanning for footprints in the patchy dust. Too many to count; but one set, booted, had tracked a grainy lime residue peculiar to the platform edge, then stopped abruptly half a meter from the body. She knelt again, nose wrinkling at the scent—alchemical, not chemical; sharper, almost mossy. Her gaze caught on a faint partition line on the stone, a groove as thin as a shoelace. She pressed a gloved finger along its length. The groove circled the entire crate like a warding boundary. The crate itself was fragrant with sandalwood, battered but once expensive. No shipping labels. "Did anyone see the crate moved?" Quinn asked, louder. A PC in the shadows replied, "No, ma’am—we found it overturned." Inside the crate, nothing but a layer of silt, black and faintly iridescent. She scraped a bit into a baggie. "Send this to the lab," she said. "Test for organic and inorganic traces. Prioritize for supernatural markers." Choudhury frowned. "With respect, ma’am—is that necessary? Looks like a mugging gone bad, if you ask me." Quinn straightened, eyeing the sprawling scene. "Does it? There’s barely a minute’s worth of blood. No splatter, no drag marks. Too neat. And our victim—" Her hand floated over the body’s shoulder, pointing at three neat punctures beneath the left clavicle, spaced just so. Eva inhaled
sharply
; Quinn caught her reaction. "Something wrong, Miss Kowalski?" Eva’s voice was small. "Those marks… I’ve seen them before. On a case file. Medieval, actually—records of illegal dueling, um, with enchanted implements." She bit her lip; Quinn stored the note away. "Thank you." Quinn’s gaze swept the rest of the scene—lamplight, dust, old advertising boards for War-era musicals, corners thick with darkness. She thought of her partner, Morris, and the London night three years lost—of wounds that did not close and explanations that didn’t fit statistics or training manuals. The back of her neck prickled. "Anything else on the compass?" she asked, addressing Choudhury. "Odd thing, ma’am. Needle spins when we bring metal close, but if you stand three paces west, it points hard north and won’t budge." He shrugged. Quinn accepted the evidence bag, turning the compass with care. The sigils, though scuffed, were unmistakable protection wards—old, possibly Shade craft, the kind one saw in seized Veil Market contraband. Quinn narrowed her eyes at the sergeant’s earlier assumption. "You think this is a straightforward mugging, Choudhury?" He shrugged. "Looks that way. No wallet, faint trace of struggle, no witnesses. Camden’s full of desperate sorts. Maybe something more exotic, but—" "But the facts don’t fit." Quinn’s voice carried, clipped and certain. "Look here—our victim came
through that entrance
, alone. He stops by the crate. No sign of forced entry, so he knew how to get in. He’s not here by accident. Nor are you, Miss Kowalski." Quinn fixed Eva in her gaze; the younger woman wilted
slightly
. "You said you were meeting someone. A seller?" "Yes," Eva whispered. "They used only first names. I never saw a face." "And the bone token?" Quinn raised an eyebrow. Eva startled. "I—how did you—" She clamped her lips shut. Quinn waited. The silence str
etched
until Eva surrendered, reaching into her satchel and producing a slim, pale token,
etched
with the same sigils as the compass. Quinn took it, weighing it in her palm; it was warm despite the chill. "The Veil Market," Quinn muttered. "No simple mugging, then." Her voice was iron. From the edge of the lamplight, the sound of a far door slamming
echo
ed up the tunnel—too loud, too deliberate, not just the settling bones of this place. Quinn’s eyes cut to Choudhury. "Seal every exit. Now." Choudhury moved, barking orders. The uniforms scattered, boots pounding against flagstones. Quinn crouched once more by the corpse, thoughts winding tighter. The compass, the token, the crate’s inlaid ward, the occult signatures—these were not accidents. The crime was a message, or perhaps a ritual interrupted. Someone wanted something moved through the Market beneath London, and someone else had paid for it with their throat. But the evidence here was incomplete, curated—a staging. She saw now what others missed: the blood
was pooled
, not thrown; the phone tossed too far from a dying hand; the absence of a
watch
left an unacknowledged tan line. Someone had taken only what they feared others might recognize. Only magic or madness could cleave a man so cleanly, or leave him so bereft of life and legend both. Quinn’s jaw set. She closed the evidence bag around the compass, careful not to break the seal. "The story’s wrong," she said to the air, to Eva, to her own memory of things lost beneath London. "And I intend to write the true one."
created_at
"2026-02-17T02:51:11.933932+00:00"
2/17/2026, 2:51:11 AM
model
"openai/gpt-4.1-2025-04-14"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
424
moderation_latency
(null)
generation_time
59442
tokens_prompt
923
tokens_completion
2301
native_tokens_prompt
836
native_tokens_completion
2028
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.017896
router
(null)
provider_responses
0
id
"resp_0e4a2c15d39946e7016993d76439a08190856f8b52bd6d12b1"
status
200
is_byok
false
latency
163
endpoint_id
"c235abe8-11cc-42d3-95ad-72f4d198287a"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-4.1-2025-04-14"
api_type
"completions"
id
"gen-1771296612-fPMCS8AVV5Qq9UlQYT5T"
upstream_id
"resp_0e4a2c15d39946e7016993d76439a08190856f8b52bd6d12b1"
total_cost
0.017896
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
8
adverbTagCount
0
adverbTags
(empty)
86.4223%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1473
totalAiIsmAdverbs
2
found
0
adverb
"sharply"
count
1
1
adverb
"slightly"
count
1
highlights
0
"sharply"
1
"slightly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
59.2668%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1473
totalAiIsms
6
found
0
word
"gleaming"
count
1
1
word
"weight"
count
1
2
word
"echo"
count
1
3
word
"echoed"
count
1
4
word
"etched"
count
2
highlights
0
"gleaming"
1
"weight"
2
"echo"
3
"echoed"
4
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
95
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
95
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
113
matches
0
"through that entrance"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
40
mean
36.15
std
22.87
cv
0.633
sampleLengths
0
79
1
59
2
61
3
68
4
33
5
39
6
54
7
27
8
8
9
62
10
3
11
8
12
31
13
65
14
27
15
76
16
63
17
9
18
11
19
37
20
17
21
51
22
4
23
32
24
54
25
9
26
26
27
47
28
23
29
65
30
13
31
8
32
10
33
40
34
13
35
37
36
11
37
53
38
69
39
44
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
3
totalSentences
95
matches
0
"were curled"
1
"been turned"
2
"was pooled"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
184
matches
0
"was clutching"
94.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1092
adjectiveStacks
1
stackExamples
0
"faint crescent-shaped patch—"
adverbCount
40
adverbRatio
0.03663003663003663
lyAdverbCount
11
lyAdverbRatio
0.010073260073260074
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
113
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
113
mean
12.8
std
8.37
cv
0.654
sampleLengths
0
21
1
20
2
14
3
24
4
21
5
21
6
17
7
8
8
18
9
14
10
21
11
17
12
18
13
14
14
8
15
11
16
15
17
7
18
11
19
4
20
26
21
9
22
9
23
29
24
16
25
11
26
10
27
6
28
8
29
7
30
10
31
17
32
7
33
21
34
3
35
5
36
3
37
14
38
17
39
24
40
8
41
9
42
10
43
12
44
2
45
21
46
6
47
6
48
8
49
3
69.0265%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
2
diversityRatio
0.4247787610619469
totalSentences
113
uniqueOpeners
48
81.2500%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
8
fancyCount
3
fancyTags
0
"Eva whispered (whisper)"
1
"Quinn muttered (mutter)"
2
"she said to the air, to Eva, to her own memory of things lost beneath London (say lose)"
93.5274%