NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQTBB36Z1R4EYS604X2G8D
Run Details
Model:
Claude Haiku 4.5
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
19.6s
Cost:
$0.008897
Provider:
Anthropic
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The abandoned Tube station smelled like decades of rust and stagnant water. Detective Harlow Quinn descended the crumbling platform stairs with
methodical
precision, her worn leather
watch
catching on the concrete railing. The beam of her torch cut through the darkness, illuminating decades of graffiti and the chalk outline of a body that was no longer there. DC
Sarah
Chen
stood near the tracks, wearing the careful expression of someone who'd already spent three hours at a crime scene and wasn't sure what she
was looking
at anymore. She glanced up as Quinn approached, relief flickering across her face. "Thought you might want to see this before we seal it up,"
Chen
said. "The body was found by some teenagers exploring at half past nine last night. Uniform responded, called it in at ten-fifteen." Quinn crouched beside the outline, her knees protesting with the kind of ache that came from forty-one years and too many crime scenes. The chalk was fresh, the measurements precise. She studied the placement—the body had
been positioned
with feet toward the tunnels, arms at its sides. Deliberate. Arranged. "ID?" Quinn asked. "None on him. No wallet, no phone, nothing. Still waiting on forensics to get back with dental records, but preliminary assessment suggests male, late twenties to early thirties, approximately six feet tall." Quinn stood, her eyes tracing the platform. The usual detritus of urban exploration lay scattered about—empty cans, a shoe with no mate, a tangle of something that might have once been clothing. But there were other things too. Her detective's eye caught them immediately. "The chalk line includes defensive wounds on the hands,"
Chen
continued. "Victim fought back. We found some trace evidence—hair fibers, possible skin under the fingernails. There's also this."
Chen
guided her to a spot about three meters from the body outline. Arranged in the dust and debris was a pattern. Not quite deliberate enough to be obvious, but deliberate enough that Quinn's internal alarm
began to
sound. It was a circle. Rough, broken in places, but unmistakably a circle. Inside it, something had
been drawn
with what looked like ash or powdered bone. Quinn couldn't quite make out the symbols—they were too degraded, smudged by footprints and time. "Might be nothing,"
Chen
offered, reading her silence. "Could've been there for months. Could be unrelated." Quinn disagreed, though she didn't say so yet. In eighteen years of decorated service, she'd
learned that coincidence
was the laziest explanation a detective could reach for. She photographed the circle with her personal camera, then knelt closer, careful not to disturb the pattern further. The symbols were wrong. Not like gang tags or the sort of crude drawings teenagers left in forgotten places. These had precision despite their degradation. Intention. The same intention she recognized in how the body had
been positioned
. She pulled out a small notebook from her jacket pocket, the leather soft from years of use. Without touching anything, she sk
etched
what she could see: a broken circle, the fragments of symbols inside. Curves and angles that suggested something systemic. Something organized. "There's more,"
Chen
said quietly. She held out an evidence bag. Inside was a small object, roughly the size of a woman's hand, crafted from brass. Even through the plastic, Quinn could see the patina of verdigris on its surface, the delicate work of the casing. The face of it
was etched
with symbols—not crude, but
intricate
. Professional. A compass. Quinn's chest tightened. She took the bag without touching it, held it up to the fluorescent work
lights that uniform
had set up around the crime scene. The needle didn't point north. It
wasn't pointing
anywhere in particular now—it swung
slowly
, as if searching. "Found it about two meters from the body,"
Chen
said. "Tucked into a crack in the platform edge, like someone tried to hide it. Had to be the killer—no reason a victim would position it there." Quinn's mind moved through the possibilities, each one clicking into place with a certainty that made her skin prickle. This wasn't a mugging gone wrong. This wasn't a gang initiation or a deal that turned violent. This was something else entirely. Something she'd hoped, three years after DS Morris
disappeared, that she'd
never have to face again. "Has anyone else been down here?" Quinn asked. "Besides uniform and the teenagers?" "Just us. I secured the scene immediately when I arrived. Why?" Quinn didn't answer immediately. She
was looking
at the pattern again, at how the chalk line suggested a body that had
been positioned
with care, almost reverence. She
was thinking
about Morris, about how he'd vanished from a secured building with no signs of forced entry or exit. About the way his case file had
been closed
before she could ask the right questions. "The defensive wounds," Quinn said. "Exact locations?"
Chen
consulted her notes. "Hands, forearms, one significant laceration on the right shoulder. Looks like he was fighting someone armed—knife, probably. The medical examiner suggested the blade was unusual. Didn't create a typical wound pattern." Unusual. That was one word for it. Quinn straightened, her jaw tightening with the kind of certainty that came from experience and dread in equal measure. She looked at
Chen
—young, capable, but with the untested confidence of someone who still believed the world operated according to rational rules. "This wasn't a street robbery," Quinn said. "The positioning, the circle, the compass—this was ritual. Someone brought this victim here
deliberately
. They fought, maybe tried to escape, but they lost. And whoever did this..." She paused, considering how much to say. "They knew exactly what they were doing." "Ritual,"
Chen
repeated
slowly
. "You think this is... what, some kind of cult killing?" Quinn studied the compass again, the way its needle moved without direction, searching. She thought about her options. She could hand this over to the occult crimes task force—a unit that officially didn't exist but that she'd had to contact once before, three years ago, in the aftermath of Morris's disappearance. She could pretend this was straightforward and let the case slip away, unsolved and rationalized into comfortable ambiguity. Or she could do what she'd sworn to do when she made detective: follow the evidence wherever it led. "Get everything catalogued and bagged," Quinn said. "And
Chen
? Keep the compass secure. Personal custody—don't let it go to standard evidence lockup." "Why? What is it?" Quinn looked at her colleague, measuring how much the truth would help or hurt. "Something from a part of this city most people don't know exists," she said finally. "And if I'm right, we've just opened a door we're not going to be able to close." She pulled out her phone and scrolled through her contacts until she found the number she'd hoped she'd never have to call again. She'd made it two years—two years of relative quiet, of bodies that made sense, of crimes with rational explanations. That quiet was over.
created_at
"2026-02-17T02:45:57.23602+00:00"
2/17/2026, 2:45:57 AM
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
423
moderation_latency
(null)
generation_time
19477
tokens_prompt
923
tokens_completion
1845
native_tokens_prompt
937
native_tokens_completion
1592
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.008897
router
(null)
provider_responses
0
id
"msg_01PNZwXXQpepGJYenj6Z6siV"
status
200
is_byok
false
latency
423
endpoint_id
"41d2915a-92e6-4993-b537-210b4e10cba8"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1771296337-vCodnro0uGaRScGZF5lo"
upstream_id
"msg_01PNZwXXQpepGJYenj6Z6siV"
total_cost
0.008897
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
50.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
12
adverbTagCount
3
adverbTags
0
"Chen said quietly [quietly]"
1
"Chen repeated slowly [slowly]"
2
"she said finally [finally]"
74.3590%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1170
totalAiIsmAdverbs
3
found
0
adverb
"deliberately"
count
1
1
adverb
"slowly"
count
2
highlights
0
"deliberately"
1
"slowly"
60.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
0
"Chen"
1
"Sarah"
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
74.3590%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1170
totalAiIsms
3
found
0
word
"methodical"
count
1
1
word
"intricate"
count
1
2
word
"etched"
count
1
highlights
0
"methodical"
1
"intricate"
2
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
79
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
1
narrationSentences
79
filterMatches
0
"watch"
hedgeMatches
0
"began to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
3
totalSentences
93
matches
0
"learned that coincidence"
1
"lights that uniform"
2
"disappeared, that she'd"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
35
mean
33
std
18.75
cv
0.568
sampleLengths
0
57
1
42
2
35
3
49
4
3
5
32
6
44
7
28
8
39
9
41
10
16
11
45
12
38
13
43
14
58
15
2
16
44
17
36
18
57
19
13
20
11
21
64
22
7
23
35
24
7
25
41
26
48
27
14
28
69
29
19
30
22
31
4
32
46
33
42
34
4
82.7004%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
6
totalSentences
79
matches
0
"been positioned"
1
"been drawn"
2
"been positioned"
3
"was etched"
4
"been positioned"
5
"been closed"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
4
totalVerbs
153
matches
0
"was looking"
1
"wasn't pointing"
2
"was looking"
3
"was thinking"
94.4907%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
864
adjectiveStacks
0
stackExamples
(empty)
adverbCount
40
adverbRatio
0.046296296296296294
lyAdverbCount
10
lyAdverbRatio
0.011574074074074073
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
93
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
93
mean
12.42
std
8.23
cv
0.663
sampleLengths
0
12
1
20
2
25
3
31
4
11
5
14
6
21
7
23
8
7
9
17
10
1
11
1
12
3
13
32
14
7
15
25
16
6
17
6
18
11
19
17
20
13
21
9
22
17
23
4
24
8
25
14
26
15
27
8
28
8
29
8
30
19
31
18
32
4
33
15
34
6
35
1
36
12
37
17
38
17
39
7
40
2
41
5
42
6
43
15
44
20
45
11
46
1
47
2
48
3
49
24
79.5699%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
6
diversityRatio
0.5161290322580645
totalSentences
93
uniqueOpeners
48
70.8333%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
12
fancyCount
5
fancyTags
0
"Chen continued (continue)"
1
"Chen offered, reading her silence (offer, read)"
2
"Chen said quietly (say quietly)"
3
"Chen repeated slowly (repeat slowly)"
4
"she said finally (say finally)"
87.4301%