NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQJWZA637QMN14X6M81AF1
Run Details
Model:
Claude Haiku 4.5
Test:
Bad Writing Habits
Scenario:
Thriller: chase through city streets
Duration:
21.7s
Cost:
$0.009962
Provider:
Anthropic
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support"> <eyeColor>warm brown</eyeColor> <hair>short curly dark brown</hair> <height>5'10"</height> <age>29</age> <skinTone>olive skin</skinTone> <scar>scar running along his left forearm from a knife attack</scar> <medallion>Saint Christopher medallion around his neck</medallion> <birthplace>Seville, Spain</birthplace> <description>A former paramedic who provides off-the-books medical care for the clique.</description> <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory> </character> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> </codex> Write a fully fleshed out scene of at least 1000 words in the thriller genre. A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain came down in sheets, turning the Soho streets into rivers of reflected neon. Detective Harlow Quinn's shoes splashed through puddles thick with city grime as she pushed through the late-night crowd, her eyes locked on the figure ahead. Tomás Herrera. The paramedic who'd vanished from three different crime scenes in the past six months. He'd been tailing the entrance to The Raven's Nest when Quinn spotted him, and the moment their eyes met across the rain-slicked pavement, he'd bolted. Quinn's worn leather
watch
caught the light from a passing taxi—11:47 PM—as she lengthened her stride. Twenty-three years on the force, eighteen of them decorated service, and her knees still didn't complain when she moved like this. Herrera was fast, but he
was panicking
. Panicked people made mistakes. He cut left onto a narrow side street, the kind that tourists never found. Quinn knew these alleys intimately—every corner, every fire escape, every dead end that would trip up someone unfamiliar with the geography. She'd run down dozens of suspects through these streets. She could do it in her sleep. The rain intensified. Her short salt-and-pepper hair clung to her scalp, water streaming down her sharp jawline. Herrera's silhouette was only twenty meters ahead now, his curly dark hair plastered to his head, his olive skin slick as he stumbled
slightly
on the wet pavement. "Police!" Quinn shouted, more out of habit than hope. Herrera didn't slow. He reached the corner of Bateman Street and instead of turning, he disappeared. Quinn's muscles tensed. She'd run this exact grid a hundred times. There was nowhere to dis
appear to
. She rounded the corner hard, her breath coming in controlled bursts, and found nothing but empty street. The rain drummed against the pavement like a thousand impatient fingers. Then she saw it—the grate. A basement entrance she'd passed a thousand times without
truly
seeing. The kind of thing London was full of, forgotten infrastructure from forgotten times. Herrera had left it open. Quinn's hand instinctively went to her hip, checking the position of her service weapon. She'd been thirty-four years old when she lost DS Morris. Six months into that case, he'd gone into somewhere he shouldn't have, investigating leads that made no sense, talking about things that couldn't possibly be real. Three years later, they still hadn't found his body. The case file gathered dust in her office, a personal mission that her superiors had quietly filed away. She'd spent those three years telling herself that she didn't believe in the impossible. That there were rational explanations for everything. That the world operated according to laws she understood. The grate descended into darkness. Quinn pulled her phone from her jacket pocket, activated the flashlight, and peered down. Concrete stairs, painted institutional green decades ago, now faded and peeling. Water trickled down the walls, carrying the smell of earth and something else—something that made her sinuses prickle. Chemical. Old. She could call for backup. Standard procedure. Except that Herrera had evaded three crime scenes, and each time, the evidence had vanished. He was part of something larger, something that moved faster than the Metropolitan Police could react. If she called for backup, he'd disappear again. Her sharp
jaw clenched
. Morris had made that same calculus, probably in this same part of the city. Morris had thought he could handle things alone. Quinn descended. The stairs
seemed to
stretch longer than they should, longer than the building's foundation would logically allow. After what felt like a dozen steps, the concrete gave way to something older—stone, worn smooth by time and
footsteps
. The water that had trickled down the walls earlier ran more freely here, forming rivulets that flowed deeper into the darkness. Her flashlight beam caught the
glint
of moisture on the walls, and she realized with a strange twist in her stomach that they weren't painted green anymore. They
were covered
in mold, or algae, or something that grew in places light didn't reach. Her breathing sounded loud in the confined space. At the bottom of the stairs, a tunnel opened up. Not a modern Tube tunnel—she'd grown up in London, knew the maps of the Underground the way other people knew their own homes—but something older. Victorian, maybe. The curved brick of the walls arched overhead in a pattern that suggested purpose beyond modern utility. Gas lamps, now converted to electric fixtures, hung at irregular intervals along the ceiling. Most of them were dark. Her flashlight beam played across one that still functioned, casting deep shadows that made the tunnel
seem to
breathe. She could hear voices ahead. Multiple voices. That stopped her cold. Quinn had
been trained
to move toward threats, not away from them. But something in the tone of those voices—the cadence, the underlying current of
anticipation
—made the hair on her arms stand on end. She
wasn't hearing
normal conversation. She was hearing the sound of people who were expecting something. Waiting for something. She pressed herself against the tunnel wall and moved forward, her breathing shallow. The voices grew clearer as she approached. She caught fragments. "—token for entry?" "New blood tonight?" "Market opens at midnight. Silas is already setting up the ward stones." The Veil Market. She'd heard whispers about it over the past few years, rumors that the regular folks at the Nest would exchange in low voices when they thought she
wasn't listening
. An underground market that moved locations every full moon, where you could buy things that didn't officially exist. Where you could find people who didn't officially exist. A place her missing partner might have gone. The tunnel opened into a vast space that her mind struggled to process. It was too large to fit beneath Camden, too impossibly vast. The walls stretched up into darkness that her flashlight couldn't pierce, and the floor was covered with market stalls arranged in concentric circles. Dozens of them. Hundreds. Each one glowing with the kind of light that didn't have a visible source. And everywhere—everywhere—there were people. Or things that
were shaped
like people. Her detective's mind registered the details: the woman with too many fingers adjusting bottles at a stall, the man whose skin
seemed to
shift between colors, the child whose eyes reflected her flashlight beam like a cat's. Quinn's hand went to her weapon without conscious thought. Tomás Herrera stood near the center of the market, no longer running. He
was talking
to an older man in an expensive coat, gesturing urgently. When Herrera's eyes found her across the market floor, he pointed. Every face in the market turned toward her. Quinn's
watch
ticked audibly against her wrist. 11:53 PM. Time kept moving forward, even in impossible places. Especially in impossible places. That was the thought that
crystallized
in her mind as she stood at the threshold of the tunnel, one hand on her weapon, the other still holding her phone's flashlight like a candle against the dark. She'd
been trained
to protect civilians from criminals. She'd never
been trained
for this. Quinn's jaw tightened as she made the calculation. Herrera was here, in the market. That made him a witness, at minimum. It made him a lead she couldn't afford to lose. She'd spent three years
watch
ing for impossible things, for the cracks in reality where Morris had fallen through. She'd built her career on rational explanation and evidence-based investigation. But evidence was everywhere now. Unavoidable. Staring her in the face across a market that shouldn't exist in a tunnel beneath a city she thought she understood. She stepped forward into the market. Behind her, the tunnel entrance
seemed to
close, or perhaps just to darken. Quinn didn't look back. Her eyes tracked Herrera as he backed away, moving deeper into the impossible crowd, and she followed—a detective chasing a suspect into territory where the rules of her world no longer applied, where the only thing that still made sense was the pursuit itself.
created_at
"2026-02-17T02:41:55.796754+00:00"
2/17/2026, 2:41:55 AM
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
429
moderation_latency
(null)
generation_time
21629
tokens_prompt
914
tokens_completion
2115
native_tokens_prompt
917
native_tokens_completion
1809
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009962
router
(null)
provider_responses
0
id
"msg_01U4oiKJnvzK5Res44YWskju"
status
200
is_byok
false
latency
427
endpoint_id
"41d2915a-92e6-4993-b537-210b4e10cba8"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1771296093-usVfqqXAcTw8OFUa9EIE"
upstream_id
"msg_01U4oiKJnvzK5Res44YWskju"
total_cost
0.009962
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
1
adverbTagCount
0
adverbTags
(empty)
85.1190%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1344
totalAiIsmAdverbs
2
found
0
adverb
"slightly"
count
1
1
adverb
"truly"
count
1
highlights
0
"slightly"
1
"truly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
62.7976%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1344
totalAiIsms
5
found
0
word
"glint"
count
1
1
word
"crystallized"
count
1
2
word
"anticipation"
count
1
3
word
"jaw clenched"
count
1
4
word
"footsteps"
count
1
highlights
0
"glint"
1
"crystallized"
2
"anticipation"
3
"jaw clenched"
4
"footsteps"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"jaw/fists clenched"
count
1
highlights
0
"jaw clenched"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
114
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
5
narrationSentences
114
filterMatches
0
"watch"
hedgeMatches
0
"appear to"
1
"seemed to"
2
"seem to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
117
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
39
mean
33.92
std
25.9
cv
0.763
sampleLengths
0
56
1
25
2
48
3
51
4
45
5
12
6
13
7
45
8
34
9
77
10
30
11
5
12
45
13
46
14
26
15
2
16
101
17
8
18
92
19
11
20
53
21
23
22
3
23
3
24
12
25
59
26
8
27
65
28
48
29
9
30
36
31
8
32
57
33
8
34
6
35
59
36
27
37
6
38
61
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
5
totalSentences
114
matches
0
"were covered"
1
"been trained"
2
"were shaped"
3
"been trained"
4
"been trained"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
6
totalVerbs
223
matches
0
"was panicking"
1
"wasn't hearing"
2
"was hearing were expecting"
3
"wasn't listening"
4
"was talking"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1324
adjectiveStacks
0
stackExamples
(empty)
adverbCount
48
adverbRatio
0.03625377643504532
lyAdverbCount
18
lyAdverbRatio
0.013595166163141994
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
117
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
117
mean
11.31
std
8.14
cv
0.72
sampleLengths
0
15
1
25
2
2
3
14
4
25
5
16
6
21
7
7
8
4
9
14
10
21
11
9
12
7
13
3
14
14
15
28
16
9
17
3
18
13
19
3
20
8
21
6
22
17
23
11
24
5
25
11
26
13
27
5
28
14
29
10
30
26
31
9
32
18
33
14
34
7
35
9
36
5
37
14
38
11
39
18
40
1
41
1
42
5
43
2
44
15
45
16
46
8
47
4
48
14
49
8
71.7949%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
6
diversityRatio
0.46153846153846156
totalSentences
117
uniqueOpeners
54
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
1
fancyCount
1
fancyTags
0
"shouted (shout)"
89.3948%