NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQXAVCZAT7XV9T6YX1MQZD
Run Details
Model:
Claude Haiku 4.5
Test:
Bad Writing Habits
Scenario:
Thriller: chase through city streets
Duration:
16.6s
Cost:
$0.008127
Provider:
Amazon Bedrock
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support"> <eyeColor>warm brown</eyeColor> <hair>short curly dark brown</hair> <height>5'10"</height> <age>29</age> <skinTone>olive skin</skinTone> <scar>scar running along his left forearm from a knife attack</scar> <medallion>Saint Christopher medallion around his neck</medallion> <birthplace>Seville, Spain</birthplace> <description>A former paramedic who provides off-the-books medical care for the clique.</description> <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory> </character> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> </codex> Write a fully fleshed out scene of at least 1000 words in the thriller genre. A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hit the streets of London like a personal vendetta, turning the asphalt into a mirror of reflected neon and sodium light. Detective Harlow Quinn ran, her worn leather
watch
slapping against her wrist in rhythm with her footfalls, water streaming down her sharp jawline and soaking through the collar of her coat. Behind her, the suspect—a thin man in a charcoal hoodie—
was pulling
away, his trainers splashing through puddles on Wardour Street. She'd spotted him leaving The Raven's Nest twenty minutes ago. Just a glimpse,
really
, but enough. Her instincts had
been honed
over eighteen years in the Met, and right now they
were screaming
. This was one of them. One of the ones connected to the clique she'd been tracking for the better part of two months. Her breath came hard and controlled. Military precision dictated every movement, even now, even as rain hammered down hard enough to blur her vision. She vaulted over a street-level barrier, cutting across the alley that ran parallel to the main road. The suspect didn't look back. That was interesting. Most runners looked back. Most panicked. This one just ran like he had somewhere to be, like he'd planned this exact route. They descended into Soho, deeper into the warren of streets where London kept its secrets. Quinn's shoes found purchase on wet cobblestones; her breath came white in the cold night air. She
was gaining
. Forty-one years old and she could still run, could still feel the burn in her legs and chest that meant she was alive, that she
was doing
something that mattered. The suspect cut left, into an alley so narrow Quinn could nearly touch both brick walls if she spread her arms. She followed, her hand hovering near the torch at her belt. Somewhere in the back of her mind—the part that still remembered DS Morris, her partner, the way he'd looked when they found him three years ago—a warning bell
was ringing
. This
was getting
deep. This
was getting
into territory she didn't have clearance for, didn't have backup for. The alley opened onto a street she didn't recognize. That shouldn't have been possible. She knew this city like she knew the scar tissue on her knuckles. But here they were, in some pocket of Soho that
seemed to
exist just outside her mental map. The suspect didn't pause. He ran straight toward what looked like a defunct Tube station entrance, the art deco signage so weathered the letters were barely legible. Quinn's jaw tightened. Every rational instinct told her to stop, to call for backup, to wait for daylight and proper channels. But Morris had taught her something about instinct. Morris had taught her that sometimes the rational choice was the wrong one. She followed the suspect down into darkness. The stairs descended at a steep angle, rain-slicked and ancient. Quinn's torch beam cut through the gloom, revealing tiled walls that must have been white once, decades ago. Now they were the color of old teeth, stained and cracked. The temperature dropped as she went deeper, and her breath became more visible against the thin beam of light. At the bottom, a platform stretched out into shadow. The Tube tunnels curved away into both directions, their mouths impossibly black. The suspect
was gone
. Quinn's hand found the torch again, and she swept the beam across the platform, searching for movement, for heat, for any sign of where he'd gone. That's when she heard it. Music, faint and warped, coming from somewhere beneath her feet. And underneath that, the smell—sharp spices, something like cinnamon mixed with something acrid she couldn't place. Something organic and wrong. Quinn descended further, following the platform around a curve. The wall on her left had changed. Instead of standard London Transport tile, there was stone—old stone, worked with symbols she didn't recognize. Her torch beam caught on them, and for a moment she felt something like electricity run through her spine. She'd never seen anything like them before, but something in her body reacted to them anyway. The music grew louder. So did the smell. The platform opened into a wider space, and Quinn's torch revealed the impossible: a market. An actual market, underground, beneath the Tube station that shouldn't have had any substructure below its platform. Stalls lined makeshift corridors. Hundreds of people moved through narrow passages, their faces obscured by hoods and shadow. Lanterns hanging from the ceiling cast everything in amber light, and everywhere Quinn looked, there were things that didn't belong in the rational world. A vendor at one stall
was selling
bottles filled with liquid that
seemed to
move of its own accord. Another
was dealing
in what looked like carved bone tokens, each one unique, each one pulsing with some kind of subtle luminescence. The suspect had disappeared into the crowd, but Quinn could see him now, near a stall marked with symbols that hurt to look at directly. She made a decision. Quinn stepped down from the platform onto the market floor, her hand resting on her warrant card even though she knew it would mean nothing here. The crowd
seemed to
part around her, though no one actively moved. It was more like the market itself
was creating
space for her passage, a kind of breathing acknowledgment. A vendor caught her eye. He was older, with a face scarred by something more than time, and he
was watching
her with an expression that suggested he'd been expecting someone like her. Not her specifically, perhaps, but something like her. The attention of the law. The attention of someone who didn't belong here. Behind him, partially obscured, Quinn saw a curtain of dark fabric, and beyond that, a light. Not electric light. Something older. Something alive. Her fingers tightened on the torch. She could feel the city above her, the rain still falling, the normal world carrying on. She could turn around now, climb back up those stairs, call it in, get a warrant, do this by the book. By the rules. The safe way. The way Morris had trained her to do things. The way Morris was dead. Quinn made a choice. She pushed deeper into the market, toward the suspect, toward the fabric curtain and whatever lay beyond it. The vendor didn't
try to
stop her. He just smiled, and in his smile was the certainty that she'd made exactly the right decision—or exactly the wrong one, and that maybe there wasn't any difference anymore. The warrant card in her pocket felt like a talisman from another world.
created_at
"2026-02-17T02:47:32.162003+00:00"
2/17/2026, 2:47:32 AM
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
951
moderation_latency
163
generation_time
16365
tokens_prompt
914
tokens_completion
1660
native_tokens_prompt
917
native_tokens_completion
1442
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.008127
router
(null)
provider_responses
0
id
"2ae919dc-db21-4f27-8981-8a6bb58229e3"
status
200
is_byok
false
latency
951
endpoint_id
"d000261c-5aeb-47d1-ac87-ad64dbe1cdb6"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1771296435-OohXUmsWvfVpkAgeWf6Q"
upstream_id
"2ae919dc-db21-4f27-8981-8a6bb58229e3"
total_cost
0.008127
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
0
adverbTagCount
0
adverbTags
(empty)
90.9584%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1106
totalAiIsmAdverbs
1
found
0
adverb
"really"
count
1
highlights
0
"really"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1106
totalAiIsms
0
found
(empty)
highlights
(empty)
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
88
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
4
narrationSentences
88
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
1
"try to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
88
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
23
mean
47.65
std
24.76
cv
0.52
sampleLengths
0
74
1
56
2
71
3
64
4
80
5
72
6
42
7
7
8
58
9
51
10
5
11
30
12
67
13
82
14
66
15
4
16
56
17
54
18
23
19
49
20
14
21
58
22
13
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
88
matches
0
"been honed"
1
"was gone"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
11
totalVerbs
176
matches
0
"was pulling"
1
"were screaming"
2
"was gaining"
3
"was doing"
4
"was ringing"
5
"was getting"
6
"was getting"
7
"was selling"
8
"was dealing"
9
"was creating"
10
"was watching"
98.6076%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1106
adjectiveStacks
0
stackExamples
(empty)
adverbCount
46
adverbRatio
0.04159132007233273
lyAdverbCount
10
lyAdverbRatio
0.009041591320072333
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
88
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
88
mean
12.45
std
7.89
cv
0.633
sampleLengths
0
23
1
31
2
20
3
10
4
6
5
17
6
5
7
18
8
6
9
18
10
17
11
5
12
3
13
4
14
2
15
16
16
15
17
16
18
3
19
30
20
21
21
11
22
30
23
4
24
14
25
9
26
5
27
13
28
18
29
4
30
23
31
3
32
18
33
8
34
13
35
7
36
10
37
18
38
11
39
19
40
9
41
12
42
4
43
26
44
5
45
10
46
16
47
4
48
9
49
7
57.1970%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
8
diversityRatio
0.3977272727272727
totalSentences
88
uniqueOpeners
35
100.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
0
fancyCount
0
fancyTags
(empty)
96.8684%