NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQZQXE4JMMXVZX8WDT49FS
Run Details
Model:
Claude Sonnet 4.5
Test:
Bad Writing Habits
Scenario:
Thriller: chase through city streets
Duration:
33.4s
Cost:
$0.025446
Provider:
Amazon Bedrock
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support"> <eyeColor>warm brown</eyeColor> <hair>short curly dark brown</hair> <height>5'10"</height> <age>29</age> <skinTone>olive skin</skinTone> <scar>scar running along his left forearm from a knife attack</scar> <medallion>Saint Christopher medallion around his neck</medallion> <birthplace>Seville, Spain</birthplace> <description>A former paramedic who provides off-the-books medical care for the clique.</description> <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory> </character> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> </codex> Write a fully fleshed out scene of at least 1000 words in the thriller genre. A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered down on Harlow's shoulders as she sprinted across Chalk Farm Road, her boots splashing through puddles that reflected the amber glow of streetlights. Fifty meters ahead, Tomás Herrera dodged between late-night pedestrians, his Saint Christopher medallion catching the light as he glanced back. "Police! Stop!" Her voice cut through the downpour, sharp and commanding. Eighteen years on the force had taught her that suspects rarely complied, but procedure demanded she give the order. Herrera didn't stop. He veered left into an alley, his olive-skinned hand grabbing a drainpipe to swing himself around the corner. Harlow pushed harder, her worn leather
watch
slapping against her wrist with each pump of her arms. The military precision of her movements kept her breathing steady even as her heart rate climbed. This was the third time this month she'd spotted someone from that damned clique—the group she'd been investigating since finding their symbol at DS Morris's death scene three years ago. The symbol she still didn't understand. The alley narrowed, forcing her to turn sideways past overflowing bins. The smell of rotting food mixed with the clean scent of rain. Ahead, Herrera reached a brick wall and scrambled up a fire escape, metal clanging under his
weight
. Harlow grabbed the ladder and hauled herself up, ignoring the rust that flaked off under her palms. Her brown eyes tracked every movement, cataloging details the way she'd
been trained
. The scar on his left forearm. The way he favored his right leg
slightly
. The medical kit bag bouncing against his hip. She'd been
watch
ing The Raven's Nest for weeks, that dingy bar in Soho with its green neon sign and its parade of people who didn't quite fit. People who whispered in corners and left through back doors. People like Herrera, the former paramedic who'd lost his NHS license under circumstances the records couldn't quite explain. Unauthorized treatments for unusual patients, the file had said. The file hadn't specified what made the patients unusual. Harlow reached the rooftop as Herrera crossed it, his
footsteps
sure despite the slick surface. Rain plastered his short curly hair to his skull. He jumped across a gap to the next building, landing in a crouch. She followed without hesitation, her sharp jaw set. The gap was wider than it looked—for a moment she was airborne, rain in her face, the street four stories below—then her boots hit concrete and she rolled forward, coming up running. "Herrera!" She was close enough now to see the panic in his warm brown eyes when he looked back. "I just want to talk!" He shook his head, water flying from his hair, and disappeared through a rooftop access door. Harlow reached it seconds later, finding a stairwell that plunged into darkness. She pulled her torch from her belt and descended, the beam cutting through shadow. Her breathing
echoed
off concrete walls. Three flights down. Four. Five. The building hadn't looked this tall from the outside. The stairs ended in a basement that smelled of mildew and something else—something sharp and mineral, like blood and copper and stone. A single bulb swung from the ceiling, casting moving shadows. There was no sign of Herrera, but a metal door stood ajar on the far wall, revealing more stairs descending into blackness. Harlow's hand moved instinctively to her radio. She should call for backup. Protocol demanded it. But backup would ask questions she couldn't answer. Why was she pursuing a former paramedic? What evidence did she have of criminal activity? How could she
explain that she'd
been obsessing over this case for three years, that she dreamed of Morris's body in that warehouse, that she knew—knew—something impossible had killed him? She couldn't. So she didn't reach for the radio. The stairs beyond the door were older, carved from stone rather than poured concrete. Water trickled along one wall, following grooves that looked
deliberately
cut. Harlow descended
slowly
now, torch in one hand, the other resting on her belt near her cuffs. The air grew colder. The mineral smell intensified. Voices drifted up from below. Many voices, speaking in languages she didn't recognize. Light
flickered
—not electric, but the warm yellow of flames. The stairs opened into a tunnel lined with exposed brick, and Harlow pressed herself against the wall as she moved forward. Her
watch
felt heavy on her wrist, its familiar
weight
grounding her. The tunnel curved, and
suddenly
she could see into a vast space that made her breath catch. The Tube station shouldn't have existed. Camden's underground layout
was mapped
, documented, public record. There was no abandoned station here. Yet here it was: a platform stretching into shadows, lined with stalls and vendors and people—so many people, all of them wrong in ways she couldn't quite articulate. A woman with eyes that reflected light like a cat's. A man whose skin
seemed to
shift between textures, scales to skin to something else. Goods displayed under gaslight: bottles filled with luminescent liquid, books bound in leather that looked disturbingly organic, weapons that hummed with energy she could feel from here. Harlow's hand tightened on her torch. This wasn't possible. This couldn't be real. But there, threading through the crowd with quick steps, was Herrera. He pulled something from his pocket—something pale that caught the light—and pressed it into the palm of a vendor at the entrance. The vendor nodded, stepping aside. A bone token, Harlow realized without knowing how she knew. This was the decision point. She could retreat now, go back up those impossible stairs, return to her flat and her files and pretend she hadn't seen this. Pretend the world was still the size and shape she understood. Or she could step forward into this underground market, this impossible place, and follow the thread that connected to Morris's death. Her partner's face flashed through her mind. Morris, who'd always trusted her instincts. Morris, who'd walked into that warehouse three years ago because she'd asked him to, because she'd insisted they investigate one more lead. Morris, whose body she'd found torn apart in ways that no human could manage. Harlow adjusted her grip on the torch and stepped out of the tunnel, into the market's strange light. The crowd didn't part for her, but no one stopped her either. She kept her shoulders square, her bearing military-precise, projecting confidence she didn't feel. Ahead, Herrera disappeared between two stalls. Harlow followed, descending deeper into the impossible, chasing answers she wasn't sure she wanted to find.
created_at
"2026-02-17T02:49:07.937103+00:00"
2/17/2026, 2:49:07 AM
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2066
moderation_latency
158
generation_time
33175
tokens_prompt
914
tokens_completion
1709
native_tokens_prompt
917
native_tokens_completion
1513
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.025446
router
(null)
provider_responses
0
id
"cc32f899-8d4c-4288-b2e4-049f3ec2a020"
status
200
is_byok
false
latency
2061
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1771296513-eqzjyRTcz2vrd1lg0GiE"
upstream_id
"cc32f899-8d4c-4288-b2e4-049f3ec2a020"
total_cost
0.025446
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
0
adverbTagCount
0
adverbTags
(empty)
63.4703%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1095
totalAiIsmAdverbs
4
found
0
adverb
"deliberately"
count
1
1
adverb
"slightly"
count
1
2
adverb
"slowly"
count
1
3
adverb
"suddenly"
count
1
highlights
0
"deliberately"
1
"slightly"
2
"slowly"
3
"suddenly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
54.3379%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1095
totalAiIsms
5
found
0
word
"flickered"
count
1
1
word
"weight"
count
2
2
word
"footsteps"
count
1
3
word
"echoed"
count
1
highlights
0
"flickered"
1
"weight"
2
"footsteps"
3
"echoed"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
87
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
1
narrationSentences
87
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
88
matches
0
"explain that she'd"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
24
mean
45.08
std
27.36
cv
0.607
sampleLengths
0
46
1
30
2
84
3
6
4
92
5
73
6
77
7
24
8
62
9
54
10
68
11
9
12
50
13
55
14
17
15
100
16
13
17
38
18
10
19
60
20
35
21
14
22
43
23
22
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
2
totalSentences
87
matches
0
"been trained"
1
"was mapped"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
188
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1087
adjectiveStacks
0
stackExamples
(empty)
adverbCount
21
adverbRatio
0.019319227230910764
lyAdverbCount
8
lyAdverbRatio
0.007359705611775529
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
88
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
88
mean
12.3
std
7.36
cv
0.599
sampleLengths
0
26
1
20
2
11
3
19
4
3
5
18
6
17
7
16
8
30
9
6
10
11
11
12
12
17
13
17
14
13
15
6
16
8
17
8
18
27
19
10
20
18
21
9
22
9
23
15
24
9
25
13
26
8
27
32
28
19
29
5
30
16
31
12
32
14
33
6
34
3
35
1
36
1
37
9
38
22
39
10
40
22
41
7
42
5
43
3
44
8
45
7
46
8
47
30
48
2
49
7
54.5455%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
12
diversityRatio
0.4090909090909091
totalSentences
88
uniqueOpeners
36
100.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
0
fancyCount
0
fancyTags
(empty)
92.4914%