NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMXVRC3K8F9WAR2SPYB86PZ
Run Details
Model:
Claude Sonnet 4
Test:
Bad Writing Habits
Scenario:
Thriller: chase through city streets
Duration:
40.6s
Cost:
$0.026121
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support"> <eyeColor>warm brown</eyeColor> <hair>short curly dark brown</hair> <height>5'10"</height> <age>29</age> <skinTone>olive skin</skinTone> <scar>scar running along his left forearm from a knife attack</scar> <medallion>Saint Christopher medallion around his neck</medallion> <birthplace>Seville, Spain</birthplace> <description>A former paramedic who provides off-the-books medical care for the clique.</description> <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory> </character> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> </codex> Write a fully fleshed out scene of at least 1000 words in the thriller genre. A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Rain hammered the pavement with the persistence of a debt collector, turning the streets of Camden into a maze of shimmering reflections and dark corners. Detective Harlow Quinn pressed herself against the brick wall of a closed fishmonger's shop, her worn leather
watch
catching the occasional flash of neon from the late-night establishments still clinging to life. Water dripped from the brim of her coat's hood, each drop marking the seconds that had passed since she'd lost visual contact with her target. Tomás Herrera had been three blocks ahead when she'd first spotted him emerging from The Raven's Nest, that peculiar Soho bar where too many of her leads
seemed to
converge. His Saint Christopher medallion had caught the distinctive green glow of the bar's neon sign for just an instant before he'd vanished into the labyrinth of Camden's backstreets. Quinn had been tracking him for weeks—the former paramedic who'd lost his license under circumstances that the NHS refused to discuss in detail, who now
seemed to
appear at the periphery of every case that defied conventional explanation. The kind of case that reminded her of DS Morris and the night three years ago when everything she thought she understood about police work had crumbled like wet cardboard. Thunder rolled overhead, masking the sound of her
footsteps
as she moved from shadow to shadow. The rain had driven most pedestrians indoors, leaving only the desperate and the dangerous to
navigate
these streets. Quinn belonged to neither category, but she was beginning to wonder if that distinction mattered in whatever game Herrera was playing. She caught a glimpse of movement near the mouth of an alley that shouldn't exist—a narrow gap between two Victorian buildings that her mental map of the area insisted was solid wall. Herrera's silhouette
flickered
at its entrance, his olive skin pale in the intermittent streetlight, the scar along his left forearm visible as he pushed something aside. A gate, perhaps, or a door that had
been painted
to match the surrounding brick. Quinn's hand moved instinctively to the radio clipped to her belt, then hesitated. What would she tell dispatch? That she
was pursuing
a suspect into an alley that might not officially exist? That the former paramedic she'd been investigating had somehow opened a passage through solid masonry? The thought of explaining another impossible situation to her superiors made her jaw clench with familiar frustration. Morris would have understood. Morris had been with her that night when the rules of reality had proven more flexible than either of them had
been prepared
to accept. But Morris was gone, and Quinn was left to
navigate
these waters alone, her military precision the only anchor she had left. The alley swallowed Herrera's form
completely
. Quinn waited thirty seconds, counting each heartbeat, then followed. The passage was narrower than it had appeared from the street, forcing her to turn sideways in places where the walls
seemed to
press inward with deliberate malice. The brick was old, older than the buildings it connected, and covered with a film of moisture that felt too warm for rainwater. Symbols had been carved into the mortar at irregular intervals—geometric patterns that hurt to look at directly, as if they existed in more dimensions than her eyes were equipped to process. The alley terminated at a metal grate set into the ground, its bars bent outward as if something had forced its way through from below. The opening was just wide enough for a person to squeeze through, and recent scuff marks on the metal suggested frequent use. Quinn crouched beside it, her flashlight beam revealing a ladder descending into darkness that
seemed to
swallow the light before it could penetrate more than a few feet. The sound of voices drifted up from below—Herrera's accented English mixing with others she couldn't identify. The acoustics suggested a large space, possibly one of the abandoned Tube stations that honeycombed London's underground. She'd heard rumors of squatter communities in the forgotten corners of the transit system, but the voices below carried an undertone of commerce rather than desperation. Quinn gripped the twisted bars and lowered herself through the opening, her feet finding purchase on the ladder's rungs. The metal was slick with condensation, forcing her to move
slowly
to avoid losing her grip. With each step downward, the voices became clearer, revealing the unmistakable rhythms of negotiation and trade. The ladder ended at a platform that might have once served legitimate passengers. Tiles decorated with faded advertisements for long-forgotten products covered the walls, their cheerful messages rendered grotesque by decades of moisture damage and deliberate defacement. Emergency lighting had been strung along the ceiling, creating pools of harsh illumination separated by stretches of impenetrable shadow. But it was the market itself that stopped Quinn's breath in her throat. Stalls constructed from salvaged materials lined both sides of what had once been a passenger platform. Vendors hawked their wares from behind tables laden with objects that Quinn's rational mind refused to categorize—bottles filled with liquids that moved against gravity, mirrors that reflected spaces other than the one she occupied, books bound in materials that definitely weren't leather but might once have been skin. Herrera stood before one such stall, engaged in animated conversation with a vendor whose features
seemed to
shift whenever Quinn looked directly at them. The former paramedic held something small in his palm, offering it to the vendor for inspection. In the unsteady light, it looked like bone. A bone token, Quinn realized with the cold certainty that accompanies nightmares made manifest. The kind of currency that didn't exist in any official records, traded in markets that operated beyond the reach of conventional law enforcement. She pressed herself against the tiled wall, grateful for the shadows that concealed her presence. Around her, other transactions proceeded with the casual efficiency of any marketplace—if any legitimate marketplace dealt in the sort of merchandise that made her weapon
feel
inadequate and her badge seem like a quaint anachronism. This was the Veil Market. It had to be. Quinn had encountered references to it in case files that never
seemed to
lead anywhere, testimonies from witnesses who recanted their statements, evidence that disappeared from secured lockups. An underground economy that traded in impossibilities, accessible only to those who possessed the proper credentials. Credentials like the bone token Herrera was now exchanging for a small vial filled with liquid that pulsed with its own internal light. Quinn faced a choice that would define not just this investigation, but her entire understanding of the world she'd sworn to protect. She could retreat, climb back
up that ladder
and pretend she'd never found this place. File a report that would
carefully
omit the supernatural elements, stick to the facts that wouldn't destroy her credibility or her sanity. Or she could step forward into a marketplace that operated by rules she didn't understand, where her authority meant nothing and her weapon might prove worse than useless. Where answers to questions about Morris's death might finally be within reach, even if those answers came at a price she wasn't prepared to pay. The decision, when it came, felt like falling. Quinn stepped into the light.
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
0
adverbTagCount
0
adverbTags
(empty)
75.2066%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1210
totalAiIsmAdverbs
3
found
0
adverb
"carefully"
count
1
1
adverb
"completely"
count
1
2
adverb
"slowly"
count
1
highlights
0
"carefully"
1
"completely"
2
"slowly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
66.9421%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1210
totalAiIsms
4
found
0
word
"flickered"
count
1
1
word
"footsteps"
count
1
2
word
"navigate"
count
2
highlights
0
"flickered"
1
"footsteps"
2
"navigate"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
60
matches
(empty)
94.2029%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
2
hedgeCount
6
narrationSentences
60
filterMatches
0
"watch"
1
"feel"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
60
matches
0
"up that ladder"
93.1325%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
24
mean
50.08
std
23.84
cv
0.476
sampleLengths
0
82
1
96
2
30
3
55
4
73
5
64
6
51
7
15
8
82
9
75
10
59
11
51
12
56
13
13
14
64
15
48
16
37
17
50
18
53
19
23
20
59
21
53
22
8
23
5
88.8889%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
4
totalSentences
60
matches
0
"been painted"
1
"been prepared"
2
"was gone, was left"
3
"been carved were equipped"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
3
totalVerbs
201
matches
0
"was beginning was playing"
1
"was pursuing"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
1210
adjectiveStacks
0
stackExamples
(empty)
adverbCount
39
adverbRatio
0.032231404958677684
lyAdverbCount
13
lyAdverbRatio
0.010743801652892562
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
60
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
60
mean
20.03
std
8.93
cv
0.446
sampleLengths
0
25
1
32
2
25
3
30
4
28
5
38
6
30
7
16
8
18
9
21
10
32
11
26
12
15
13
13
14
5
15
14
16
15
17
17
18
4
19
25
20
22
21
6
22
9
23
28
24
23
25
31
26
25
27
22
28
28
29
16
30
17
31
26
32
19
33
16
34
16
35
13
36
24
37
19
38
13
39
16
40
48
41
24
42
16
43
8
44
14
45
23
46
15
47
35
48
5
49
4
75.0000%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
5
diversityRatio
0.5
totalSentences
60
uniqueOpeners
30
100.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
0
fancyCount
0
fancyTags
(empty)
93.7278%