NC Bench
Run: 2026-02-17-01KHMQR8QE9AWVA7KZM16ZNCG0
Run Details
Model: Claude Sonnet 4.5
Test: Bad Writing Habits
Scenario: Mystery: examining a crime scene
Duration: 32.0s
Cost: $0.026466
Provider: Amazon Bedrock
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex>
  <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist">
    <eyeColor>brown</eyeColor>
    <hair>closely cropped salt-and-pepper</hair>
    <height>5'9"</height>
    <age>41</age>
    <jawline>sharp jaw</jawline>
    <watch>worn leather watch on her left wrist</watch>
    <bearing>military precision</bearing>
    <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description>
    <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory>
  </character>
  <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend">
    <eyeColor>green</eyeColor>
    <hair>curly red</hair>
    <height>5'4"</height>
    <age>26</age>
    <glasses>round glasses</glasses>
    <satchel>worn leather satchel full of books</satchel>
    <freckles>freckled complexion</freckles>
    <nervousHabit>tucking hair behind her left ear</nervousHabit>
    <description>Aurora's childhood friend and an occult researcher.</description>
    <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory>
  </character>
  <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth">
    <currentLocation>abandoned Tube station beneath Camden</currentLocation>
    <entryRequirement>bone token</entryRequirement>
    <schedule>moves locations every full moon</schedule>
    <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description>
  </location>
  <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market">
    <casingMaterial>small brass compass</casingMaterial>
    <faceMarkings>face etched with protective sigils</faceMarkings>
    <needle>points toward the nearest supernatural rift or portal</needle>
    <patina>casing has a patina of verdigris</patina>
    <creator>crafted by a Shade artisan</creator>
    <description>A compass attuned to supernatural energy.</description>
  </item>
</codex>
Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The stench hit Quinn before she ducked under the police tape—Thames water, rust, and something acrid that made her sinuses burn. The abandoned station platform stretched into darkness beyond the crime scene floods, its tiled walls streaked with a century of grime and something that glistened wetly in the harsh light.
DS Patel looked up from his notebook as Quinn's boots crunched across scattered debris. "Ma'am. Victim's a John Doe, mid-thirties, no identification. Night shift worker from the construction site above called it in around oh-four-hundred."
Quinn nodded, her gaze already sweeping the scene with the methodical precision that had become second nature over eighteen years. The body lay sprawled near the platform edge, one arm flung out toward the tracks. Male, dressed in expensive wool coat and leather shoes that didn't match the location. No one dressed like that wandered into sealed Underground stations by accident.
"Cause of death?"
"That's where it gets strange." Patel shifted his weight, and Quinn caught the uncomfortable tightness around his eyes. "No visible wounds. No signs of struggle. Forensics is thinking cardiac arrest, but..." He gestured at the platform. "Look at the burn pattern."
Quinn crouched beside the body, careful not to disturb the evidence markers. The concrete around the victim showed a perfect circle of scorching, black char marks radiating outward like a frozen explosion. She'd seen electrical burns before—they didn't look like this. The pattern was too precise, too deliberate. The edges were sharp as if drawn with a compass.
She touched the worn leather of her watch, grounding herself in its familiar weight. Three years ago, she would have accepted cardiac arrest and moved on. Three years ago, she hadn't found Morris's body in that warehouse with the walls covered in symbols she still couldn't explain.
"Cameras?"
"None functional down here. Station's been closed since 1955." Patel flipped through his notes. "But here's the thing—we found traces of footprints leading toward the northwest tunnel. Recent, maybe hours old. Multiple individuals."
Quinn stood, following the direction Patel indicated. The tunnel entrance yawned beyond the police tape, swallowing the flood lights. Something glinted near the track bed, and she moved closer, pulling her torch from her belt.
A small brass compass lay half-buried in decades of dust and refuse. She photographed it in place, then lifted it with gloved fingers. The casing showed heavy wear, verdigris eating at the edges, but the face was etched with markings she didn't recognize—geometric patterns, symbols that hurt to look at too long. The needle spun lazily, ignoring magnetic north to point insistently toward the northwest tunnel.
"Patel. Did forensics catalog this?"
"What've you got?"
She held it up, watching his expression shift from curiosity to confusion as the needle continued its unnatural rotation. "Compass. But it's not working properly."
"Could be broken." Patel's tone suggested he was trying to convince himself.
Quinn turned the compass over. No maker's mark. No serial number. Just smooth brass and those unsettling etchings that seemed to shift in her peripheral vision. She thought of the files locked in her desk drawer at the station—the Morris case, the unexplained deaths over the past three years that shared certain characteristics. Bodies in impossible locations. Evidence that contradicted itself. Witnesses who changed their stories or disappeared entirely.
"The victim's pockets. What did we find?"
Patel consulted his notes. "Wallet—expensive, but empty of cash and cards. Some kind of carved bone token, approximately two inches long. Keys. And..." He paused. "Ticket stub from the British Museum's restricted archives. Dated yesterday."
The British Museum's restricted archives. Quinn's jaw tightened. She'd been building a file on three persons of interest, and one of them worked there. Eva Kowalski, research assistant. Friend of Aurora Sinclair, who kept appearing at the periphery of her investigations like a shadow Quinn couldn't quite catch.
"The bone token. Let me see it."
Patel retrieved the evidence bag from the forensics kit. The token was smooth, yellowed with age, carved with a symbol that matched nothing in Quinn's mental database of gang signs or criminal markings. But it matched the style of those etchings on the compass.
She walked the perimeter of the burn marks, torch beam cutting through the gloom. The scorching was consistent all the way around—no variation in depth or intensity. That required specialized equipment or chemicals. Or something else, something she'd been trained her entire career to dismiss as impossible.
Near the northwest edge of the pattern, she found them—partial boot prints in the dust, and smaller shoe prints. Women's, size seven or eight. The treads led away from the body, toward the tunnel where the compass insisted on pointing.
"Patel, I want every security camera in a three-block radius pulled. Traffic cameras, shop fronts, anything that might have caught someone entering or leaving this area between midnight and oh-four-hundred."
"You think it's murder?"
Quinn straightened, her sharp features set in the expression that had earned her a reputation for being relentless. "I think someone went to considerable effort to make this look like natural causes while leaving evidence that contradicts that conclusion. The burn pattern alone suggests accelerants or equipment we haven't identified. The location means the victim either came here voluntarily or was brought here post-mortem."
"Or he was chased," Patel offered.
Quinn considered this. The boot prints did suggest haste. But they led away from the body, not toward it. Multiple individuals, moving with purpose through a sealed Underground station that shouldn't be accessible. Carrying bone tokens and strange compasses that pointed at things that shouldn't exist.
She thought of Eva Kowalski's curly red hair and round glasses, captured on CCTV near three separate incidents over the past six months. Always present, never quite close enough to question. Always with an explanation that fell just short of believable.
The compass needle swung in Quinn's palm, pulling insistently toward the darkness. Toward whatever market or meeting place or ritual site these people were using in the forgotten spaces beneath London. She'd spent three years piecing together fragments of impossible evidence, trying to understand what had killed her partner. This compass, this body, this pattern—they were pieces of the same puzzle.
"Get me a full workup on that bone token," Quinn said, bagging the compass carefully. "Cross-reference with any similar items logged in evidence over the past five years. And Patel—" She met his eyes. "This stays between us until we know what we're dealing with. Understood?"
Patel nodded slowly, his gaze drifting to the burn marks, the compass, the dark tunnel entrance. "Ma'am, what do you think we're dealing with?"
Quinn adjusted her watch, feeling the familiar weight anchor her. "I don't know yet. But I intend to find out."
created_at: "2026-02-17T02:45:01.376439+00:00" (2/17/2026, 2:45:01 AM)
model: "anthropic/claude-4.5-sonnet-20250929"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2663
moderation_latency: 193
generation_time: 31735
tokens_prompt: 923
tokens_completion: 1846
native_tokens_prompt: 937
native_tokens_completion: 1577
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.026466
router: (null)
provider_responses:
  [0]
    id: "65ddaca2-e82b-40a1-a39e-8cdf6827749d"
    status: 200
    is_byok: false
    latency: 2663
    endpoint_id: "d8bee848-f020-454a-b2be-4a7ce868f99f"
    provider_name: "Amazon Bedrock"
    model_permaslug: "anthropic/claude-4.5-sonnet-20250929"
api_type: "completions"
id: "gen-1771296269-3o7Ln17xDcpFpvXe9FWY"
upstream_id: "65ddaca2-e82b-40a1-a39e-8cdf6827749d"
total_cost: 0.026466
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
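The reported total_cost of 0.026466 can be sanity-checked against the native token counts (937 prompt, 1577 completion). The sketch below is only an illustration: the per-token prices ($3 per million input tokens, $15 per million output tokens) are an assumption and are not stated anywhere in this run's metadata, and the function name is hypothetical.

```python
# Assumed list prices in USD per million tokens; NOT taken from the run metadata.
INPUT_PRICE_PER_MTOK = 3.00
OUTPUT_PRICE_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    total = (prompt_tokens * INPUT_PRICE_PER_MTOK
             + completion_tokens * OUTPUT_PRICE_PER_MTOK)
    return total / 1_000_000


# native_tokens_prompt and native_tokens_completion from the metadata above
cost = estimate_cost(937, 1577)
print(f"{cost:.6f}")
```

Under those assumed prices the estimate agrees with the reported figure, which suggests billing is based on the native token counts rather than tokens_prompt and tokens_completion.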
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags: 2
adverbTagCount: 1
adverbTags: ["Quinn said, bagging the compass carefully [carefully]"]
73.5216%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount: 1133
totalAiIsmAdverbs: 3
found:
  [0] adverb: "carefully", count: 1
  [1] adverb: "lazily", count: 1
  [2] adverb: "slowly", count: 1
highlights: ["carefully", "lazily", "slowly"]
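The 73.5216% result for this evaluator is consistent with a score that falls linearly from 100% to 0% as the hit rate approaches the 1% target. That formula is inferred from the reported numbers (3 hits in 1133 words), not documented on this page, so the sketch below is an assumption and the function name is hypothetical.

```python
def linear_threshold_score(hits: int, word_count: int, limit: float = 0.01) -> float:
    """Score in percent, falling linearly from 100 to 0 as hits/word_count nears limit.

    Inferred scoring rule (an assumption), clamped at 0 for rates above the limit.
    """
    rate = hits / word_count
    return max(0.0, 100.0 * (1.0 - rate / limit))


# totalAiIsmAdverbs = 3, wordCount = 1133
adverb_score = linear_threshold_score(3, 1133)
print(f"{adverb_score:.4f}%")
```

The same rule with 6 hits in 1133 words gives the figure reported for the AI-ism word frequency evaluator, which is some evidence that the inference holds for both.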
80.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions: (empty)
found: ["Patel"]
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions: (empty)
found: (empty)
47.0432%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount: 1133
totalAiIsms: 6
found:
  [0] word: "methodical", count: 1
  [1] word: "magnetic", count: 1
  [2] word: "weight", count: 3
  [3] word: "etched", count: 1
highlights: ["methodical", "magnetic", "weight", "etched"]
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches: 0
maxInWindow: 0
found: (empty)
highlights: (empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells: 0
narrationSentences: 70
matches: (empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount: 2
hedgeCount: 1
narrationSentences: 70
filterMatches: ["watch"]
hedgeMatches: ["seemed to"]
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount: 1
totalSentences: 87
matches: ["contradicts that conclusion"]
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs: 33
mean: 33.79
std: 20.89
cv: 0.618
sampleLengths: [51, 35, 61, 3, 41, 58, 47, 1, 33, 35, 66, 5, 3, 25, 12, 69, 7, 35, 48, 7, 44, 47, 40, 30, 4, 64, 6, 46, 41, 61, 46, 24, 20]
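The coefficient of variation (CV) reported for paragraph length is the standard deviation divided by the mean. The sketch below recomputes it from the 33 sample lengths listed for this evaluator. It assumes the population standard deviation (dividing by n, not n − 1); that choice is an inference, made because it reproduces the reported std of 20.89.

```python
from statistics import mean, pstdev

# Paragraph word counts as listed under sampleLengths for this evaluator
paragraph_lengths = [
    51, 35, 61, 3, 41, 58, 47, 1, 33, 35, 66,
    5, 3, 25, 12, 69, 7, 35, 48, 7, 44, 47,
    40, 30, 4, 64, 6, 46, 41, 61, 46, 24, 20,
]

avg = mean(paragraph_lengths)
std = pstdev(paragraph_lengths)  # population standard deviation (assumed)
cv = std / avg                   # coefficient of variation

print(round(avg, 2), round(std, 2), round(cv, 3))
meets_target = cv >= 0.5         # target stated for this evaluator
```

A higher CV means paragraph lengths vary more relative to their average, so one-word paragraphs such as "Cameras?" next to 60-word paragraphs push the value up.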
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount: 2
totalSentences: 70
matches: ["was etched", "been trained"]
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount: 2
totalVerbs: 141
matches: ["was trying", "were using"]
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount: 849
adjectiveStacks: 0
stackExamples: (empty)
adverbCount: 25
adverbRatio: 0.02944640753828033
lyAdverbCount: 8
lyAdverbRatio: 0.009422850412249705
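The two ratios reported for this evaluator are plain quotients of the counts over the word count. As a minimal check, assuming the targets are applied as strict upper bounds (the variable names are illustrative):

```python
import math

word_count = 849       # wordCount reported for this evaluator
adverb_count = 25      # adverbCount
ly_adverb_count = 8    # lyAdverbCount

adverb_ratio = adverb_count / word_count
ly_adverb_ratio = ly_adverb_count / word_count

# Targets stated for this evaluator: <4% adverbs, <2% -ly adverbs
within_target = adverb_ratio < 0.04 and ly_adverb_ratio < 0.02

print(adverb_ratio, ly_adverb_ratio, within_target)
assert math.isclose(adverb_ratio, 0.02944640753828033)
```

The word count here (849) is lower than the 1133 used by the AI-ism evaluators; the page does not say why, though one possibility is that this evaluator counts narration only.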
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences: 87
echoCount: 0
echoWords: (empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences: 87
mean: 12.82
std: 7.63
cv: 0.595
sampleLengths: [21, 30, 14, 21, 20, 15, 14, 12, 3, 18, 18, 5, 12, 20, 9, 7, 10, 14, 12, 21, 1, 14, 19, 7, 12, 16, 12, 11, 29, 14, 5, 3, 19, 6, 12, 5, 3, 3, 15, 27, 4, 4, 8, 7, 4, 21, 10, 5, 3, 16]
69.7318%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats: 6
diversityRatio: 0.45977011494252873
totalSentences: 87
uniqueOpeners: 40
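The diversityRatio reported for this evaluator is the number of unique sentence openers divided by the total number of sentences. A short sketch of that arithmetic, with the 60% target applied as a simple threshold (how the percentage score itself is derived is not shown here, since the page does not document it):

```python
import math

unique_openers = 40    # uniqueOpeners
total_sentences = 87   # totalSentences

diversity_ratio = unique_openers / total_sentences
meets_target = diversity_ratio >= 0.60  # target: at least 60% unique openers

print(f"{diversity_ratio:.4f}", meets_target)
assert math.isclose(diversity_ratio, 0.45977011494252873)
```

At roughly 46%, the scene falls short of the target, which fits the many sentences that open with "Quinn", "Patel", "She", or "The".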
0.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags: 2
fancyCount: 2
fancyTags: ["Patel offered (offer)", "Quinn said, bagging the compass carefully (say, bag)"]
80.6057%
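The final 80.6057% figure matches the unweighted mean of the 17 per-evaluator results listed above. That aggregation rule is inferred from the numbers and is not stated on this page, so the sketch below is an assumption.

```python
from statistics import mean

# Per-evaluator results in page order (percent)
evaluator_scores = [
    0.0,       # Adverbs in dialogue tags
    73.5216,   # AI-ism adverb frequency
    80.0,      # AI-ism character names
    100.0,     # AI-ism location names
    47.0432,   # AI-ism word frequency
    100.0,     # Cliche density
    100.0,     # Emotion telling
    100.0,     # Filter word density
    100.0,     # Overuse of "that"
    100.0,     # Paragraph length variance
    100.0,     # Passive voice overuse
    100.0,     # Past progressive overuse
    100.0,     # Purple prose
    100.0,     # Repeated phrase echo
    100.0,     # Sentence length variance
    69.7318,   # Sentence opener variety
    0.0,       # Dialogue tag variety
]

overall = mean(evaluator_scores)  # unweighted mean (assumed aggregation)
print(f"{overall:.4f}%")
```

Because every evaluator carries equal weight under this reading, the two dialogue-tag evaluators, each based on only 2 tags, pull the total down by about 12 points between them.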