NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMMGNVPVEZ5PZXS27KZ8Y0T
Run Details
Model:
Hermes 3 405B
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
58.8s
Cost:
$0.002010
Provider:
DeepInfra
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Detective Harlow Quinn descended the worn concrete steps into the abandoned Tube station, her sharp eyes scanning the shadows. The air smelled of stale urine and decay. She ducked under a sagging chain barrier,
footsteps
echoing
in the dim light cast by wan bulbs. Ahead, a mutilated corpse lay sprawled in a pool of congealing blood. Harlow crouched beside it, leather
watch
creaking on her wrist. Deep slashes criss-crossed the torso, exposing
glistening
ribs and intestines. Neat slices stippled the waxy skin. She sucked in a breath between clenched teeth. Ritualistic. Precise. Nothing like a common street murder. "This makes the third one, Quinn. Each full moon, like clockwork. I'm telling you, it has to be the Veil Market." Harlow glanced up at the lanky form of her sergeant. "Spare me the fairytales,
Evans
. We deal in facts, not flights of fancy." She rose, surveying the vaulted ceiling and distant, tiled platform. "Have the scene techs found any trace evidence? Security camera footage?"
Evans
sighed, flipping open his notebook. "Nothing yet. No signs of struggle, no murder weapon, no witnesses. He's a ghost." Harlow's jaw tightened. "No one's that clean. There's always a thread to tug." She studied the occult tattoos inked into the victim's chest. "Zellman. You're familiar with London's underground. What are we looking at here?" The detective in question knelt, gloved fingers hovering over the ink. "Dark alchemy, by the look of it. Sacrifice. The killer's drawing power from the ritual. Something's off though..." Zellman leaned closer, brow
furrowed
. "This glyph here. That's not blood magic, it's a portal sigil. To where, I couldn't say." Harlow's mind raced as she took in the grisly tableau, fitting the pieces together. "Rumors have been swirling about a hidden supernatural black market. Could he have crossed the wrong person there? Stumbled onto something he shouldn't have seen?" She fixed her steely gaze on
Evans
. "I want everything we have on the Veil Market, this supposed Shade compass, everything. Someone there knows something. We find them, we find our killer."
Evans
opened his mouth to protest but a clipped shout cut him off. "Detective Quinn! We've got a witness over here. Says she saw a woman fleeing the scene just before we arrived. Red hair, carrying a satchel..." Harlow spun on her heel, hand instinctively dropping to the service weapon at her hip. "Where?" The uniformed officer led them to a woman huddled against the far wall, knuckles white around the strap of a worn leather bag. Curly red hair framed a pale, freckled face, eyes wide with fear behind round glasses. She flinched as they approached. "Miss, I'm Detective Quinn. Can you tell me what you saw here tonight?" Harlow asked, pitching her voice low and calm. The redhead swallowed hard. "I... I was on my way home from the British Museum. I took a wrong turn and... and I saw her. She had a knife. There was so much blood..." She squeezed her eyes shut, a tear sliding down her cheek. "She must have heard me. She looked right at me. I ran. I'm sorry, I should have done something..." "You did the right thing." Harlow made a subtle gesture and
Evans
stepped back to give them space. She crouched, meeting the woman's gaze. "I know you're frightened, but I need you to tell me everything you remember. Every detail counts." The woman took a shaky breath and nodded. "She was dressed all in black. Gloves, a hood. But her face... she had the most striking green eyes. They almost
seemed to
... glow. Like a cat's. She moved so fast, too fast to be natural..." Harlow's own
eyes narrowed
. Inhuman speed, glowing eyes, ritualistic murder... this was no ordinary crime of passion. The Market looked more and more like their best lead. But she kept her tone neutral as she nodded along with the witness's story. "You've been incredibly helpful, Miss...?" "Kowalski. Eva." She shouldered her satchel, straightening. "I work in the museum's restricted archives. A lot of the texts reference powerful supernatural relics. I've heard whispers about the Veil Market." A pause, then she
seemed to
come to a decision. "A package arrived there last month. A compass rumored to locate rifts between worlds. I may be able to get you more information, if you think it could help." Harlow rose, offering Eva a hand up. "At this point, we'll take any lead we can get. The Market moves each full moon, correct? That doesn't give us much time." She glanced at the body, then back to Eva's determined face. "You're sure you want to get involved? This is dangerous business." Eva shrugged, adjusting her glasses. "A man died tonight. I can't
let that be
for nothing. I'll do whatever I can to help find who did this." Harlow held her gaze a beat, then gave a curt nod. "We appreciate your cooperation. Grab your things and meet us at the car. I'm not letting you out of my sight until that Market closes tonight. You're our ticket in."
created_at
"2026-02-17T01:48:53.66927+00:00"
2/17/2026, 1:48:53 AM
model
"nousresearch/hermes-3-llama-3.1-405b"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
464
moderation_latency
(null)
generation_time
58721
tokens_prompt
958
tokens_completion
1268
native_tokens_prompt
849
native_tokens_completion
1161
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00201
router
(null)
provider_responses
0
id
"cmpl-0248f895dd044fe7a0c9227ebae104be"
status
200
is_byok
false
latency
79
endpoint_id
"3280e8f8-0960-400c-8284-2e03a0bcb446"
provider_name
"DeepInfra"
model_permaslug
"nousresearch/hermes-3-llama-3.1-405b"
api_type
"completions"
id
"gen-1771292874-2ESFAzffrpe2vr3UtXlx"
upstream_id
"cmpl-0248f895dd044fe7a0c9227ebae104be"
total_cost
0.00201
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
1
adverbTagCount
0
adverbTags
(empty)
100.0000%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
842
totalAiIsmAdverbs
0
found
(empty)
highlights
(empty)
80.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
0
"Evans"
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
52.4941%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
842
totalAiIsms
4
found
0
word
"glistening"
count
1
1
word
"furrowed"
count
1
2
word
"footsteps"
count
1
3
word
"echoing"
count
1
highlights
0
"glistening"
1
"furrowed"
2
"footsteps"
3
"echoing"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
1
maxInWindow
1
found
0
label
"eyes widened/narrowed"
count
1
highlights
0
"eyes narrowed"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
42
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
1
narrationSentences
42
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
58
matches
0
"let that be"
61.5787%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
20
mean
42.05
std
15.37
cv
0.366
sampleLengths
0
44
1
54
2
21
3
44
4
20
5
35
6
50
7
71
8
38
9
16
10
43
11
21
12
64
13
41
14
44
15
46
16
69
17
52
18
27
19
41
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
0
totalSentences
42
matches
(empty)
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
0
totalVerbs
82
matches
(empty)
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
386
adjectiveStacks
0
stackExamples
(empty)
adverbCount
9
adverbRatio
0.023316062176165803
lyAdverbCount
4
lyAdverbRatio
0.010362694300518135
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
58
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
58
mean
14.5
std
9.55
cv
0.659
sampleLengths
0
19
1
8
2
17
3
12
4
10
5
10
6
6
7
8
8
1
9
1
10
6
11
21
12
10
13
23
14
11
15
6
16
14
17
3
18
20
19
12
20
11
21
23
22
16
23
14
24
32
25
25
26
13
27
25
28
15
29
1
30
23
31
15
32
5
33
21
34
4
35
41
36
19
37
18
38
6
39
17
40
8
41
36
42
4
43
6
44
7
45
10
46
14
47
5
48
7
49
33
86.2069%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
0
diversityRatio
0.5172413793103449
totalSentences
58
uniqueOpeners
30
100.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
1
fancyCount
0
fancyTags
(empty)
92.9576%