NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMR0V403CV9QPDVDQQ1G49Z
Run Details
Model:
MoonshotAI: Kimi K2.5
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
21.1s
Cost:
$0.008078
Provider:
BaseTen
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The stink of ozone and old earth hit Harlow Quinn three steps down the abandoned stairwell, well before her torch beam cut through the permanent midnight of the Camden tube station. She kept her Glock holstered but unclipped, left wrist rotating just enough to feel the
weight
of her worn leather
watch
settle—a habit from her army days, a grounding mechanism. Eighteen years on the force had taught her that crime scenes underground carried their own gravity, but this one felt wrong in a way that prickled the skin beneath her close-cropped salt-and-pepper hair. She ducked under the police tape strung across the platform’s mouth. The usual Market detritus littered the tiles—crushed bone tokens, scraps of
silk
that had wrapped god-knows-what, the chemical sweetness of spilled tinctures. The Veil Market had cleared out fast when the body dropped, leaving behind their trash and their secrets. “Detective.” Quinn turned, her sharp jaw casting a shadow across her collar. Eva Kowalski stood near the platform’s edge, her round glasses catching the blue glow of the forensic lamps. The girl—twenty-six, five-foot-four, freckles like scattered copper—shouldn’t have been past the cordon, but Quinn had learned not to underestimate civilians who carried worn leather satchels bulging with restricted archives material. Eva tucked a curl of red hair behind her left ear, a nervous tic that Quinn had catalogued during their three prior encounters, all of them involving cases that made the Metropolitan Police incident reports read like fiction. “You touch anything?” Quinn asked. “Only the atmosphere.” Eva’s voice carried the precise diction of an Oxford education, but her tone had gone flat, the way people spoke when their brains were busy reconciling impossible data. “He’s positioned, Harlow. Not fallen. Placed.” Quinn stepped closer to the body. Male, mid-thirties, athletic build now stiffening with rigor. He lay supine on the grimy tiles, arms spread wide, legs together—a geometric posture that suggested crucifixion without the cross. His clothing was unremarkable: canvas trousers, wool coat, sturdy boots. But Quinn’s brown
eyes narrowed
, her military precision clicking through the observations like a slide rule. His boots were clean. Not polished—clean. The soles showed no blackening from the station’s century of coal dust and brake grease. No scuff marks from
being dragged
. The heels were
pristine
, as if he’d levitated into position or
been carried
by hands careful enough to avoid scraping leather against tile. “No drag patterns,” Quinn murmured. She crouched, her left hand resting on her knee, the leather
watch
strap creaking
softly
. “No footprints leading in. Just… him.” Eva knelt opposite, her satchel thumping against her hip. “Look at his hands.” Quinn already had. The palms
were blistered
, the skin peeling back in symmetrical patterns that looked almost deliberate—alchemical burns, the kind that came from handling volatile substances without proper warding. But the symmetry was wrong. In a struggle, in an accident, burns were chaotic. These
were mirrored
, as if he’d pressed his hands against something that required exact placement. “He’s not a victim,” Eva said quietly. “He’s a seal.” Quinn’s sharp jaw tightened. She didn’t believe in seals, not the kind Eva meant. She believed in evidence, in the
weight
of her
watch
ticking against her wrist, in the eighteen years of cases that had logical explanations if you looked hard enough. But DS Morris had died three years ago in a warehouse that smelled like this—ozone and old earth—and the coroner’s report had listed the cause as “cardiac arrest” while the crime scene photos showed walls scorched with patterns that looked exactly like the burns on this man’s hands. Quinn reached into her coat pocket and withdrew an evidence bag. Inside, cradled in sterile plastic, lay a small brass compass. She’d found it clutched in the man’s right hand, the metal still warm when she’d pried it free. The casing bore a patina of verdigris, ancient and green, but the face
was etched
with sigils that
seemed to
shift when viewed peripherally. The needle didn’t point north. It pointed toward the bricked-up archway at the platform’s terminus, the one that led nowhere on any Transport for London map. “He was holding this,” Quinn said, her voice low. “It’s pointing at the wall.” Eva’s green
eyes widened
behind her glasses. She reached out, stopped herself before touching the bag. “That’s a Veil Compass. Crafted by Shade artisans. It doesn’t find
magnetic
north—it finds the nearest rift. The nearest tear.” Quinn looked at the wall. Victorian brickwork, black with moisture, seemingly solid. But the compass needle
trembled
there, insistent, as if pulled by a force that ignored physics. She thought of Morris again, of the way his body had
been positioned
in that warehouse, arms spread, legs together, a geometric posture that the investigating officers had dismissed as the convulsions of a dying heart. But Morris had been trying to close something. Just like this man. “He died sealing it,” Quinn said. The realization settled in her chest, heavy and cold. “The burns are from the Compass. He was holding the rift shut.” Eva nodded, her fingers finding her hair again, tucking that red curl behind her ear. “And if he died mid-ritual…” “It’s not sealed.” Quinn stood, her military bearing rigid, her hand resting on the grip of her Glock. The compass needle in the evidence bag twitched, pointing still at the brickwork, and Quinn understood with a clarity that transcended her usual logic that the wall wasn’t a wall anymore. It was a door, and something
was pushing
from the other side. She looked at Eva, at the satchel full of archives and the freckles standing out against her pale skin, and made a decision that would have horrified her superiors but would have made Morris proud. “Step back,” Quinn ordered, her voice carrying the snap of command. “And tell me how to finish what he started.” The brickwork groaned. The compass needle spun wildly. And Detective Harlow Quinn, who had spent eighteen years believing only in what she could see and touch and file in evidence bags, drew her weapon and prepared to fight a war she was only just beginning to understand.
created_at
"2026-02-17T02:49:31.354543+00:00"
2/17/2026, 2:49:31 AM
model
"moonshotai/kimi-k2.5-0127"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
204
moderation_latency
(null)
generation_time
20946
tokens_prompt
923
tokens_completion
3049
native_tokens_prompt
848
native_tokens_completion
2523
native_tokens_completion_images
(null)
native_tokens_reasoning
1383
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0080778
router
(null)
provider_responses
0
id
"chatcmpl-b27ab2fd9095433e9ab45e9bf7c696b5"
status
200
is_byok
false
latency
204
endpoint_id
"6b68e10a-44e0-4815-9b96-52f34335f83d"
provider_name
"BaseTen"
model_permaslug
"moonshotai/kimi-k2.5-0127"
api_type
"completions"
id
"gen-1771296550-tZc3L5mgAx5QLug6fbbQ"
upstream_id
"chatcmpl-b27ab2fd9095433e9ab45e9bf7c696b5"
total_cost
0.0080778
cache_discount
(null)
upstream_inference_cost
0
provider_name
"BaseTen"
Evaluation details
Result
Evaluator
Details
Meta Data
77.7778%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
6
adverbTagCount
1
adverbTags
0
"Eva said quietly [quietly]"
90.3846%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1040
totalAiIsmAdverbs
1
found
0
adverb
"softly"
count
1
highlights
0
"softly"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
32.6923%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1040
totalAiIsms
7
found
0
word
"trembled"
count
1
1
word
"silk"
count
1
2
word
"pristine"
count
1
3
word
"magnetic"
count
1
4
word
"weight"
count
2
5
word
"etched"
count
1
highlights
0
"trembled"
1
"silk"
2
"pristine"
3
"magnetic"
4
"weight"
5
"etched"
66.6667%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
2
maxInWindow
2
found
0
label
"eyes widened/narrowed"
count
2
highlights
0
"eyes narrowed"
1
"eyes widened"
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
61
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
3
hedgeCount
1
narrationSentences
61
filterMatches
0
"watch"
hedgeMatches
0
"seemed to"
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
0
totalSentences
71
matches
(empty)
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
25
mean
40.76
std
29.31
cv
0.719
sampleLengths
0
94
1
51
2
1
3
97
4
5
5
37
6
60
7
4
8
46
9
26
10
13
11
59
12
10
13
91
14
89
15
14
16
36
17
64
18
12
19
27
20
20
21
61
22
35
23
20
24
47
67.7596%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
6
totalSentences
61
matches
0
"being dragged"
1
"been carried"
2
"were blistered"
3
"were mirrored"
4
"was etched"
5
"been positioned"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
1
totalVerbs
156
matches
0
"was pushing"
100.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
939
adjectiveStacks
0
stackExamples
(empty)
adverbCount
24
adverbRatio
0.025559105431309903
lyAdverbCount
8
lyAdverbRatio
0.008519701810436636
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
71
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
71
mean
14.35
std
10.85
cv
0.756
sampleLengths
0
31
1
30
2
33
3
11
4
22
5
18
6
1
7
11
8
18
9
30
10
38
11
5
12
31
13
6
14
6
15
8
16
20
17
10
18
16
19
4
20
2
21
15
22
6
23
23
24
5
25
15
26
6
27
9
28
4
29
3
30
27
31
5
32
9
33
15
34
7
35
3
36
4
37
10
38
29
39
48
40
11
41
10
42
18
43
24
44
5
45
21
46
9
47
5
48
7
49
9
47.8873%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
6
diversityRatio
0.3380281690140845
totalSentences
71
uniqueOpeners
24
50.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
6
fancyCount
3
fancyTags
0
"Quinn murmured (murmur)"
1
"Eva said quietly (say quietly)"
2
"Quinn ordered, her voice carrying the snap of command (order, carry)"
84.3040%