NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHMQE6P3VJFYB5JHWQE3KQ2K
Run Details
Model:
Mistral Small Creative
Test:
Bad Writing Habits
Scenario:
Mystery: examining a crime scene
Duration:
9.3s
Cost:
$0.000549
Provider:
Mistral
Model input
System
You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue. You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex. Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.
User/Human
<codex> <character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist"> <eyeColor>brown</eyeColor> <hair>closely cropped salt-and-pepper</hair> <height>5'9"</height> <age>41</age> <jawline>sharp jaw</jawline> <watch>worn leather watch on her left wrist</watch> <bearing>military precision</bearing> <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description> <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory> </character> <character id="4414" name="Eva Kowalski" aliases="Eva, Evie" pronouns="she/her" species="Human" storyRole="Best Friend"> <eyeColor>green</eyeColor> <hair>curly red</hair> <height>5'4"</height> <age>26</age> <glasses>round glasses</glasses> <satchel>worn leather satchel full of books</satchel> <freckles>freckled complexion</freckles> <nervousHabit>tucking hair behind her left ear</nervousHabit> <description>Aurora's childhood friend and an occult researcher.</description> <backstory>Moved to London two years before Aurora and works as a research assistant at the British Museum's restricted archives. She holds a master's degree in Ancient History from Oxford.</backstory> </character> <location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth"> <currentLocation>abandoned Tube station beneath Camden</currentLocation> <entryRequirement>bone token</entryRequirement> <schedule>moves locations every full moon</schedule> <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description> </location> <item id="2004" name="Veil Compass" aliases="the Compass, shadow compass" type="Tool" origin="The Veil Market"> <casingMaterial>small brass compass</casingMaterial> <faceMarkings>face etched with protective sigils</faceMarkings> <needle>points toward the nearest supernatural rift or portal</needle> <patina>casing has a patina of verdigris</patina> <creator>crafted by a Shade artisan</creator> <description>A compass attuned to supernatural energy.</description> </item> </codex> Write a fully fleshed out scene of at least 1000 words in the mystery genre. A detective arrives at a crime scene in an unusual location. Something is wrong — the evidence doesn't add up. A colleague is present, offering a different interpretation. Through careful observation and deduction, the detective begins to see what others have missed. Use the codex entries above as your world reference for characters and setting. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The abandoned Tube station reeked of damp concrete and something older—something that clung to the back of Quinn’s throat like rust. Her torch cut a jagged path through the dark, illuminating the peeling posters of long-forgotten theatre productions, their edges curled like dead leaves. The air hummed, not with the usual underground vibration of trains, but with something quieter, more insistent. A low, resonant thrum that set her teeth on edge. She adjusted the strap of her shoulder holster, the leather creaking
softly
. "You’re sure this is the place?" Eva Kowalski didn’t look up from the notebook she
was scribbling
in, her round glasses catching the torchlight. "Positive. The coordinates match the last known location of the Veil Market before it moved." She tucked a curl of red hair behind her ear, a nervous habit Quinn had come to recognize. "Though, technically, we’re not supposed to be here. The Market’s entry requirements are… specific." Quinn exhaled through her nose. "And yet, here we are." The beam of her torch swept over the platform, catching on something half-buried in the grime—a
glint
of brass, half-hidden beneath a discarded newspaper. She crouched, brushing the debris aside with gloved fingers. The compass was small, its casing
etched
with sigils that made her eyes ache if she stared too long. The needle
trembled
, then swung wildly before settling on a point just beyond the platform’s edge. Eva’s breath hitched. "That’s a Veil Compass. They’re not supposed to be outside the Market. Ever." Quinn didn’t answer. She was too busy studying the scene. The platform was a mess—overturned crates, shattered glass, and what looked like scorch marks along the tiled walls. But something was off. The debris was too *neat*, too *deliberate*. Like someone had staged it. She stood, her boots crunching on broken glass. "Walk me through it." Eva hesitated, then stepped forward, her satchel bumping against her hip. "The Market was here last night. It moves e
very
full moon, but it’s always left some trace behind—residual magic, mostly. This time, though, it’s like something *ripped* it out of place." She gestured to the scorch marks. "That’s not just fire. It’s sigil burn. Someone tried to force their way in." Quinn’s gaze
flicked
to the compass in her hand. The needle hadn’t moved. "And this?" "It’s attuned to rifts," Eva said. "If it’s
pointing that way
, there’s either a portal or a tear in the veil somewhere nearby. But the Market’s supposed to be gone by now. It shouldn’t still be here." Quinn didn’t like the way Eva’s voice
wavered
. She didn’t like the way the compass’s needle *twitched*, either, like it was alive. She pocketed it, her fingers brushing the worn leather of her
watch
strap. "You said the Market moves. How?" Eva adjusted her glasses. "No one knows for sure. Theories range from dimensional folding to—" "A *what*?" "Never mind." Eva waved a hand. "The point is, it’s not supposed to be *here* anymore. But something went wrong. The energy’s all tangled up, like a knot that won’t come undone." Quinn’s jaw tightened. She didn’t believe in magic. She believed in evidence, in patterns, in the cold, hard truth of what could be seen and measured. But she also believed in her instincts, and right now, they
were screaming
at her that something was *
very
* wrong. She moved farther down the platform, her torchlight catching on a dark stain near the edge. Crouching again, she ran her fingers along the tile. Dry. Old. But the pattern was unmistakable—spatter, like something had
been thrown
with force. Or *someone*. Eva’s voice was quiet behind her. "You think this is connected to your case?" Quinn didn’t answer. She didn’t have to. They both knew the answer. Three years ago, her partner, DS Morris, had vanished during a routine surveillance op near Camden. The official report called it a botched raid, a tragic accident. But Quinn had seen the way Morris’s notes had gone from meticulous to frantic in the days leading up to his disappearance. She’d seen the symbols he’d scribbled in the margins—symbols that looked *suspiciously* like the ones
etched
into the compass in her pocket. She stood, her gaze sweeping the platform one last time. "There’s no body." Eva blinked. "What?" "No body," Quinn repeated. "If this was a struggle, if someone was hurt, where’s the body?" Eva’s fingers tightened around her notebook. "Maybe they took it with them." "Or maybe there wasn’t one to begin with." Quinn’s voice was low, her mind racing. "You said the Market moves. What happens if it doesn’t? What happens if it gets *stuck*?" Eva’s breath hitched. "You think the Market’s still here?" Quinn didn’t answer. She was already moving toward the edge of the platform, where the compass’s needle pointed like an accusing finger. The air grew thicker, heavier, pressing against her skin like a damp cloth. The thrumming sound intensified, vibrating in her bones. Then she saw it. A tear. Not in the wall, not in the floor—*in the air itself*. A jagged, shimmering line, like heat rising off pavement, but wrong. *Unnatural*. It pulsed, flickering in and out of existence, and for a second, Quinn thought she saw *something* moving on the other side. A shadow. A shape. Eva’s voice was a whisper. "Oh God." Quinn didn’t take her eyes off the tear. "You said the Market moves. How do we make it *stop*?" Eva didn’t answer. She didn’t have to. Because the tear *ripped* wider. A gust of wind howled through the station, carrying with it the scent of ozone and something older, something *rotten*. Quinn’s torch
flickered
, then died. The darkness pressed in, thick and suffocating, and for a heartbeat, she thought she heard laughter—high, shrill, *wrong*. Then the lights came back on. Not the torch. Not the emergency bulbs. *The Market.* Stalls materialized out of the dark, their awnings fluttering in a wind that didn’t exist. Lanterns
flickered
to life, casting long, distorted shadows across the platform. The air filled with the scent of spices and something metallic, like blood and old coins. And standing in the center of it all was a woman. She was tall, her skin the color of polished mahogany, her hair braided with strands of silver that caught the light like spider
silk
. Her eyes—*God, her eyes*—were entirely black, pupil and iris both, like pools of oil. Quinn’s hand went to her holster. The woman smiled. "Detective Quinn. We’ve been expecting you."
created_at
"2026-02-17T02:39:09.155977+00:00"
2/17/2026, 2:39:09 AM
model
"mistralai/mistral-small-creative-20251216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
253
moderation_latency
(null)
generation_time
8748
tokens_prompt
923
tokens_completion
1729
native_tokens_prompt
875
native_tokens_completion
1540
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0005495
router
(null)
provider_responses
0
id
"27182bcb9c5b47a9a842b8b5d49108df"
status
200
is_byok
false
latency
253
endpoint_id
"0118a65e-79f9-48b0-924e-38c4a41378e1"
provider_name
"Mistral"
model_permaslug
"mistralai/mistral-small-creative-20251216"
api_type
"completions"
id
"gen-1771295939-hbklSCVotjcr0rRGBnwX"
upstream_id
"27182bcb9c5b47a9a842b8b5d49108df"
total_cost
0.0005495
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Adverbs in dialogue tags
Target: ≤10% dialogue tags with adverbs
totalTags
2
adverbTagCount
0
adverbTags
(empty)
81.5838%
AI-ism adverb frequency
Target: <1% AI-ism adverbs (57 tracked)
wordCount
1086
totalAiIsmAdverbs
2
found
0
adverb
"softly"
count
1
1
adverb
"very"
count
1
highlights
0
"softly"
1
"very"
100.0000%
AI-ism character names
Target: 0 AI-default names (17 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
100.0000%
AI-ism location names
Target: 0 AI-default location names (33 tracked, −20% each)
codexExemptions
(empty)
found
(empty)
17.1271%
AI-ism word frequency
Target: <1% AI-ism words (140 tracked)
wordCount
1086
totalAiIsms
9
found
0
word
"flickered"
count
2
1
word
"flicked"
count
1
2
word
"glint"
count
1
3
word
"wavered"
count
1
4
word
"trembled"
count
1
5
word
"silk"
count
1
6
word
"etched"
count
2
highlights
0
"flickered"
1
"flicked"
2
"glint"
3
"wavered"
4
"trembled"
5
"silk"
6
"etched"
100.0000%
Cliche density
Target: ≤1 cliche(s) per 800-word window
totalCliches
0
maxInWindow
0
found
(empty)
highlights
(empty)
100.0000%
Emotion telling (show vs. tell)
Target: ≤3% sentences with emotion telling
emotionTells
0
narrationSentences
86
matches
(empty)
100.0000%
Filter word density
Target: ≤12% sentences with filter/hedge words
filterCount
1
hedgeCount
0
narrationSentences
86
filterMatches
0
"watch"
hedgeMatches
(empty)
100.0000%
Overuse of "that" (subordinate clause padding)
Target: ≤10% sentences with "that" clauses
thatCount
1
totalSentences
108
matches
0
"pointing that way"
100.0000%
Paragraph length variance
Target: CV ≥0.5 for paragraph word counts
totalParagraphs
43
mean
25.02
std
21.04
cv
0.841
sampleLengths
0
71
1
18
2
65
3
10
4
68
5
16
6
44
7
12
8
62
9
15
10
37
11
41
12
15
13
2
14
32
15
46
16
41
17
14
18
12
19
71
20
13
21
3
22
16
23
12
24
31
25
9
26
43
27
4
28
2
29
49
30
7
31
19
32
7
33
5
34
43
35
6
36
7
37
2
38
42
39
11
40
38
41
6
42
9
100.0000%
Passive voice overuse
Target: ≤5% passive sentences
passiveCount
1
totalSentences
86
matches
0
"been thrown"
100.0000%
Past progressive (was/were + -ing) overuse
Target: ≤10% past progressive verbs
pastProgressiveCount
2
totalVerbs
142
matches
0
"was scribbling"
1
"were screaming"
94.0000%
Purple prose (modifier overload)
Target: <4% adverbs, <2% -ly adverbs, no adj stacking
wordCount
818
adjectiveStacks
1
stackExamples
0
"thicker, heavier, pressing against her"
adverbCount
27
adverbRatio
0.03300733496332518
lyAdverbCount
4
lyAdverbRatio
0.004889975550122249
100.0000%
Repeated phrase echo
Target: ≤20% sentences with echoes (window: 2)
totalSentences
108
echoCount
0
echoWords
(empty)
100.0000%
Sentence length variance
Target: CV ≥0.4 for sentence word counts
totalSentences
108
mean
9.96
std
7.58
cv
0.761
sampleLengths
0
21
1
23
2
17
3
10
4
12
5
6
6
18
7
33
8
14
9
5
10
5
11
24
12
9
13
19
14
16
15
3
16
13
17
3
18
7
19
18
20
4
21
7
22
5
23
8
24
4
25
11
26
37
27
14
28
9
29
4
30
2
31
6
32
31
33
8
34
14
35
13
36
6
37
4
38
11
39
2
40
6
41
26
42
3
43
5
44
18
45
20
46
16
47
9
48
1
49
1
47.5309%
Sentence opener variety
Target: ≥60% unique sentence openers
consecutiveRepeats
7
diversityRatio
0.32407407407407407
totalSentences
108
uniqueOpeners
35
50.0000%
Dialogue tag variety (said vs. fancy)
Target: ≤30% fancy dialogue tags
totalTags
2
fancyCount
1
fancyTags
0
"Quinn repeated (repeat)"
87.6613%