NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM0AREETN729ZQN8HCDM0XG8
Run Details
Model:
Mistral Small 4
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
8.4s
Cost:
$0.000444
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
45.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the fedora perched sideways on her head, squinting at the glittering sign outside the community center that flashed in neon: "BIRTHDAY BONANZA: UNLIMITED CAKE & FUN!" She nudged Bill with her elbow, his broad shoulders trembling as he chuckled into his paper cup of cheap coffee. "Honestly, Bill," she drawled, "if I ever need a sign that we're in the right place, that neon monstrosity is it. 'Unlimited cake'? Who's dumb enough to believe that?" Bill wiped coffee from his upper lip, leaving a brown smudge above his lip like a failed attempt at a mustache. "It’s perfect, Moll. Parents always think their kids’ parties are special. Like the cake’s homemade with unicorn tears or something. Meanwhile, it’s frosted store-bought with rainbow sprinkles and guilt." He crumpled the cup and tossed it into a nearby trash can, missing by a mile. It hit the pavement with a plastic sigh. Molly sighed and pulled a crumpled floor plan from her jacket pocket. "Alright, genius. According to this, after the magician's show—which is probably just Derek from accounting with a wand and a nervous tic—the parents all troop into the kitchen for the 'birthday cake cutting ceremony.' Meanwhile, the kids are herded into the craft room to make feather boas or some nonsense." She tapped the paper with a chipped red nail. "That’s our window. Ten minutes max. Cake guy hands out treats, cake-maker cuts, everyone oohs and aahs. No one's watching the playroom." Bill scratched his stubble, considering. "So we sneak in, grab the good loot, and vanish before the parents realize the action figures are gone and the glitter has called in witness statements." He smirked. "Filosofy." Molly rolled her eyes. "Philosophy? Bill, you’re holding hands with a thesaurus again, aren’t you?" She pointed toward the side door, half-hidden by a sad-looking inflatable bouncy castle from last year’s zoo-park fundraiser. "See that? Blocked by a folding table with a sign: 'Staff Only: Please Do Not Enter—Health & Safety.' Which, of course, means 'Staff is nursing hangovers and wondering why they wore jeans in June.'" Bill adjusted his fake mustache—another prop that probably wouldn’t fool a child under eight. "Fine. We go in quiet. You take the high shelves, I take the floor. Like a party raid." "Like a grift raid," Molly corrected. “We’re not raiding a party. We’re executing a precision heist on childish claptrap.” Bill grinned. “Same thing.” Molly exhaled sharply through her nose. "You’re insufferable when you’re proud of yourself. But okay. Quiet. Fast. No touching anything that’s not *obviously* a toy or game. We’re not taking the teddy bear with the glitter tears. That thing looks haunted." Bill mimed locking his lips and throwing away the key. "I’ll be as silent as a ninja granny at midnight.” Molly shot him a look. "There is no such thing." "Proves how little you know about covert operations," he whispered back, tapping his temple.
created_at
"2026-03-18T11:19:40.263Z"
3/18/2026, 11:19:40 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
365
moderation_latency
(null)
generation_time
8264
tokens_prompt
77
tokens_completion
774
native_tokens_prompt
81
native_tokens_completion
720
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00044415
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"e0e47094d03b4907a3336852986a2115"
is_byok
false
latency
365
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773832780-Bx1ESrpGmN8fKGGae4Ym"
upstream_id
"e0e47094d03b4907a3336852986a2115"
total_cost
0.00044415
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
490
0.00%
Dialogue to Total Word Ratio
Ratio: 62.02%, Deviation: 32.02%
neededClean
false
wordsTotal
495
wordsDialogue
307
45.2419%