NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM090BBN63FA8CZGSZN67D9A
Run Details
Model:
Mistral Small 4
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7.4s
Cost:
$0.000440
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
93.65%
Matches word count
(details)
69.28%
Dialogue to Total Word Ratio
(details)
81.4624%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her rainbow-colored ski mask in the rearview mirror of their battered getaway van, watching Bill wrestle with his toddler-sized overalls. "You look ridiculous," she said, grabbing a handful of glitter from the console to dab under her eyes like war paint. Bill tugged at the straps, the overalls straining against his beer gut. "I look like what a professional nanny would look like if she moonlighted as a bouncer. These suspenders alone would make a bank vault door blush." Molly snorted, tossing him a pacifier. "Put that in your mouth and stay cute. The hostess already thinks we're the entertainment. You're a clown dad who lost a bet with his wife." Bill popped the pacifier in, immediately looking like he'd been replaced by a zombie toddler. "Now I just need to forget how to count past 10 and we're golden. But seriously, the loot is just sitting there? No security? No motion sensors? No… guard dogs with suspiciously elegant collars?" Molly rolled her eyes. "It's a children's party, Bill. The only security is a guy named Kip who falls asleep by midnight and a single mom named Jess who checks the bathroom every hour because she's terrified someone will steal the balloon animals." She tossed him a pair of inflatable bat wings. "Here. Perfect for hiding weapons." Bill held them up. "I'm going to look like a deranged Batman rejected by Gotham City." "Exactly. You're not a clown dad anymore. You're a tragic superhero here to make the kid’s birthday memorable—not in the ‘your presents were just robbed’ sense." Bill wiggled into the wings, the plastic crinkling like cheap fireworks. "Alright, let's review. We get in, we stuff our goody bags with anything that isn't nailed down, we make a dramatic exit, and at no point do we—" "At no point do we get caught," Molly finished. "We've scoped this place three times, Bill. Unless a kid starts interrogating us about Minecraft tier lists, we're golden. And even then, I'll distract them with a made-up theory about why the Hulk’s pants size changes." Bill adjusted his bat wings nervously. "You think the parents will notice the haul is missing by morning?" Molly grinned. "Nah. They’ve got phones full of videos, a fridge stuffed with casseroles from well-meaning relatives, and a toddler who will cry over not getting the exact green dinosaur she *demanded*—not missing. By the time they realize, we'll be sipping margaritas in Mexico under names like Ricardo and Señora Fluffy McSnackFace." Bill winced. "That’s the worst alias I’ve ever heard." "It’s thematic! Now let’s go. Kip’s about to start his 8pm Snack Time parade, and I want to be front row to steal the big stuffed unicorn while everyone’s distracted by cupcake crumbs." Bill sighed, waddling toward the house with the enthusiasm of a man marching to his own execution. "If I get stuck in this outfit forever, I’m haunting you in your sleep." Molly skipped beside him, clutching their goody bags. "Then I’ll invest in good earplugs. And a bodyguard. Probably named Kip."
created_at
"2026-03-18T10:49:02.085Z"
3/18/2026, 10:49:02 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
363
moderation_latency
(null)
generation_time
7298
tokens_prompt
77
tokens_completion
792
native_tokens_prompt
81
native_tokens_completion
713
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00043995
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"b92a46d35157464fa2f311826d2eded8"
is_byok
false
latency
362
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830942-WDd6eLB2yDI7KmPA8dxX"
upstream_id
"b92a46d35157464fa2f311826d2eded8"
total_cost
0.00043995
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
93.65%
Matches word count
n/a
neededClean
false
words
509
69.28%
Dialogue to Total Word Ratio
Ratio: 72.71%, Deviation: 2.71%
neededClean
false
wordsTotal
513
wordsDialogue
373
81.4624%