NC Bench
Run: 2026-02-20-01KHY9GBA6FZ667E267H3CVTFC
Run Details
Model: Ministral 3 8B
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 5.3s
Cost: $0.000093
Provider: Mistral
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 0.00% | Matches word count |
| 0.00% | Dialogue to Total Word Ratio |
| 0.0000% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party balloon arch pulsed like a drunk firefly outside the window, its reflection dancing across the sticky floor of Molly’s apartment. She balanced a half-eaten cupcake on her knee, the frosting smeared in a way that suggested she’d been using it as a stress ball. Bill leaned against the doorframe, arms crossed, his tie already loosened despite the fact that he’d never worn one before tonight. “So,” he said, “the kid’s name is Timmy, right? And his parents think he’s the next Einstein because he can recite the periodic table?” Molly wiped frosting off her chin with the back of her hand. “Timmy’s got the brain of a goldfish and the attention span of a squirrel on espresso. But his parents? They’ve got the wallet of a lottery winner who just found out it’s rigged.” She tossed a crumpled flyer onto the table—*Timmy’s 5th Birthday Bash: All the Toys, All the Cake, All the Chaos (Probably)*—and grinned. “We hit at midnight, when the parents are too drunk to notice us slipping in like ghosts. Or at least like very loud, very sweaty ghosts.” Bill adjusted his imaginary tie. “And the security? The guy in the stupid hat who thinks he’s James Bond?” “Oh, he’s *definitely* James Bond,” Molly said, popping the last bite of cupcake into her mouth. “But he’s also the guy who once mistook a squirrel for a burglar. We’ll just… *distract* him. Maybe offer him a ‘free’ cupcake. See how that goes.” Bill sighed, rubbing his temples. “I still don’t like it. What if we get caught?” Molly stood, dusting flour off her jeans—though she hadn’t baked anything. “Then we’ll tell them we’re the *real* parents, and Timmy’s just a very confused foster kid. Works every time.” She grabbed a duffel bag from the closet, its contents jingling ominously. “Besides, have you *seen* the prize list? A Nerf gun that shoots *lasers*, a robot that dances, and a *giant* Lego set of a spaceship that’s basically a cardboard box with wheels. 
This isn’t stealing—it’s *art theft*.” Bill groaned. “You’re evil.” Molly grinned. “And you love me.” She tossed him a key fob. “Now let’s go make some magic. Or at least some *very* loud magic.”
created_at: "2026-02-20T19:47:53.813Z" (2/20/2026, 7:47:53 PM)
model: "mistralai/ministral-8b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 208
moderation_latency: (null)
generation_time: 5168
tokens_prompt: 77
tokens_completion: 591
native_tokens_prompt: 69
native_tokens_completion: 554
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00009345
router: (null)
provider_responses:
  0:
    endpoint_id: "c85d26e1-1e0c-449b-9775-2afce7ae510b"
    id: "66dbebf42bd747438b4edca902690505"
    is_byok: false
    latency: 208
    model_permaslug: "mistralai/ministral-8b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771616873-2HRrVcGzuRBYVI7D9Ala"
upstream_id: "66dbebf42bd747438b4edca902690505"
total_cost: 0.00009345
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.00% | Matches word count | n/a | neededClean: false, words: 376 |
| 0.00% | Dialogue to Total Word Ratio | Ratio: 56.65%, Deviation: 46.65% | neededClean: false, wordsTotal: 376, wordsDialogue: 213 |
| 0.0000% | Overall | | |
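The reported ratio follows directly from the recorded counts: 213 dialogue words out of 376 total is 56.65%, and the reported deviation of 46.65% is consistent with subtracting the 10% target from that ratio. The evaluator's actual counting and scoring code is not shown on this page, so the sketch below is an assumption about how such a check could work: it treats any text between double quotation marks (curly or straight) as dialogue and splits words on whitespace. The function names are hypothetical.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio), counting quoted text as dialogue.

    Assumption: words are whitespace-separated and dialogue is anything
    between double quotation marks, curly or straight.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'[“"]([^“”"]*)[”"]', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


def deviation_from_target(ratio: float, target: float = 0.10) -> float:
    """Absolute distance between the measured ratio and the requested one."""
    return abs(ratio - target)


# Reproduce the reported figures from the recorded word counts.
reported_ratio = 213 / 376
print(f"Ratio: {reported_ratio:.2%}")                                # Ratio: 56.65%
print(f"Deviation: {deviation_from_target(reported_ratio):.2%}")     # Deviation: 46.65%
```

Run on a short sample such as `“Go now,” she said quietly.`, `dialogue_ratio` returns 2 dialogue words out of 5. The same arithmetic shows the length miss: 376 words against a 200-word request is 88% over target.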