NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7QGV8EVPAV6J4QYMXKA1R
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4.2s
Cost:
$0.000120
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
51.93%
Matches word count
(details)
81.47%
Dialogue to Total Word Ratio
(details)
66.6956%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon sign outside the *Whimsy Wonderland* party venue flickered like a drunk firefly, casting a sickly glow over the parked cars. Molly adjusted her sunglasses—though it was dark—and nudged Bill with her elbow. "You’re staring at the cake like it’s the last slice of pizza on Earth." Bill wiped imaginary crumbs from his chin. "It’s a *three-tiered* chocolate wonderland. With sprinkles that look like they were hand-painted by a toddler who just discovered glitter." He sighed. "And the kids are *already* tearing into the good stuff. We’re gonna be late." Molly grinned, cracking her knuckles. "Good. The later we start, the more time we’ve got to case the joint. Also, the more time the parents have to get drunk and forget their kids are running around with a piñata full of loose change." Bill groaned. "You’re evil." "Just efficient." She pulled out a blueprint rolled tight like a burrito. "According to my sources—aka that one guy at the bakery who owes me twenty bucks—there’s a side door by the restrooms. It’s got a magnetic lock. Easy pickings." Bill snatched the paper, squinting. "You sure? Last time you ‘cased’ a place, we ended up in a *daycare’s* freezer section." Molly rolled her eyes. "That was *one time*. And it was *their* fault for leaving the door unlocked. This place has security cameras, but they’re all focused on the main entrance. The side door’s blind spot central." Bill tapped the paper. "And the presents?" "Stacked like Jenga towers in the corner of the ballroom. Wrapped in *expensive* paper. Probably gold foil or something. I already checked the security logs—no one’s been in there since setup. It’s a ghost town." Bill rubbed his temples. "We’re stealing from *kids*." Molly gasped, clutching her chest. "Bill! That’s *not* what we’re doing. We’re stealing from *parents* who bought these things. The kids get the leftovers—like the broken toys and the half-eaten cupcakes. It’s *capitalist punishment*." Bill stared at her. "You’re a menace." "To the system." She grinned. "Now, do we go in now, or do we wait for the parents to pass out from the punch?" Bill checked his watch. "The punch’s got vodka in it. They’ll be out by midnight. We’ve got *three hours*." Molly whooped, pumping her fist. "Then let’s move. But if we get caught, I’m blaming the sprinkles." Bill groaned. "You’re *insufferable*." Molly winked. "And yet, you love me." Bill sighed, but his lips twitched. "Just don’t let me get caught with a cupcake in my hand." Molly adjusted her mask—just a silly clown nose she’d swiped from the party’s face-painting station. "No promises." They slipped into the shadows, the distant sound of laughter and squeals following them like a bad omen. Molly nudged Bill again. "You ready?" Bill took a deep breath. "For the greatest heist in children’s party history?" Molly grinned. "Let’s do it."
created_at
"2026-02-20T19:16:51.701Z"
2/20/2026, 7:16:51 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
246
moderation_latency
(null)
generation_time
3611
tokens_prompt
77
tokens_completion
779
native_tokens_prompt
69
native_tokens_completion
732
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00012015
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"81a47c8394b04c7a89c401da0d7ee31c"
is_byok
false
latency
246
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771615011-gm3fgQcDdW5OMXbZjHRh"
upstream_id
"81a47c8394b04c7a89c401da0d7ee31c"
total_cost
0.00012015
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
51.93%
Matches word count
n/a
neededClean
false
words
484
81.47%
Dialogue to Total Word Ratio
Ratio: 67.98%, Deviation: 2.02%
neededClean
false
wordsTotal
484
wordsDialogue
329
66.6956%