NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAZY3SQDA18JKPBETDB6XV
Run Details
Model:
Ministral 3 14B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
8.5s
Cost:
$0.000237
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
2.59%
Dialogue to Total Word Ratio
(details)
1.2939%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party store’s sign flickered like a dying firefly as Molly adjusted her oversized bow tie, the plastic bow wobbling precariously under her chin. She peered through the fogged-up window, her breath misting the glass in a pattern that vaguely resembled a sad, lopsided smile. Inside, a dozen kids in matching dinosaur onesies were tearing through stacks of presents like seagulls at a buffet. The parents, blissfully unaware, sipped punch from plastic cups while a piñata shaped like a grumpy T-Rex dangled ominously above them. Bill crouched beside her, his fingers twitching near the lock on the side door. He wore a Santa hat tilted at a jaunty angle, the red pom-pom bobbing with every nervous twitch. “You sure about this?” he muttered, voice tight. “Kids are *terrible* at security. They’ll scream, they’ll cry, they’ll *eat* the evidence.” Molly smirked, plucking a candy cane from her pocket and snapping it in half. “Relax, Bill. We’re not here to rob a bank. We’re liberating *toys*. The moral high ground is *ours*.” She tapped the side of her head. “Besides, I did my recon. The parents are all too busy judging each other’s cupcakes to notice a couple of ‘lost’ partygoers.” Bill exhaled through his nose, the sound suspiciously like a deflating balloon. “You realize if we get caught, we’re gonna have to explain why we’re dressed like a failed Christmas elf and a confused clown.” “First of all,” Molly said, adjusting her too-tight clown shoes, “I am *not* a clown. I am a *mime*. A *tragic* mime.” She gestured vaguely toward the door. “Second, if we *do* get caught, we’ll just say we’re the ‘mystery guests’ from the ‘surprise game.’ You know, the one where the parents *forgot* to invite us?” Bill’s eye twitched. “That’s the worst lie I’ve ever heard.” “It’s *creative*.” Molly grinned, then suddenly gasped, clutching her chest. “Oh no. Oh no no no.” Bill groaned. “What now?” “My *glitter*.” She patted her pockets frantically. “I *swear* I had glitter. For *distraction purposes*.” Bill pinched the bridge of his nose. “You brought glitter to a *children’s party*?” “It’s *versatile*!” Molly hissed. “We could’ve used it to blind the security cameras, or—” “There *are no security cameras*,” Bill snapped. “This is a *party store*, Molly. The biggest threat is a toddler with a juice box.” Molly’s face lit up. “*Exactly*. Toddlers are *easy*. We’ll just—” She cut herself off, eyes widening. “Bill. *Look*.” Inside, a kid in a fireman helmet had just knocked over a tower of presents, sending wrapping paper and ribbons flying like confetti in a hurricane. Another child shrieked, pointing at the piñata, which had somehow been *sabotaged*—a string now dangled loose, the T-Rex’s beak gaping open like it was inviting them to strike. Bill’s shoulders slumped. “We’re not robbing this place. We’re *babysitting*.” Molly grabbed his arm, her fingers digging in. “No. We’re *opportunists*. That piñata’s a *goldmine*. And look—” She nodded toward the back corner, where a table groaned under the weight of half-unwrapped gifts. “The parents are *distracted*. The kids are *chaotic*. This is *heist theater*, Bill. We’re the *main event*.” Bill opened his mouth to argue, but then a particularly loud *CRASH* echoed from inside, followed by the sound of a child wailing. Molly’s eyes gleamed. “Now’s our moment.” She yanked Bill toward the door, her Santa hat askew. “Move. And *try* to look less like a man who’s about to commit grand larceny.” Bill adjusted his hat, muttering, “I *am* a man who’s about to commit grand larceny.” Molly shoved him forward. “Then *act* like it.” The door creaked open, and the scent of cake and sticky fingers hit them like a wave. Molly took a deep breath, then stepped inside—only to immediately trip over a stray toy truck, sending it skidding across the floor with a loud *BANG*. A small voice piped up from behind a couch. “Who’s there?” Molly froze. Bill’s face went pale. From the shadows, a pair of wide, curious eyes blinked at them. Molly forced a smile. “Uh. Hi! We’re the—” The kid grinned. “You’re *late*.” Bill’s soul left his body. Molly’s brain short-circuited. And somewhere, a piñata waited.
created_at
"2026-02-20T20:13:53.157Z"
2/20/2026, 8:13:53 PM
model
"mistralai/ministral-14b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
208
moderation_latency
(null)
generation_time
8193
tokens_prompt
77
tokens_completion
1181
native_tokens_prompt
69
native_tokens_completion
1117
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0002372
router
(null)
provider_responses
0
endpoint_id
"0b53717b-9af5-4de3-80af-ebedd2adf981"
id
"5cde514f50524b13bef72fec43ae5c18"
is_byok
false
latency
208
model_permaslug
"mistralai/ministral-14b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618433-x0oyr5YkokOIx8NuIjLB"
upstream_id
"5cde514f50524b13bef72fec43ae5c18"
total_cost
0.0002372
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
703
2.59%
Dialogue to Total Word Ratio
Ratio: 38.55%, Deviation: 8.55%
neededClean
false
wordsTotal
703
wordsDialogue
271
1.2939%