NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9AHXW8MZNG81EZ78M936W
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5.5s
Cost:
$0.000129
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering *Open* sign cast long shadows across the sidewalk as Molly adjusted the brim of her too-tight baseball cap, squinting at the three-story mansion ahead. "Alright, Bill," she muttered, "if we’re doing this, we do it right. No last-minute panic when the kid in the red shirt starts juggling flaming torches." Bill, perched on the fire escape like a disgruntled raccoon, adjusted his gloves. "Relax, Molly. I’ve got the distraction covered. The kid’s got a *serious* sweet tooth. I’ll bribe him with a handful of gummy worms, and by the time he’s licking his fingers, we’ll be in and out like ghosts with a side of glitter." Molly rolled her eyes. "You’re telling me you’re going to *blackmail* a six-year-old with candy? That’s not a distraction, that’s a war crime." Bill grinned, dangling a suspiciously large bag of gummy bears from his pocket. "Nah, just a *temporary* alliance. Besides, if he tells his parents, I’ll just blame it on the raccoons. They’re *always* stealing stuff." Molly sighed, pulling out a tiny lockpick set from her boot. "Fine. But if we get caught, I’m blaming you for not bringing enough duct tape. And also for that time you tried to hotwire a go-kart with a toaster." Bill waved a hand. "Details. The real question is—how do we get past the *security*?" He gestured toward the two burly men in matching "Party Patrol" vests lounging by the front door, sipping lemonade like they’d already won the game. Molly tapped her chin. "Easy. We don’t. We go in through the *kids*." She pointed at the backyard, where a dozen wide-eyed children were already tearing into their presents like they’d never seen a box before. "Those little monsters are our ticket. I’ll blend in as the ‘cool aunt’ who ‘forgot her phone,’ and you can play the ‘lost uncle’ who ‘just needs to borrow the Wi-Fi for one second.’" Bill groaned. "I *hate* pretending to be family. Last time I did that, I had to explain to a toddler why I wasn’t actually his ‘big brother’ who ‘forgot to shave.’" Molly smirked. "Just smile and say you’re ‘on a *very* important mission.’ Works every time." Bill hesitated, then nodded. "Alright, but if I have to listen to another kid ask me if I’m a ‘ninja,’ I’m quitting." Molly clapped him on the shoulder. "Deal. Now let’s go steal some toys before the parents realize their kids are basically living in a Walmart." Bill adjusted his cap, then paused. "Wait—what if the presents are *already* gone? What if this is some kind of trap?" Molly shot him a look. "Bill, if this were a trap, we’d be in a *very* small, very dark room right now with a sign that says *‘Congratulations, you’re the bait.’* Now move it. The cake’s probably already been cut, and I *need* that frosting." Bill sighed, then hopped down. "Fine. But if we get arrested, I’m suing for emotional damages." Molly grinned. "No promises." And with that, they melted into the crowd of giggling children, already plotting their next move—because in heists, the real treasure was never the presents. It was the chaos.
created_at
"2026-02-20T19:44:43.972Z"
2/20/2026, 7:44:43 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
189
moderation_latency
(null)
generation_time
4831
tokens_prompt
77
tokens_completion
822
native_tokens_prompt
69
native_tokens_completion
789
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001287
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"97bab1536e634a139e2c3f7de7d9f17d"
is_byok
false
latency
189
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771616683-2OhuV6JamiCtqSukKJpr"
upstream_id
"97bab1536e634a139e2c3f7de7d9f17d"
total_cost
0.0001287
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
539
0.00%
Dialogue to Total Word Ratio
Ratio: 64.38%, Deviation: 34.38%
neededClean
false
wordsTotal
539
wordsDialogue
347
0.0000%