NC Bench
Run: 2026-02-20-01KHY7JKZSR463FP0HPFRTGH7V
Run Details
Model: Ministral 3 3B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 2.5s
Cost: $0.000075
Provider: Mistral
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
92.95% | Dialogue to Total Word Ratio | (details)
46.4773% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party hall pulsed like a heartbeat, casting jagged shadows across the walls. Molly adjusted her scarf—too tight, too obvious—while Bill fiddled with his phone, scrolling through the security footage of the venue. The camera feed was grainy, but it showed the front door locked tight, the security guard’s uniform looking suspiciously like the one from last week’s *Mystery of the Missing Lollipop* case. “You think this place is *that* well-guarded?” Molly asked, her voice dripping with fake concern. “Because if it is, we’re gonna need a *very* convincing story about why we’re here.” Bill smirked, swiping through another angle. “Nah, just a few bored parents and a guy who probably thinks he’s the world’s worst DJ. We’ll just—” “*We’ll what?*” Molly cut in, her fingers twitching toward the doorknob. “Bluff our way in? Pretend to be lost? Or do you have a *plan*?” Bill groaned, rubbing his temples. “I was gonna say we’d just—” He paused, then grinned. “Okay, fine. We’ll say we’re *emergency* babysitters. That way, we can sneak in, grab the presents, and *very* subtly*—” “*Very* subtly* what?*” Molly hissed, already yanking the door open. The hallway beyond was lined with plastic chairs and a half-empty punch bowl, but the real prize was just ahead—a glittery, glittery disaster of a kids’ party. A dozen kids in matching outfits were huddled around a table piled high with wrapped gifts, their faces alight with excitement. And right in the center? A *very* suspicious-looking man in a Santa hat, holding a clipboard. Bill froze. “Oh. *Oh no.*” Molly’s eyes narrowed. “He’s not just a Santa. He’s a *security Santa.*” Bill groaned. “We’re screwed.” Molly’s grin was all teeth. “Nah. We’re just gonna *play along.*” She grabbed Bill’s arm and dragged him toward the nearest kid, who was already reaching for a present. “Hey, little buddy! You ever wonder how Santa knows *all* your secrets?” The kid blinked. 
“Uh… yeah?” “Well, he doesn’t,” Molly said, pulling out a tiny, stolen flashlight. “But *we* do. And we’re gonna need your help.” She held up the flashlight, shining it directly into the Santa’s face. The man’s eyes widened—then he *screamed.* Bill, meanwhile, had already grabbed a nearby cake stand and was using it as a shield. “We’re *not* doing this,” he muttered, but his voice was drowned out by the chaos unfolding around them. The kids were now screaming, the Santa was running, and Molly was already back at the door, yanking it open wider. “Come on!” she shouted over the noise. “Let’s *go!*” Bill sighed, but he was already running—right into a very confused security guard who had just realized his alarm wasn’t going off.
created_at: "2026-02-20T19:14:11.077Z" (2/20/2026, 7:14:11 PM)
model: "mistralai/ministral-3b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 192
moderation_latency: (null)
generation_time: 2232
tokens_prompt: 77
tokens_completion: 751
native_tokens_prompt: 69
native_tokens_completion: 683
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0000752
router: (null)
provider_responses:
  0:
    endpoint_id: "b9f6828f-aef9-498a-91f5-6816fbf72420"
    id: "d41c5d3cca5c4df5bdc9749a97c76203"
    is_byok: false
    latency: 192
    model_permaslug: "mistralai/ministral-3b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771614851-kZGGbzuG0x8tcpSAS9bT"
upstream_id: "d41c5d3cca5c4df5bdc9749a97c76203"
total_cost: 0.0000752
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
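A few derived figures follow from the metadata above. The sketch below computes completion throughput and a blended cost per million native tokens. It assumes that `latency` and `generation_time` are in milliseconds and that `total_cost` is in US dollars; the page does not state units, so both are assumptions. The function names are hypothetical and are not part of any NC Bench or provider API.

```python
def tokens_per_second(tokens: int, generation_time_ms: float) -> float:
    """Completion throughput, assuming the time is given in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return tokens / (generation_time_ms / 1000.0)


def cost_per_million_tokens(total_cost: float, tokens: int) -> float:
    """Blended cost per one million tokens, assuming total_cost is in USD."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return total_cost / tokens * 1_000_000


# Values copied from the metadata above (units are assumed, see lead-in).
meta = {
    "latency": 192,
    "generation_time": 2232,
    "native_tokens_prompt": 69,
    "native_tokens_completion": 683,
    "total_cost": 0.0000752,
}

throughput = tokens_per_second(
    meta["native_tokens_completion"], meta["generation_time"]
)
native_total = meta["native_tokens_prompt"] + meta["native_tokens_completion"]
blended_rate = cost_per_million_tokens(meta["total_cost"], native_total)
elapsed_ms = meta["latency"] + meta["generation_time"]

print(f"throughput: {throughput:.1f} tokens/s")
print(f"blended rate: ${blended_rate:.2f} per million native tokens")
print(f"latency + generation_time: {elapsed_ms} ms")
```

Under these assumptions the run works out to roughly 306 completion tokens per second and $0.10 per million native tokens (69 prompt plus 683 completion tokens). The sum of `latency` and `generation_time` is 2424 ms, close to but not identical with the 2.5s duration reported in Run Details.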
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 455
92.95% | Dialogue to Total Word Ratio | Ratio: 31.21%, Deviation: 1.21% | neededClean: false, wordsTotal: 455, wordsDialogue: 142
46.4773% | Overall | |
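The evaluation details above can be cross-checked with a little arithmetic. The sketch below recomputes the dialogue ratio and its deviation from the 30% target using the reported word counts, then checks whether the overall score is consistent with an unweighted mean of the two evaluator scores. The averaging rule is an assumption inferred from the numbers, not something the page states, and the function names are hypothetical. The mapping from a 1.21% deviation to the 92.95% score is not reproduced here because the page does not describe it.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float) -> float:
    """Absolute difference, in percentage points, from the target ratio."""
    return abs(ratio_pct - target_pct)


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (an assumed aggregation rule)."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(scores) / len(scores)


# Values reported in the evaluation details.
ratio = dialogue_ratio(words_dialogue=142, words_total=455)
deviation = deviation_from_target(ratio, target_pct=30.0)
overall = mean_score([0.00, 92.95])

print(f"ratio:     {ratio:.2f}%")
print(f"deviation: {deviation:.2f}%")
print(f"overall:   {overall:.3f}%")
```

The recomputed ratio (31.21%) and deviation (1.21%) match the reported values. The mean of the displayed scores is 46.475%, slightly below the reported 46.4773%; that gap would be explained if the page averages an unrounded dialogue score of about 92.9546% and rounds it to 92.95% only for display.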