NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYA97YCXWRR4PM1236TC0W4
Run Details
Model:
Ministral 3 3B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3.0s
Cost:
$0.000085
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
0.32%
Dialogue to Total Word Ratio
(details)
49.5145%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
**Title: *The Great Toy Heist*** The neon glow of the party hall pulsed like a disco heartbeat outside Molly’s window. Inside, a dozen kids screamed as a giant inflatable dragon lunged at them. Molly adjusted her ski mask, her fingers twitching with the weight of her lockpick kit. “Bill,” she whispered, “we’re not just stealing presents. We’re stealing *joy*.” Bill wiped his brow with a crumpled napkin, his breath fogging the cold air. “Joy? Molly, we’re in *kidnap mode* here. If we get caught, they’ll call the cops, and then the cops’ll call the *parents*, and then the parents’ll call the *fire department*—” “And then we’ll be the heroes,” Molly interrupted, tossing him a stolen candy cane. “The *real* heroes.” Bill took a bite, chewing thoughtfully. “You’re right. But we need a plan. The security system’s got motion sensors, and the doors are reinforced with *toy guns*.” “Toy guns?” Molly scoffed. “They’re just for show. The real locks are behind the *giant stuffed animal*.” Bill’s eyes lit up. “You’re on. But we need a distraction.” “A distraction?” Molly rolled her eyes. “We’re *stealing presents*, not *saving the world*.” “Fine,” Bill said, grinning. “We’ll just… *pretend* to be the party’s worst guests.” Molly groaned. “You’re terrible at this.” “Nah,” Bill said, pulling out a stolen party hat. “I’m *perfect*. I’ll be the guy who keeps dropping his *sock* everywhere.” Molly sighed but smirked. “Fine. But if we get caught, I’m blaming you.” “Deal,” Bill said, already sprinting toward the front door. “First rule: no screaming. Second rule: *steal everything*.” Molly grabbed her bag of tools and followed, her heart pounding. The party was chaos—kids laughing, parents yelling, and a very confused bouncer staring at them like they were aliens. Then Bill did it. He tripped over his own sock, faceplanting into a pile of balloons. The kids gasped. The parents gasped. The bouncer gasped. Molly grinned. “Now *that’s* a distraction.” Bill wiped his mouth, adjusting his hat. “We’re in.” The moment the bouncer turned away, Molly kicked in the door. Inside, the room was a riot of wrapped gifts, glitter, and the occasional confused toddler. Bill grabbed a sack. “First rule: *no glitter*.” Molly rolled her eyes but started pilfering. “Second rule: *steal the most expensive stuff*.” They worked in silence, their hands moving faster than their minds. A stuffed unicorn. A toy car. A *very* expensive action figure. Then Bill froze. “Molly,” he whispered, “what if we *don’t* get caught?” Molly paused, holding a half-wrapped present. “Then we’ll just say we were *testing the security system*.” Bill groaned. “You’re evil.” “And you’re *useless*,” Molly shot back, already stuffing another gift into her bag. Bill sighed, but he was grinning. “Fine. But if we get caught, I’m *blaming you*.” Molly smirked. “Deal.” And with that, the two greatest thieves in history vanished into the night, leaving behind only a trail of stolen toys and the faintest echo of a very confused bouncer’s scream.
created_at
"2026-02-20T20:01:29.56Z"
2/20/2026, 8:01:29 PM
model
"mistralai/ministral-3b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
196
moderation_latency
(null)
generation_time
2658
tokens_prompt
77
tokens_completion
877
native_tokens_prompt
69
native_tokens_completion
784
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000853
router
(null)
provider_responses
0
endpoint_id
"b9f6828f-aef9-498a-91f5-6816fbf72420"
id
"bff2af86afce4c90b25037efa8f7a838"
is_byok
false
latency
196
model_permaslug
"mistralai/ministral-3b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771617689-KxooEQld6w367DkdIhCS"
upstream_id
"bff2af86afce4c90b25037efa8f7a838"
total_cost
0.0000853
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
494
0.32%
Dialogue to Total Word Ratio
Ratio: 39.27%, Deviation: 10.73%
neededClean
false
wordsTotal
494
wordsDialogue
194
49.5145%