NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY861CQHEPB8JFNN1SAACV7
Run Details
Model:
Ministral 3 3B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2.7s
Cost:
$0.000074
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party hall pulsed like a heartbeat, casting jagged shadows across the walls. Molly adjusted her gloves—too tight, but better than nothing—and checked her watch one last time. "Bill, we’re running out of time." Bill wiped his brow with the back of his hand, leaving a smear of glitter on his sleeve. "Yeah, yeah. The security guard’s got a *thing* for sparkly things. He’s been sniffing around like a bloodhound since we left the bakery." Molly sighed, rolling her eyes. "You’re the one who said we’d need a distraction. Not that we’re actually *distracting* anything." "Relax, Molly. We’re just… *disguising* ourselves." He pulled a pair of oversized sunglasses from his pocket and slapped them onto his face. "Now *this* is a distraction." She groaned. "Bill, you’re a menace. What if he tries to take a selfie with you?" "Then he’ll have a *very* confused face." He grinned, flexing his arms. "Besides, we’re not here to *steal* anything. We’re here to *collect*." Molly pinched the bridge of her nose. "We’re not even sure what we’re collecting." "Trust me, we’ve got a *plan.*" He pulled a small, crumpled note from his pocket. "The party favors are all in the same box. Just… *steal the box.*" She rolled her eyes. "The box is *locked.*" "Then we’ll *break the box.*" Molly groaned. "We’re going to be arrested for *box vandalism.*" "Nah, we’ll just say we were *testing the locks.*" He adjusted his sunglasses again. "Besides, the presents are *all* in there. We could have a *whole year’s worth* of toys." Molly hesitated. "What if the presents are *not* all in there?" "Then we’ll just *pretend* we found them." He grinned. "It’s called *creative accounting.*" The door creaked open, and a voice called out, "Hey, guys! You two look like you’re *really* into the glow sticks!" Bill froze. "Oh no." Molly grabbed his arm. "We have to move. Now." They bolted, tripping over a toy train, which sent a cascade of plastic cars crashing to the floor. The partygoers erupted in laughter, and a kid pointed at them. "You guys are *glow stick ninjas!*" Bill groaned. "We’re *not* ninjas." Molly sighed. "We’re *terrible* at this." "Yeah, but we’re *terrible* at *not* getting caught." He adjusted his sunglasses again. "Besides, we’ve got a *plan.*" Molly rolled her eyes. "Just *don’t* tell me you’ve already thought of a way to get caught." "Relax, Molly. We’re just *enjoying* the chaos." He grinned. "Now, where’s that box?"
created_at
"2026-02-20T19:24:47.401Z"
2/20/2026, 7:24:47 PM
model
"mistralai/ministral-3b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
236
moderation_latency
(null)
generation_time
2143
tokens_prompt
77
tokens_completion
676
native_tokens_prompt
69
native_tokens_completion
673
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000742
router
(null)
provider_responses
0
endpoint_id
"b9f6828f-aef9-498a-91f5-6816fbf72420"
id
"3e70011b4a9d454b835a079b5756bd84"
is_byok
false
latency
236
model_permaslug
"mistralai/ministral-3b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771615487-u9dnvtI1KTRmW8QCsFrN"
upstream_id
"3e70011b4a9d454b835a079b5756bd84"
total_cost
0.0000742
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
414
0.00%
Dialogue to Total Word Ratio
Ratio: 58.94%, Deviation: 28.94%
neededClean
false
wordsTotal
414
wordsDialogue
244
0.0000%