NC Bench
Run: 2026-02-20-01KHY9NMKXQN0XA4S8NYJ2J2MP
Run Details
Model: Ministral 3 8B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3.3s
Cost: $0.000103
Provider: Mistral
Evaluation summary
Score | Evaluator
0.00% | Matches word count
97.63% | Dialogue to Total Word Ratio
48.8134% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering "Happy Birthday!" sign cast jagged shadows across the backyard as Molly adjusted the strap of her duffel bag, her breath fogging in the crisp autumn air. Bill leaned against the fence, twirling a candy cane between his fingers like a conductor’s baton. "You sure about this?" Bill asked, his voice dripping with skepticism. "Last time we hit a kid’s party, we ended up in a sling shot." Molly shot him a look that could curdle milk. "That was *one* time, and it was because you insisted on using the pool noodles as a distraction. This time, we’re professionals. We’ve got the social engineering covered." Bill smirked. "Oh yeah? How’s that?" "Simple. I’m the birthday girl’s *aunt*—well, *cousin*—who ‘forgot’ to bring the cake. You’re my ‘very confused’ boyfriend who ‘accidentally’ brought the wrong present. Classic." Bill groaned. "I *hate* when you use the word ‘classic.’ It’s like you’re admitting this is a bad idea." Molly rolled her eyes. "Relax. The parents will be too busy making sure the kids don’t eat the balloons to notice us ‘accidentally’ pocketing the entire gift pile." Bill crossed his arms. "And what, exactly, are we supposed to do with all these toys? I mean, I *love* a good action figure, but we can’t exactly resell a ‘My Little Pony: Sparkle Surprise Surprise!’ to the mob." Molly grinned. "We’re not selling them. We’re *donating* them. To a shelter. Or an orphanage. Or—" "—or we just leave them in a dumpster behind a Walmart and call it a day," Bill finished, deadpan. Molly waved a hand. "Details. The point is, we’re not hoarding them. We’re *redistributing* wealth. Like Robin Hood, but with more glitter." Bill sighed. "You’re impossible. You know that?" "Yep. And you love me anyway." She nudged him with her elbow. "Now, are you in or what? Because if you’re not, I’m totally stealing all the presents myself and leaving you to explain to the cops why you were lurking near a kid’s birthday party in a fake mustache." 
Bill chuckled, rubbing his chin. "Fine. But if we get caught, I’m blaming you for the ‘very confused boyfriend’ bit." Molly grinned. "Deal. Now let’s go turn this party into our personal Santa’s workshop." She grabbed his arm and pulled him toward the back gate. "And Bill?" "Yeah?" "Try not to trip over the dog this time." Bill groaned. "I *hate* that dog."
created_at: "2026-02-20T19:50:47.175Z" (2/20/2026, 7:50:47 PM)
model: "mistralai/ministral-8b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 204
moderation_latency: (null)
generation_time: 2959
tokens_prompt: 77
tokens_completion: 648
native_tokens_prompt: 69
native_tokens_completion: 615
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0001026
router: (null)
provider_responses:
  0:
    endpoint_id: "c85d26e1-1e0c-449b-9775-2afce7ae510b"
    id: "540dbee0748844de86a1ec1da7151803"
    is_byok: false
    latency: 204
    model_permaslug: "mistralai/ministral-8b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771617047-jZCGI9M9rjA6v34iNci7"
upstream_id: "540dbee0748844de86a1ec1da7151803"
total_cost: 0.0001026
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 404
97.63% | Dialogue to Total Word Ratio | Ratio: 69.31%, Deviation: 0.69% | neededClean: false, wordsTotal: 404, wordsDialogue: 280
48.8134% | Overall
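
The reported ratio and deviation can be reproduced from the word counts listed in the evaluation details. The following is a minimal sketch, not NC Bench's evaluator: the function name `dialogue_ratio` and the constant `TARGET_PERCENT` are hypothetical, and the 70% target is taken from the scenario text. How the evaluator turns the deviation into the 97.63% score is not stated on this page, so that step is not modeled.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


# Assumption: the target comes from the scenario "Write 500 words with 70% dialogue".
TARGET_PERCENT = 70.0

# Counts reported in the evaluation details (wordsDialogue, wordsTotal).
ratio = dialogue_ratio(280, 404)
deviation = abs(TARGET_PERCENT - ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 69.31%, Deviation: 0.69%"
```

The overall figure of 48.8134% is close to the plain mean of the two evaluator scores (0.00% and 97.63% average to 48.815% using the rounded values shown here). This suggests, but does not confirm, that the overall score is an unweighted average computed from unrounded scores.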