NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYB7F83HXN9YPFSQYWKY94J
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
4.0s
Cost:
$0.000115
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
14.30%
Matches word count
(details)
19.71%
Dialogue to Total Word Ratio
(details)
17.0047%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering "Happy Birthday!" balloon cast long shadows across the backyard as Molly adjusted the strap of her satchel, her fingers twitching with the kind of restless energy that only came from plotting the perfect heist. Bill leaned against the fence, kicking at a loose brick with the toe of his scuffed boot, a half-eaten candy bar wrapper dangling from his mouth. "You ever think about how ridiculous it is that we’re about to rob a *children’s party*?" Bill asked, wiping his mouth with the back of his hand. "I mean, where’s the challenge? These kids ain’t got no security, no alarms—just a bunch of little monsters with too much sugar and not enough sense." Molly smirked, pulling a pair of kid-sized gloves from her bag and tossing them to him. "Oh, come on, Bill. You love a good soft target. And let’s be real—these parents are *begging* for us to take their stuff. They’re out there laughing, handing over their credit cards like it’s a game of musical chairs. ‘Here, take this $200 LEGO set, we don’t even know what it is!’" Bill caught the gloves, rolling them between his palms. "Yeah, but what if the kid *remembers* us? What if they grow up and become some kind of vigilante or something? ‘Remember that time those weirdos stole my Nerf gun? That’s when I knew I had to save the world!’" Molly waved a hand dismissively. "Kids forget faster than a magician’s trick. Besides, we’re not stealing *from* them—we’re stealing *for* them. Ever think about it that way? These presents? They’re just sitting there, waiting to be *liberated*. Like a buffet, but for toys." Bill groaned, kicking the brick again. "You’re terrible at sales. ‘Hey, kid, your new bike’s not really yours—it’s *ours* now.’" "Watch me." Molly pulled a small, stuffed unicorn from her bag and held it up. "See this? This is *art*. This is a statement. We’re not just thieves—we’re *cultural revolutionaries*. We’re teaching these kids that the world’s not fair, and if they want something, they gotta take it." Bill snatched the unicorn, examining it like it might bite him. "You’re gonna get us arrested." "Nah." Molly grinned. "We’re gonna get *awarded*. Think about it—we’re the Robin Hoods of toy theft. The *Toy Story* heist crew. We’ll be legends." Bill sighed, tossing the unicorn back. "You’re insufferable. But fine. When do we do this?" Molly checked her watch, then her phone, then her watch again. "Now. The parents are all drunk on punch, the kids are hyper, and the cake’s still in the kitchen. That’s our window." Bill straightened, suddenly serious. "Alright. But if we get caught, I’m blaming you." Molly saluted him with a tiny plastic shovel from her bag. "Deal. Now let’s go make some kids’ dreams come true—*the hard way*."
created_at
"2026-02-20T20:18:00.076Z"
2/20/2026, 8:18:00 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
242
moderation_latency
(null)
generation_time
3489
tokens_prompt
77
tokens_completion
740
native_tokens_prompt
69
native_tokens_completion
700
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00011535
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"c13b0fd38dbd4ff789be5cc97556f06e"
is_byok
false
latency
235
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618680-ETcUVnou5thZV8SuvKzC"
upstream_id
"c13b0fd38dbd4ff789be5cc97556f06e"
total_cost
0.00011535
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
14.30%
Matches word count
n/a
neededClean
false
words
479
19.71%
Dialogue to Total Word Ratio
Ratio: 64.30%, Deviation: 5.70%
neededClean
false
wordsTotal
479
wordsDialogue
308
17.0047%