NC Bench
Run: 2026-02-20-01KHXPS1VFD0T13D5N7HTV5G1K
Run Details
Model: Mistral Small 3.2 24B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 9.0s
Cost: $0.000240
Provider: Mistral
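The reported duration can be cross-checked against the `latency` and `generation_time` fields recorded in this run's metadata. The sketch below assumes both fields are in milliseconds and that the duration is roughly their sum; the page states neither, so treat this as a sanity check rather than the benchmark's own calculation.

```python
# Values copied from this run's metadata.
# Assumption: latency and generation_time are expressed in milliseconds.
latency_ms = 198
generation_time_ms = 8780
native_tokens_completion = 777

# Assumption: the displayed duration is latency plus generation time.
total_s = (latency_ms + generation_time_ms) / 1000

# Completion throughput over the generation window only.
tokens_per_s = native_tokens_completion / (generation_time_ms / 1000)

print(f"{total_s:.1f}s total, {tokens_per_s:.1f} tokens/s")
```

Under these assumptions the sum rounds to the 9.0s shown above.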
Evaluation summary
Score      Evaluator
86.38%     Matches word count
97.94%     Dialogue to Total Word Ratio
92.1583%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her oversized sunglasses and leaned against the fence of the brightly colored backyard. The children’s party was in full swing, laughter and squeals of delight filling the air. She turned to Bill, who was fidgeting with a bag of stolen balloons. “You nervous?” Molly asked, nudging him with her elbow. “Me? Nervous?” Bill scoffed. “I’m just psyching myself up. This is a big job, Molly. We’re not just stealing candy bars here. We’re talking high-value, high-stakes—” “It’s a kids’ party, Bill. The stakes are a bunch of plastic toys and a piñata full of gum.” “Exactly! That piñata’s a goldmine. You ever seen what they put in those things? It’s like a treasure chest of dental disasters.” Molly rolled her eyes. “Focus. We need a plan. How are we getting in?” Bill pulled out a crumpled invitation from his pocket. “See this? ‘All are welcome!’ That’s our ticket. We just walk in like we belong.” “And if someone asks who we are?” “We’re the birthday girl’s long-lost cousins. From… uh… Canada.” Molly groaned. “You’re terrible at this.” “No, no, it’s perfect. Canadians are trustworthy. Everyone loves Canadians.” “Fine. But what about the presents? They’re all under the tree. How do we grab them without looking suspicious?” Bill grinned. “Easy. We play the game.” “What game?” “The ‘helpful adult’ game. You know, the one where we ‘accidentally’ knock over the gift table while ‘trying to help’?” Molly sighed. “That’s the worst plan I’ve ever heard.” “It’s genius. No one suspects the helpful ones. Plus, chaos is our friend. The more kids are running around, the less they’ll notice a couple of ‘aunts’ making off with their loot.” Molly crossed her arms. “And what if the parents ask why we’re taking the presents?” “We’re ‘organizing a surprise gift exchange’ for the kids. It’s a thing. I read it in a parenting blog.” “You read parenting blogs?” “Research, Molly. Always research.” A loud crash echoed from the backyard as a child toppled over a stack of cups. 
The adults nearby laughed it off. Molly smirked. “Okay, maybe your plan isn’t terrible. But we still need a distraction.” Bill pulled out a whoopee cushion from his pocket. “Already on it.” “You’re bringing a whoopee cushion to a kids’ party?” “It’s a classic. Works every time.” Molly shook her head. “You’re ridiculous.” “And you love me for it.” She smirked. “Let’s just get this over with before I change my mind.” Bill clapped his hands together. “That’s the spirit! Now, remember—confidence is key. We’re not thieves. We’re… gift redistributors.” Molly groaned again. “You’re going to get us caught.” “Nah. Worst case, we get a free cupcake.” “That’s not reassuring.” Bill adjusted his fake mustache and grinned. “Relax. We’ve got this. Now, let’s go steal some toys like the professionals we are.” Molly took a deep breath. “Professionals don’t use whoopee cushions.” “Professionals adapt. Now, come on. The presents aren’t going to steal themselves.” With a resigned sigh, Molly followed him toward the party, already bracing for the inevitable disaster.
created_at: "2026-02-20T14:20:36.092Z" (2/20/2026, 2:20:36 PM)
model: "mistralai/mistral-small-3.2-24b-instruct-2506"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 198
moderation_latency: (null)
generation_time: 8780
tokens_prompt: 77
tokens_completion: 920
native_tokens_prompt: 69
native_tokens_completion: 777
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00024
router: (null)
provider_responses:
  0:
    endpoint_id: "23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
    id: "cef90b5871b140449d5638d5f4aee95e"
    is_byok: false
    latency: 198
    model_permaslug: "mistralai/mistral-small-3.2-24b-instruct-2506"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771597236-8dF64kHqk4JYaOt5993t"
upstream_id: "cef90b5871b140449d5638d5f4aee95e"
total_cost: 0.00024
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Result     Evaluator                      Details                           Meta Data
86.38%     Matches word count             n/a                               neededClean: false, words: 511
97.94%     Dialogue to Total Word Ratio   Ratio: 70.65%, Deviation: 0.65%   neededClean: false, wordsTotal: 511, wordsDialogue: 361
92.1583%   Overall
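The ratio and deviation figures can be reproduced from the reported word counts. The sketch below also shows one plausible way to count dialogue words in a scene. The helper names `count_words` and `dialogue_ratio` are hypothetical, and the rules they encode (dialogue is text inside curly double quotes, a word is a whitespace-separated token) are assumptions; the benchmark's actual counting and scoring rules are not shown on this page. The overall score is treated here as an unweighted mean of the two evaluator scores, which is likewise an assumption.

```python
import re

# Assumption: dialogue is any text enclosed in curly double quotes,
# and a "word" is a whitespace-separated token.
QUOTED = re.compile(r"“([^”]*)”")


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Fraction of words that fall inside curly double quotes."""
    total = count_words(text)
    if total == 0:
        return 0.0
    spoken = sum(count_words(span) for span in QUOTED.findall(text))
    return spoken / total


# Values reported in the evaluation details.
words_total = 511
words_dialogue = 361
target_pct = 70.0  # from the scenario: 70% dialogue

ratio_pct = 100 * words_dialogue / words_total
deviation_pct = abs(ratio_pct - target_pct)

# Assumption: the overall score is the unweighted mean of the two scores.
overall_estimate = (86.38 + 97.94) / 2

print(f"ratio={ratio_pct:.2f}% deviation={deviation_pct:.2f}% mean={overall_estimate:.2f}%")
```

The computed ratio and deviation round to the reported 70.65% and 0.65%. The mean of the two rounded scores comes to 92.16%, close to the reported 92.1583%; the small gap is consistent with the page averaging unrounded scores, though that is not confirmed here.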