NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXQA84V5YZKEQWYJNCBS9Y2
Run Details
Model:
Mistral Small 3.2 24B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
11.2s
Cost:
$0.000223
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
68.10%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
34.0511%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain drummed against the roof of the stolen ice cream truck, a perfect disguise for their latest heist. Molly adjusted her oversized sunglasses, even though it was pouring outside, and flipped through a crumpled invitation. "Okay, so the party starts at three, but the presents are delivered at two-thirty. That gives us exactly forty-five minutes to case the joint, disable the security—" "Security?" Bill interrupted, stuffing his face with a stolen cookie. "It's a kids' party, Molly. The only security is a mom with a phone and a vague threat of 'telling Dad.'" Molly sighed. "Fine. But we still need a plan. The house is a two-story Victorian with a wraparound porch. Back door’s probably unlocked—" "Or we could just walk in the front door," Bill said, gesturing wildly. "It’s a *party*. No one questions a guy in a clown costume." Molly stared at him. "You’re not wearing a clown costume." "Yet," Bill said, pulling a red nose from his pocket. "I’ve got the whole ensemble in the back. Wig, oversized shoes, the works." Molly pinched the bridge of her nose. "We are not breaking into a children’s party dressed as clowns." "Says who?" Bill grinned. "Look, think about it. Kids love clowns. Parents love clowns. The only people who don’t love clowns are the ones who’ve seen *It* one too many times, and those people are already suspicious." Molly groaned. "We are not discussing Stephen King right now." "Fine, fine." Bill waved a hand. "But hear me out—we go in as the entertainment. I juggle, you do magic tricks, and while the kids are distracted, we swap the presents with our duplicates." Molly raised an eyebrow. "You can juggle?" "Of course I can juggle," Bill said, immediately dropping a banana and a rubber chicken. "Okay, maybe not *well*, but the kids won’t know the difference." Molly rubbed her temples. "We are not doing this. We are going in, grabbing the presents, and getting out before the cake is cut." Bill pouted. "You’re no fun." "Bill, we are professional thieves, not party crashers." "Same difference," he muttered. Molly ignored him and pulled out a blueprint of the house. "The presents will be in the living room. We go in through the back, grab the loot, and leave before the magician arrives." Bill gasped. "There’s a magician?" Molly groaned. "Focus, Bill." "Right, right." He leaned over the blueprint. "So, we go in, grab the presents, and—wait, why are we stealing kids’ presents again?" Molly shot him a look. "Because they’re expensive, and because the parents are rich enough to not notice." Bill nodded sagely. "Ah, yes. The classic 'steal from the rich, give to the poor' routine." Molly smirked. "And by 'poor,' you mean us." "Exactly." Bill grinned. "So, when do we leave?" Molly checked her watch. "Now. And Bill?" "Yeah?" "No clown costumes." Bill sighed dramatically. "You’re crushing my dreams, Molly."
created_at
"2026-02-20T14:29:59.591Z"
2/20/2026, 2:29:59 PM
model
"mistralai/mistral-small-3.2-24b-instruct-2506"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
226
moderation_latency
(null)
generation_time
11071
tokens_prompt
77
tokens_completion
775
native_tokens_prompt
69
native_tokens_completion
720
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0002229
router
(null)
provider_responses
0
endpoint_id
"23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
id
"34cd7a9561384334b91c8a763b9b0f58"
is_byok
false
latency
226
model_permaslug
"mistralai/mistral-small-3.2-24b-instruct-2506"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597799-Iqm9TpfLFfCWTuRrR2vn"
upstream_id
"34cd7a9561384334b91c8a763b9b0f58"
total_cost
0.0002229
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
68.10%
Matches word count
n/a
neededClean
false
words
486
0.00%
Dialogue to Total Word Ratio
Ratio: 71.19%, Deviation: 41.19%
neededClean
false
wordsTotal
486
wordsDialogue
346
34.0511%