NC Bench
Run: 2026-02-20-01KHXPKGNN1QY096MJ0B515ER6
Run Details
Model: Mistral Small 3.2 24B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 5.8s
Cost: $0.000214
Provider: Mistral
Evaluation summary
Score | Evaluator
0.49% | Matches word count
52.02% | Dialogue to Total Word Ratio
26.2573% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain drummed against the roof of the stolen ice cream truck as Molly adjusted her stolen party hat, the glittery pink monstrosity clashing violently with her leather jacket. She turned to Bill, who was currently attempting to fold a map of the party venue into a paper airplane. “You’re not actually going to use that, are you?” Molly asked, snatching the map from his hands. Bill shrugged. “It’s a multitasking tool. Navigation and entertainment.” “This isn’t a game, Bill. We’re about to rob a bunch of five-year-olds.” “Technically, we’re robbing their parents. The kids are just collateral damage.” Molly sighed. “You’re impossible.” Bill grinned. “And yet, here you are, stuck with me.” The party was in full swing at the community center, balloons bobbing in the breeze, a bouncy castle defying gravity in the corner, and a table groaning under the weight of presents. Molly and Bill had spent weeks casing the place, timing the security guard’s coffee breaks and memorizing the layout. “Alright,” Molly said, checking her watch. “The guard’s on his third espresso. We’ve got twenty minutes before he’s back from the bathroom.” Bill adjusted his stolen clown nose. “You sure this disguise is necessary?” “Yes. It’s called blending in.” “Blending in is for people who don’t want to be noticed.” Molly rolled her eyes. “Just stick to the plan. We go in, grab the presents, and get out before the cake is cut.” Bill saluted. “Aye, aye, captain.” They slipped inside, the door creaking slightly. The noise was drowned out by the cacophony of children’s laughter and the DJ’s questionable rendition of “Baby Shark.” Molly made a beeline for the present table, her gloved hands already reaching out. “Wait,” Bill hissed, grabbing her arm. “We need a distraction.” Molly groaned. “What now?” Bill pointed to a nearby piñata. “That.” Before Molly could protest, Bill had grabbed a stick and was whacking the piñata with reckless abandon. Candy rained down, and the children shrieked with delight. 
Molly seized the moment, stuffing presents into her bag with practiced efficiency. “Hurry up,” she muttered. “The guard’s going to be back any second.” Bill, now covered in glitter and half-eaten lollipops, grinned. “I’m multitasking again.” Molly yanked him toward the exit. “We’re leaving. Now.” They burst out the door just as the security guard rounded the corner, his coffee forgotten in his hand. Molly and Bill sprinted to the ice cream truck, their loot jostling in the bag. “That was too close,” Molly panted, slamming the door shut. Bill wiped glitter from his face. “But we got the goods.” Molly glared at him. “Next time, we’re robbing a bank.” Bill laughed. “Where’s the fun in that?” Molly shook her head, but she couldn’t help smiling. Somewhere in the distance, a child wailed about a missing present. The heist was complete.
created_at: "2026-02-20T14:17:34.655Z" (2/20/2026, 2:17:34 PM)
model: "mistralai/mistral-small-3.2-24b-instruct-2506"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 197
moderation_latency: (null)
generation_time: 5534
tokens_prompt: 77
tokens_completion: 813
native_tokens_prompt: 69
native_tokens_completion: 690
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0002139
router: (null)
provider_responses:
  0:
    endpoint_id: "23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
    id: "22e1bb49584d477ba00aed8d36f28dcb"
    is_byok: false
    latency: 197
    model_permaslug: "mistralai/mistral-small-3.2-24b-instruct-2506"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771597054-zjC1uoV2YMZ5CJrvqQpk"
upstream_id: "22e1bb49584d477ba00aed8d36f28dcb"
total_cost: 0.0002139
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.49% | Matches word count | n/a | neededClean: false, words: 473
52.02% | Dialogue to Total Word Ratio | Ratio: 33.62%, Deviation: 3.62% | neededClean: false, wordsTotal: 473, wordsDialogue: 159
26.2573% | Overall | |
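
The Ratio and Deviation figures reported for the dialogue evaluator can be reproduced from the wordsTotal and wordsDialogue counts. The sketch below is illustrative only and is not the benchmark's own evaluator: the function names are hypothetical, the 30% target is taken from the scenario text, and the quote-based counter assumes dialogue is wrapped in curly double quotes, which may not match how NC Bench counts dialogue words. How the deviation is converted into the 52.02% score is not stated on this page, so that step is left out.

```python
import re

# Target share of dialogue, taken from the scenario ("30% dialogue").
TARGET_RATIO = 0.30


def count_dialogue_words(text: str) -> int:
    """Count whitespace-separated words inside curly double quotes.

    Assumption: dialogue is delimited by “ and ”. Quoted titles would be
    counted as dialogue too, so this is only a rough approximation.
    """
    spans = re.findall(r"“([^”]*)”", text)
    return sum(len(span.split()) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts reported in the evaluation details for this run.
ratio, deviation = ratio_and_deviation(words_dialogue=159, words_total=473)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 33.62%, Deviation: 3.62%"

# Small usage example for the hypothetical quote-based counter.
sample = "“Hurry up,” she muttered. “Now.”"
print(count_dialogue_words(sample), "of", len(sample.split()), "words are dialogue")
```

The Overall figure of 26.2573% is close to the plain average of the two displayed evaluator scores, (0.49 + 52.02) / 2 = 26.255, which suggests an unweighted mean over unrounded scores, though the page does not confirm this.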