NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPTNDPMC2FN82DPC9XFX3J
Run Details
Model:
Mistral Small 3.2 24B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
8.0s
Cost:
$0.000110
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
82.76%
Dialogue to Total Word Ratio
(details)
41.3786%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her oversized sunglasses and leaned against the fence of the brightly decorated backyard. "Bill, are you sure this is the right house? The last time we hit a kids' party, we ended up with a room full of plastic dinosaurs and a very angry unicorn piñata." Bill scratched his chin, eyeing the streamers. "Relax, Molly. This one’s different. The kid’s parents are loaded. Presents will be top-tier—no stuffed animals, just gadgets and cash. And look at that security. Zero. Just a bunch of parents too busy pretending they like juice boxes." Molly squinted at the crowd. "You call that zero security? There’s a clown." "Exactly. Clowns are the ultimate distraction. No one suspects the clown. Except maybe the kids, but they’re too busy screaming to notice us." Molly sighed. "Fine. But if we get caught, I’m blaming you. And I’m telling them you’re the one who ate all the cupcakes." Bill grinned. "Deal. Now, here’s the plan: you create a diversion—maybe fake a choking hazard with a Lego—and I’ll slip in through the gift table. We’ll meet at the van in five." Molly groaned. "You want me to fake a choking hazard at a kids’ party? That’s dark, even for us." Bill shrugged. "Hey, desperate times. And desperate thieves." Molly rolled her eyes. "You’re lucky I like you, Bill." "Love you too, partner. Now let’s go steal some toys."
created_at
"2026-02-20T14:21:28.9Z"
2/20/2026, 2:21:28 PM
model
"mistralai/mistral-small-3.2-24b-instruct-2506"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
236
moderation_latency
(null)
generation_time
7829
tokens_prompt
77
tokens_completion
366
native_tokens_prompt
69
native_tokens_completion
343
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001098
router
(null)
provider_responses
0
endpoint_id
"23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
id
"6945d5df01fc4720993837fcb0bb5a09"
is_byok
false
latency
232
model_permaslug
"mistralai/mistral-small-3.2-24b-instruct-2506"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597288-IPGEplCeaQfIvk9qz3e9"
upstream_id
"6945d5df01fc4720993837fcb0bb5a09"
total_cost
0.0001098
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
235
82.76%
Dialogue to Total Word Ratio
Ratio: 83.40%, Deviation: 6.60%
neededClean
false
wordsTotal
235
wordsDialogue
196
41.3786%