NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXQ2H7QVHVWC2N11Y8W803N
Run Details
Model:
Mistral Small 3.2 24B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
15.6s
Cost:
$0.000111
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.84%
Dialogue to Total Word Ratio
(details)
0.4369%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her oversized sunglasses and leaned against the fence of the brightly decorated backyard. "You sure this is the right house, Bill? I don’t see any security cameras, but I *do* see a piñata shaped like a unicorn. That’s either a red flag or a personal vendetta." Bill scratched his chin, eyeing the streamers. "Relax, it’s the perfect target. Kids are too busy stuffing their faces with cupcakes to notice us. And parents? They’re already three mimosas deep. We’ll be in and out before they even realize the presents are gone." Molly crossed her arms. "And what’s our cover? We can’t just waltz in wearing black turtlenecks and calling ourselves ‘the present fairies.’" Bill grinned. "We’re the entertainment. I’ve got a magic trick up my sleeve—literally. Watch this." He pulled out a deck of cards and made a coin disappear. "Ta-da! Now, who’s gonna suspect the magician?" Molly sighed. "You’re terrible at magic. That coin’s in your other pocket." Bill patted his pocket. "…Right. Plan B: we crash the party as the birthday kid’s long-lost cousins from Canada." Molly groaned. "You don’t even have an accent." "Neither do you," Bill shot back. Molly smirked. "Fair. But if we get caught, I’m blaming you. And I’m *really* bad at blaming people." Bill adjusted his hat. "Deal. Now, let’s go steal some toys before the cake gets cut."
created_at
"2026-02-20T14:25:46.753Z"
2/20/2026, 2:25:46 PM
model
"mistralai/mistral-small-3.2-24b-instruct-2506"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
289
moderation_latency
(null)
generation_time
15479
tokens_prompt
77
tokens_completion
370
native_tokens_prompt
69
native_tokens_completion
348
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001113
router
(null)
provider_responses
0
endpoint_id
"23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
id
"f446611a770f46dd85efe3a8147a17c0"
is_byok
false
latency
289
model_permaslug
"mistralai/mistral-small-3.2-24b-instruct-2506"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597546-ewZZ9btdpw7WKuKsJtUh"
upstream_id
"f446611a770f46dd85efe3a8147a17c0"
total_cost
0.0001113
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
230
0.84%
Dialogue to Total Word Ratio
Ratio: 75.22%, Deviation: 14.78%
neededClean
false
wordsTotal
230
wordsDialogue
173
0.4369%