NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPGCJDAGT1E9G055T30JEH
Run Details
Model:
Mistral Small 3.2 24B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
4.9s
Cost:
$0.000118
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.21%
Matches word count
(details)
96.64%
Dialogue to Total Word Ratio
(details)
48.4292%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
**Molly:** "Bill, I’ve got a foolproof plan. We’re hitting the Thompson kid’s birthday party tonight." **Bill:** "A children’s party? Molly, that’s low. Even for us." **Molly:** "Low? Bill, those kids won’t even notice. They’ll be too busy stuffing their faces with cake and screaming about bouncy castles." **Bill:** "And what’s the grand prize here? A bunch of plastic toys and a few gift cards?" **Molly:** "Exactly. No security. No alarms. Just a bunch of parents too drunk on Pimm’s to notice two adults lurking near the gift table." **Bill:** "You’re forgetting the piñata. That thing’s a weapon. One swing and we’re out cold." **Molly:** "We’ll go in after the piñata. By then, the kids will be too sugared up to care. We’ll just… blend in." **Bill:** "Blend in? Molly, we’re both in our thirties. The last time I blended in at a kids’ party, I got drafted into a game of musical chairs." **Molly:** "Fine. We’ll wear costumes. You be a clown. I’ll be a magician." **Bill:** "A clown? Molly, I’m allergic to face paint." **Molly:** "Then you’ll just have to sneeze your way through the heist. Come on, Bill. Think of the loot. Think of the glory." **Bill:** "Think of the therapy bills. Alright, fine. But if I have to juggle, I’m quitting." **Molly:** "Deal. Now, let’s go steal some kids’ toys like the professionals we are."
created_at
"2026-02-20T14:15:52.153Z"
2/20/2026, 2:15:52 PM
model
"mistralai/mistral-small-3.2-24b-instruct-2506"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
229
moderation_latency
(null)
generation_time
4437
tokens_prompt
77
tokens_completion
374
native_tokens_prompt
69
native_tokens_completion
370
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001179
router
(null)
provider_responses
0
endpoint_id
"23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
id
"be624d64958c488f9dd0cc01e3ad6187"
is_byok
false
latency
228
model_permaslug
"mistralai/mistral-small-3.2-24b-instruct-2506"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771596952-s9F2BadlnYtDcv0wFlg0"
upstream_id
"be624d64958c488f9dd0cc01e3ad6187"
total_cost
0.0001179
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.21%
Matches word count
n/a
neededClean
false
words
228
96.64%
Dialogue to Total Word Ratio
Ratio: 94.30%, Deviation: 4.30%
neededClean
false
wordsTotal
228
wordsDialogue
215
48.4292%