NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPHSJ1GB67Z84HA1JTPQE6
Run Details
Model:
Mistral Small 3.2 24B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
8.6s
Cost:
$0.000153
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
55.84%
Dialogue to Total Word Ratio
(details)
27.9185%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her oversized sunglasses and leaned against the fence outside the brightly decorated backyard. "You ever notice how kids' parties are the easiest marks? No security, no alarms, just a bunch of parents too busy stuffing their faces with cupcakes to notice a couple of professionals at work." Bill scratched his chin, eyeing the inflatable castle bouncy thing. "Yeah, but have you seen the competition? Those little monsters are faster than greased lightning. One wrong move and we’re outed by a five-year-old with a juice box." Molly smirked. "Please. We’ve stolen from museums, casinos, even a freaking zoo. A few toddlers aren’t gonna stop us. Besides, the presents are right there, under the tree. No safes, no guards, just a big red bow screaming *take me*." Bill sighed. "Okay, but we need a plan. We can’t just waltz in like we’re here for the cake. Parents are suspicious." Molly pulled out a crumpled flyer. "Easy. We’re the clowns. Look, they’re short-staffed. One of the clowns called in sick. We fill in, do a few balloon animals, and when the coast is clear, we swap the presents with our dupes." Bill blinked. "You want us to *clown*?" "Relax, it’s just face paint and a wig. You’ll be fine. Worst case, you trip over your own feet and get booed by a kindergartener. Big deal." Bill groaned. "I’d rather rob a bank." Molly patted his shoulder. "Too late. We’re committed. Now, let’s go get our big shoes on." They slipped into the back of the party, where a harried-looking party planner handed them oversized noses and floppy hats. Molly grinned. "See? Piece of cake." Bill adjusted his wig and muttered, "I hate you." Molly laughed. "No, you don’t. You love the thrill. Now, let’s go make some kids cry with our terrible magic tricks." Bill groaned again but followed her toward the crowd of giggling children. "This is a terrible idea." Molly winked. "That’s why it’s gonna work."
created_at
"2026-02-20T14:16:38.222Z"
2/20/2026, 2:16:38 PM
model
"mistralai/mistral-small-3.2-24b-instruct-2506"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
222
moderation_latency
(null)
generation_time
8399
tokens_prompt
77
tokens_completion
513
native_tokens_prompt
69
native_tokens_completion
487
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000153
router
(null)
provider_responses
0
endpoint_id
"23fd6b91-74d8-45fc-ac00-cf141ec7f4a5"
id
"d8bf0b341e6d44cbb7a5cd9b7a24f5d6"
is_byok
false
latency
222
model_permaslug
"mistralai/mistral-small-3.2-24b-instruct-2506"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771596998-rtRh0Ltbh2FQXAp5la18"
upstream_id
"d8bf0b341e6d44cbb7a5cd9b7a24f5d6"
total_cost
0.000153
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
331
55.84%
Dialogue to Total Word Ratio
Ratio: 73.41%, Deviation: 3.41%
neededClean
false
wordsTotal
331
wordsDialogue
243
27.9185%