NC Bench
Run: 2026-02-20-01KHXQ4C0F3KE9Y9TS2Z4WBF3W
Run Details
Model: Mistral Medium 3.1
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 10.0s
Cost: $0.001548
Provider: Mistral
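As a rough cross-check on the duration and cost, the run metadata further down records `latency` (267), `generation_time` (9912) and `native_tokens_completion` (760). The sketch below derives throughput from those values. It assumes both timing fields are in milliseconds, which the page does not state; the function names are illustrative and not part of NC Bench.

```python
def tokens_per_second(completion_tokens, generation_time_ms):
    """Completion throughput, assuming generation_time is in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000.0)


def total_time_seconds(latency_ms, generation_time_ms):
    """Latency plus generation time, assuming both are in milliseconds."""
    return (latency_ms + generation_time_ms) / 1000.0


# Values copied from the run metadata on this page.
throughput = tokens_per_second(760, 9912)
elapsed = total_time_seconds(267, 9912)
print(f"{throughput:.1f} tokens/s over {elapsed:.3f} s")
```

Under that assumption, generation alone takes about 9.9 s, which is in line with the 10.0s duration reported above.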
Evaluation summary
Score | Evaluator
68.10% | Matches word count
47.62% | Dialogue to Total Word Ratio
57.8604% | Overall
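The overall figure is consistent with a plain average of the two evaluator scores. The sketch below shows that arithmetic. Treating the overall score as an unweighted mean is an inference from the numbers on this page, not a documented NC Bench formula, and `overall_score` is a hypothetical helper.

```python
def overall_score(evaluator_scores):
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    return sum(evaluator_scores) / len(evaluator_scores)


# Scores as displayed in the evaluation summary, in percent.
scores = [68.10, 47.62]
print(f"{overall_score(scores):.2f}%")  # prints 57.86%
```

The reported 57.8604% differs only in the trailing digits, presumably because the per-evaluator scores shown on the page are rounded to two decimals.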
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the *Happy Clown Pizza & Playland* sign flickered like a dying firefly, casting long, wobbly shadows across the alley where Molly and Bill crouched behind a suspiciously sticky dumpster. Molly adjusted her oversized rainbow wig—stolen from a Halloween clearance bin—and wiped a smear of face paint from her cheek. It was supposed to be a butterfly. It looked more like a crime scene. Bill, meanwhile, was struggling to zip up his rented clown costume, which was at least two sizes too small. The buttons on the polka-dot shirt strained like they were plotting an escape. "I swear to god, Molly," he hissed, "if one more kid asks me to make a balloon animal, I’m gonna *become* the balloon animal." Molly ignored him, peering through a gap in the fence at the chaos inside. A horde of sugar-fueled eight-year-olds shrieked as they hurled themselves into a ball pit the size of a swimming pool. A harried woman in a "World’s Best Mom" sash clutched a clipboard like it was the only thing keeping her from collapse. "Perfect," Molly murmured. "Total anarchy. No one’s watching the prize table." Bill craned his neck. "That’s not a prize table. That’s a *mountain* of unsupervised Lego sets and iPads." "Exactly. And in twenty minutes, it’s gonna be *our* mountain." She tossed him a walkie-talkie the size of a brick. "You distract the mom with your ‘award-winning’ magic tricks. I’ll handle the presents." Bill caught the device and groaned. "My magic tricks are *literally* just me dropping stuff." "You’re a *natural*." Molly grinned. "Now, remember—the second you hear me say ‘the piñata’s haunted,’ you pull the fire alarm." Bill’s eye twitched. "We’re stealing from *children*." "And *future* adults who will definitely grow up to be jerks," Molly said, adjusting her fake nose. "Now move. And for god’s sake, don’t let any of them hug you. That costume smells like regret and old hot dogs." Bill waddled toward the entrance, his oversized shoes squeaking with every step. 
Molly waited until he was inside before scaling the fence with the grace of a woman who had definitely done this before. The prize table was a treasure trove of parental guilt—glittery unicorn backpacks, remote-control cars, a suspiciously expensive-looking dollhouse that probably cost more than her rent. She was halfway through stuffing a shopping bag with loot when a small voice piped up behind her. "Are you the tooth fairy?" Molly froze. Slowly, she turned to see a tiny girl in a princess dress, her eyes wide with wonder. Molly’s brain short-circuited. "Uh. Yeah. *Totally.*" The girl gasped. "I lost a tooth *yesterday*!" "Great! Uh. Congrats." Molly frantically scanned the table. "Here, take this." She shoved a glow-in-the-dark fidget spinner into the girl’s hands. The kid’s face fell. "I wanted *money*." Molly’s walkie-talkie crackled. Bill’s voice, strained: "*Molly, I just set a kid’s hair on fire.*" She grabbed the girl’s shoulders. "Kid, run. *Run and never look back.*" Then she yanked the fire alarm.
created_at
"2026-02-20T14:26:46.936Z"
2/20/2026, 2:26:46 PM
model
"mistralai/mistral-medium-3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
267
moderation_latency
(null)
generation_time
9912
tokens_prompt
77
tokens_completion
806
native_tokens_prompt
69
native_tokens_completion
760
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015476
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"dbf154ed25e24d09b0161d7f4f4e932e"
is_byok
false
latency
267
model_permaslug
"mistralai/mistral-medium-3.1"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597606-u2AVmwUtMOL2Oex9bpJM"
upstream_id
"dbf154ed25e24d09b0161d7f4f4e932e"
total_cost
0.0015476
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
68.10% | Matches word count | n/a | neededClean: false, words: 514
47.62% | Dialogue to Total Word Ratio | Ratio: 33.85%, Deviation: 3.85% | neededClean: false, wordsTotal: 514, wordsDialogue: 174
57.8604% | Overall
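The ratio and deviation in the details follow directly from the recorded word counts: 174 dialogue words out of 514 total, against the 30% target from the scenario. The sketch below reproduces that arithmetic. The quote-based word counter is an assumption about how dialogue might be detected (text between straight double quotes); the page does not describe the evaluator's actual method, nor how the deviation is mapped to the 47.62% score, so neither is modelled here. All function names are hypothetical.

```python
import re


def count_words(text):
    """Total words, split on whitespace."""
    return len(text.split())


def count_dialogue_words(text):
    """Words inside straight double quotes (assumed dialogue rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in spans)


def dialogue_ratio(words_dialogue, words_total):
    """Dialogue share of the text, in percent."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_percent, target_percent=30.0):
    """Absolute distance from the target ratio, in percentage points."""
    return abs(ratio_percent - target_percent)


# Counts copied from the evaluation details on this page.
ratio = dialogue_ratio(174, 514)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation_from_target(ratio):.2f}%")
# prints: Ratio: 33.85%, Deviation: 3.85%
```

On a short sample such as `'"Perfect," Molly murmured. "Total anarchy."'`, the counter finds 3 dialogue words out of 5 total. Nested single quotes and apostrophes inside a line of dialogue do not affect it, because only straight double quotes delimit a span.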