NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPZHMA4M22FHSSPBT89EEB
Run Details
Model:
Mistral Medium 3.1
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
17.4s
Cost:
$0.001576
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
75.16%
Matches word count
(details)
95.18%
Dialogue to Total Word Ratio
(details)
85.1670%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dim glow of a flickering streetlamp cast long shadows across the alley where Molly and Bill huddled over a crumpled party invitation. Molly adjusted her oversized Santa hat, the pom-pom bobbing like a metronome counting down to disaster. “Alright, let’s run this one more time,” Molly said, tapping the invitation. “We’re crashing the Johnson twins’ eighth birthday party, dressed as—” “A clown and a magician,” Bill interrupted, tugging at the neon-green wig perched unevenly on his head. “Because nothing says ‘trust me’ like a guy who can pull rabbits out of his sleeves and another guy who looks like he just escaped from a haunted carnival.” Molly ignored him. “We blend in, we charm the parents, we ‘accidentally’ knock over the present table, and while everyone’s distracted by the chaos—” “—we grab the loot and bolt,” Bill finished, snapping his fingers. “Classic. Elegant. Almost *too* simple.” “Almost?” Molly raised an eyebrow. “You got a better idea, or are you just nervous because the last time you wore face paint, you cried when the mirror didn’t cry back?” Bill scowled. “I was *three*. And for the record, I’m not nervous. I’m just saying, have you *seen* the security at these suburban shindigs? Moms with eagle eyes, dads who think they’re in a Jason Statham movie, and at least one kid who’s definitely a narc.” Molly waved a hand. “Please. The only security is a guy in a rented Elmo costume who’s high on candy and existential dread. We’ll be fine.” Bill adjusted his fake mustache, which was already peeling at the edges. “Fine. But if we get caught, I’m telling the cops this was all your idea. And that you have a *very* suspicious interest in piñatas.” Molly grinned. “Noted. Now, remember—you’re the magician, so you’ve got to sell the act. Wave your hands, mutter some nonsense, maybe set something on fire if things get dicey.” “Set something on—? Molly, we’re in a *house*.” “Relax, it’ll be fine. Just a *small* fire. For ambiance.” Bill groaned. “You know what? Forget the presents. I’m stealing the cake. I deserve it after this.” Molly clapped him on the back. “That’s the spirit! Now, let’s go make some kids’ birthdays *unforgettable*.” Bill took a deep breath, straightened his bowtie, and muttered, “I hate my life.” Molly linked her arm through his. “No, you don’t. You love the thrill. The danger. The sheer audacity of robbing eight-year-olds blind.” “You’re right,” Bill sighed. “I *do* love the audacity. But if I have to sing ‘Happy Birthday’ one more time, I *will* burn this place down.” Molly beamed. “Now *that’s* teamwork.” She adjusted her fake nose and nodded toward the house, where the distant sound of children shrieking over sugar and games filled the air. “Showtime, partner.” Bill took one last look at the invitation, then at Molly. “You’re sure the presents are worth it?” Molly smirked. “Oh, absolutely. Rumor has it one of the gifts is a *limited edition* glow-in-the-dark fart machine.” Bill’s eyes lit up. “We’re *definitely* doing this.”
created_at
"2026-02-20T14:24:08.852Z"
2/20/2026, 2:24:08 PM
model
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
227
moderation_latency
(null)
generation_time
17306
tokens_prompt
77
tokens_completion
855
native_tokens_prompt
69
native_tokens_completion
774
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015756
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"276d466fcc2648448c7ef61e2e7f1876"
is_byok
false
latency
227
model_permaslug
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597448-vF5Pl2JDIDu9FVHS5kM2"
upstream_id
"276d466fcc2648448c7ef61e2e7f1876"
total_cost
0.0015756
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
75.16%
Matches word count
n/a
neededClean
false
words
513
95.18%
Dialogue to Total Word Ratio
Ratio: 69.01%, Deviation: 0.99%
neededClean
false
wordsTotal
513
wordsDialogue
354
85.1670%