NC Bench
Run: 2026-02-20-01KHXPMKJKDG5CCXX2V31M15AJ
Run Details
Model: Mistral Medium 3.1
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 31.5s
Cost: $0.001666
Provider: Mistral
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 99.92% | Matches word count |
| 0.03% | Dialogue to Total Word Ratio |
| 49.9730% | Overall |
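The page does not say how the Overall figure is derived from the two evaluator scores. The sketch below checks one plausible reading: that Overall is the unweighted mean of the evaluator scores. This is an assumption, not a documented rule, and the helper name `mean_score` is hypothetical. The displayed scores are rounded, so a small gap against the reported value is expected.

```python
# Displayed (rounded) evaluator scores from the summary, in percent.
scores = {
    "Matches word count": 99.92,
    "Dialogue to Total Word Ratio": 0.03,
}

# Overall value as reported on the page, in percent.
reported_overall = 49.9730


def mean_score(values):
    """Unweighted mean. ASSUMPTION: the page does not confirm this is
    how the Overall score is actually computed."""
    values = list(values)
    return sum(values) / len(values)


overall = mean_score(scores.values())
print(f"mean of displayed scores: {overall:.4f}%")
print(f"reported overall:         {reported_overall:.4f}%")
```

The mean of the displayed values lands within 0.01 percentage points of the reported Overall, which is consistent with averaging the unrounded scores, but does not prove it.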
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the *Dazzle & Delight Party Emporium* sign flickered like a dying firefly as Molly adjusted her oversized clown nose in the rearview mirror. Beside her, Bill fiddled with a party popper, accidentally setting it off with a *BANG!* that sent glitter raining down onto the dashboard. “You’re a menace,” Molly said, brushing sparkles off her black turtleneck. “We’re supposed to be *inconspicuous*, not look like we just robbed a unicorn’s birthday.” Bill grinned, his teeth smudged with what might’ve been blue frosting—or possibly evidence. “Relax. Kids love glitter. We’ll blend right in.” “Kids love *everything*. That’s why we’re here.” Molly tapped the blueprint of the party venue, a sprawling mansion rented out for little Timothy Whitmore’s *Pirate Extravaganza*. “Security’s a joke. One rent-a-cop, a nanny who’s definitely napping in the pantry, and a dad who’s three mimosas deep by now.” “What’s the play?” Bill cracked his knuckles, then winced—he’d superglued a fake ruby ring to his finger earlier for “practice.” “We go in as entertainers. You’re *Bubbles the Clown*—” “Finally, my big break.” “—and I’m *Magic Molly*, the world’s most unconvincing magician. We ‘perform,’ distract the adults, and while I’m pulling a rabbit out of my hat, you’re pulling iPads out of the gift table.” Bill squinted at the invite. “Wait, the kid’s *nine*. Who gives a nine-year-old an iPad?” “Rich people.” Molly tossed him a bag of balloons. “Inflate these. And for God’s sake, don’t make any *actual* animal shapes. You did that last time, and now there’s a giraffe floating over I-95 with your fingerprints on it.” Bill blew into a balloon, his cheeks puffing like a chipmunk’s. “I *told* you, that was an accident. Also, giraffes *can’t* fly. That’s slander.” Molly ignored him, adjusting the fake mustache she’d glued to her upper lip. “Timing’s key. The piñata drops at 3:15. That’s our cue. Chaos, screaming, free candy—perfect cover.” “What if the piñata’s a dud? 
Like, what if it’s just, I dunno, *one* Snickers and a bunch of coupons?” “Then you *make* chaos. Trip a kid. Start a rumor about cooties. I don’t care. Just get those presents into the van.” Bill nodded, then paused. “Hey, what’s our exit strategy?” “We walk out the front door.” “With a van full of stolen goods?” “While wearing clown makeup and carrying a piñata stick. Who’s gonna stop us?” Molly smirked. “Worst case, we tell ‘em we’re taking the leftovers to an *orphanage*.” Bill’s eyes widened. “That’s evil.” “That’s *efficient*.” She checked her watch. “Showtime. Remember—no talking to the kids. You start telling knock-knock jokes, we’re both going to jail.” “Too late.” Bill grinned as he hopped out of the car, a balloon sword in hand. “Hey, Molly, why did the thief bring a ladder to the party?” Molly groaned. “Why?” “Because he heard the *gifts were out of this world*!” She considered shoving him into traffic. Instead, she adjusted her top hat and sighed. “Let’s go rob some rich kids.”
created_at
"2026-02-20T14:18:10.399Z"
2/20/2026, 2:18:10 PM
model
"mistralai/mistral-medium-3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5090
moderation_latency
(null)
generation_time
31451
tokens_prompt
77
tokens_completion
871
native_tokens_prompt
69
native_tokens_completion
819
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0016656
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"c46f0bfad1f9480b990c40242326824a"
is_byok
false
latency
5090
model_permaslug
"mistralai/mistral-medium-3.1"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597090-cuxBg767PAUaWGbzjsbY"
upstream_id
"c46f0bfad1f9480b990c40242326824a"
total_cost
0.0016656
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 99.92% | Matches word count | n/a | neededClean: false; words: 503 |
| 0.03% | Dialogue to Total Word Ratio | Ratio: 62.82%, Deviation: 12.82% | neededClean: false; wordsTotal: 503; wordsDialogue: 316 |
| 49.9730% | Overall | | |
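The ratio and deviation in the details follow directly from the reported word counts. A minimal sketch of that arithmetic, assuming the target ratio is the 50% named in the scenario and that deviation means the absolute difference from that target:

```python
# Word counts reported in the evaluator metadata.
words_total = 503
words_dialogue = 316

# Target taken from the scenario text ("50% dialogue").
target_ratio = 0.50

# Share of words that fall inside dialogue.
ratio = words_dialogue / words_total

# ASSUMPTION: deviation is the absolute distance from the target ratio.
deviation = abs(ratio - target_ratio)

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 62.82%, Deviation: 12.82%"
```

How the 12.82-point deviation is mapped to the 0.03% score is not shown on this page, so that step is left out.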