NC Bench
Run: 2026-02-20-01KHXPH9F8NQ7XVGXPXCPD5E4S
Run Details
Model: Mistral Large 3
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 17.2s
Cost: $0.001267
Provider: Mistral
Evaluation summary
Score | Evaluator | Details Link
0.01% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
0.0049% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the glittery unicorn horn strapped to her forehead and squinted at the inflatable bounce house sagging under the weight of a dozen sugar-crazed five-year-olds. "You’re sure this is the target? Because I’m pretty sure we just robbed a daycare by accident." Bill, resplendent in a dinosaur onesie that made him look like a rejected *Jurassic Park* extra, scratched his chin. "Positive. Intel says the birthday kid’s dad is some tech bro who ‘disrupted’ the juice box industry. That’s where the real money is—organic, locally sourced, gluten-free loot." Molly groaned. "We’re stealing from a child’s birthday party to fund our *artisanal* crime spree. I feel like we’ve hit rock bottom." "Rock bottom has a minibar, Molly. And a bounce house." Bill nudged her toward the gift table, which was groaning under a mountain of beribboned boxes. "Look at this haul. That’s not wrapping paper—that’s *investment capital*." Molly plucked a gift bag from the pile and peeked inside. "Oh, for—this is a *subscription box* for ‘Mindful Mini-Moguls.’ It’s a coloring book and a coupon for a $200 ‘entrepreneurial mindset’ seminar." Bill’s eyes gleamed. "Even better. We’ll fence the seminar tickets to desperate parents. Boom. Secondary market." Molly tossed the bag back onto the table. "We’re monsters." "Monsters with *vision*," Bill corrected. "Now, here’s the plan: I distract the parents with my *legendary* balloon animal skills—" "You once made a balloon dog that looked like a sad sausage." "—and you swoop in, grab the gifts, and we meet at the van. Easy. No one suspects the guy in the T-Rex costume." Molly eyed the harried-looking moms sipping rosé from plastic cups. "What if they *do* suspect us?" Bill waved a dismissive hand. "Then we lean into the bit. ‘Oh, we’re just the *entertainment*—part of the *experience*.’ Works every time." Molly sighed. "Fine. But if I get arrested, I’m telling the cops you made me wear this unicorn horn." Bill grinned. "Deal. 
Now go—before the piñata gets hit. Last thing we need is a bunch of kids crying over spilled candy *and* stolen presents." Molly crept toward the gift table, dodging a rogue toddler wielding a foam sword. She was halfway through stuffing a particularly large box into her oversized tote when a tiny voice piped up behind her. "Hey. That’s *my* present." Molly froze. A small boy in a superhero cape stood there, arms crossed, looking suspiciously like a tiny, vengeful accountant. Bill, mid-balloon giraffe, glanced over and hissed, "Abort! Abort!" Molly pasted on a smile. "Oh, sweetie, this isn’t a present—it’s a *decoy*! The *real* present is… uh…" She scanned the table. "That one! The *big* one!" The kid’s eyes narrowed. "That’s just a box of diapers." Bill’s balloon giraffe deflated with a sad *pfffft*. Molly grabbed his arm. "Run." They bolted for the exit, dodging a rogue game of musical chairs and a very confused clown. As they tumbled into the getaway van, Bill gasped, "We’re never doing this again." Molly peeled off the unicorn horn and tossed it into the backseat. "Next time, we rob a bank. At least the tellers don’t judge you for stealing."
created_at
"2026-02-20T14:16:21.749Z"
2/20/2026, 2:16:21 PM
model
"mistralai/mistral-large-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
396
moderation_latency
(null)
generation_time
17090
tokens_prompt
77
tokens_completion
858
native_tokens_prompt
69
native_tokens_completion
822
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0012675
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"e9e758219eb042db987cd84a01d2e179"
is_byok
false
latency
396
model_permaslug
"mistralai/mistral-large-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771596981-H6nWds9nGbDgcOmJjE50"
upstream_id
"e9e758219eb042db987cd84a01d2e179"
total_cost
0.0012675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
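The token counts, generation time, and total cost above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the billing logic used by the provider: the per-million-token rates are assumptions that do not appear anywhere in the run record, chosen only because they reproduce the recorded total_cost, and the helper names are hypothetical.

```python
# Values copied from the run metadata above.
NATIVE_TOKENS_PROMPT = 69
NATIVE_TOKENS_COMPLETION = 822
GENERATION_TIME_MS = 17090
RECORDED_TOTAL_COST = 0.0012675

# ASSUMPTION: these rates are not in the run record. They are placeholder
# values (USD per million tokens) that happen to reproduce total_cost.
ASSUMED_INPUT_USD_PER_M = 0.50
ASSUMED_OUTPUT_USD_PER_M = 1.50


def estimate_cost(prompt_tokens, completion_tokens, in_rate, out_rate):
    """Estimate cost in USD from token counts and per-million-token rates."""
    return (prompt_tokens * in_rate + completion_tokens * out_rate) / 1_000_000


def tokens_per_second(completion_tokens, generation_ms):
    """Completion throughput over the reported generation time."""
    return completion_tokens / (generation_ms / 1000)


cost = estimate_cost(
    NATIVE_TOKENS_PROMPT,
    NATIVE_TOKENS_COMPLETION,
    ASSUMED_INPUT_USD_PER_M,
    ASSUMED_OUTPUT_USD_PER_M,
)
throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)

print(f"estimated cost: ${cost:.7f} (recorded: ${RECORDED_TOTAL_COST:.7f})")
print(f"throughput: {throughput:.1f} completion tokens/s")
```

Under these assumed rates the estimate agrees with the recorded cost, and the completion runs at roughly 48 native tokens per second.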
Evaluation details
Result | Evaluator | Details | Meta Data
0.01% | Matches word count | n/a | neededClean: false; words: 531
0.00% | Dialogue to Total Word Ratio | Ratio: 51.22%, Deviation: 18.78% | neededClean: false; wordsTotal: 531; wordsDialogue: 272
0.0049% | Overall | |
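The reported ratio and deviation follow directly from the word counts in the metadata: 272 dialogue words out of 531 total, measured against the 70% target in the prompt. The sketch below shows one way such a check could be computed. It is an illustration under stated assumptions, not the evaluator's actual code: the function names are hypothetical, dialogue is assumed to be any text between straight double quotes, and words are assumed to be whitespace-separated tokens.

```python
import re


def dialogue_ratio(text):
    """Return (dialogue_words, total_words, ratio).

    ASSUMPTION: dialogue is whatever sits between straight double quotes,
    and a word is any whitespace-separated token.
    """
    total = len(text.split())
    spoken = sum(len(span.split()) for span in re.findall(r'"([^"]*)"', text))
    ratio = spoken / total if total else 0.0
    return spoken, total, ratio


def deviation_from_target(ratio, target=0.70):
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Small made-up sample to show the counting on real text.
sample = 'Molly sighed. "Run now." Bill ran.'
sample_spoken, sample_total, sample_ratio = dialogue_ratio(sample)

# Recompute the reported figures from the recorded word counts.
reported_ratio = 272 / 531
reported_deviation = deviation_from_target(reported_ratio)
summary = f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}"
print(summary)
```

Feeding the recorded counts through the same arithmetic gives the figures shown in the Details column. How the evaluator maps an 18.78% deviation to a 0.00% score is not stated in the run record.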