NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPFG412FK68R7P9NMFZYVT
Run Details
Model:
Mistral Large 3
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
13.6s
Cost:
$0.001007
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
53.18%
Dialogue to Total Word Ratio
(details)
26.5877%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the glittery unicorn horn strapped to her head and squinted at the inflatable bounce castle swaying in the backyard. "You realize we look like we’re here to perform a magic show, not rob the place." Bill, dressed in a too-tight rainbow clown wig and a shirt that read *World’s Okayest Uncle*, scratched his nose. "First of all, we *are* here to perform a magic show. That’s our cover. Second, if you think this wig is bad, you should see the ones I didn’t pick." Molly groaned. "I can’t believe I let you talk me into wearing a tutu. I look like a rejected backup dancer from a *My Little Pony* concert." "Hey, the tutu’s got pockets. That’s the mark of a professional." Bill patted his own sequined fanny pack. "Besides, if we’re gonna blend in with a bunch of five-year-olds, we’ve gotta commit. Half-measures get you caught." Molly eyed the party from behind a suspiciously large balloon arch. "Alright, fine. But if I get glitter in my hair one more time, I’m taking it out of your cut." Bill grinned. "Deal. Now, here’s the plan: I distract the parents with my *legendary* balloon animal skills—" "You once made a balloon dog that looked like a sad potato." "—and while they’re all laughing at my *artistry*, you swoop in, grab the presents, and stash them in the bounce castle. Easy." Molly raised an eyebrow. "The bounce castle? That’s your big idea?" "Think about it. It’s the one place no one’s gonna look. Kids are bouncing on it, parents are too busy filming their kids bouncing on it—it’s the perfect hiding spot." Molly sighed. "And how do we get the presents out of the bounce castle without someone noticing?" Bill waved a hand. "We’ll figure it out. Maybe we’ll say we’re doing a ‘present piñata’ or something. Parents love that stuff." Molly pinched the bridge of her nose. "We are *terrible* at this." "Nah, we’re *creative*. There’s a difference." Bill adjusted his wig, which had slipped sideways. "Now, let’s go before the birthday kid blows out the candles and we miss our window." Molly took a deep breath. "Alright. But if this goes south, I’m telling everyone you made me wear the tutu." Bill clapped her on the back. "That’s the spirit. Now, let’s go steal some My Little Ponies and LEGO sets like the professionals we are." Molly muttered under her breath as they waded into the sea of screaming children. "I can’t believe my life has come to this." Bill, already twisting a balloon into something vaguely resembling a giraffe, shot her a grin. "Admit it. You’re having fun." Molly glared. "I will *end* you."
created_at
"2026-02-20T14:15:23.021Z"
2/20/2026, 2:15:23 PM
model
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
337
moderation_latency
(null)
generation_time
13557
tokens_prompt
77
tokens_completion
694
native_tokens_prompt
69
native_tokens_completion
648
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0010065
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"50ec38386cab46d4bfb2d9b3c2ff773d"
is_byok
false
latency
337
model_permaslug
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771596923-SNiehDLTbsnUM6ikJ3Nv"
upstream_id
"50ec38386cab46d4bfb2d9b3c2ff773d"
total_cost
0.0010065
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
453
53.18%
Dialogue to Total Word Ratio
Ratio: 66.45%, Deviation: 3.55%
neededClean
false
wordsTotal
453
wordsDialogue
301
26.5877%