NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXQ83VM5448CN4Q0PTF28BT
Run Details
Model:
Mistral Large 3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
16.3s
Cost:
$0.001197
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
48.8170%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of the Tipsy Turtle Pub smelled like stale beer and regret, which was fitting, since that’s exactly what Molly and Bill were serving up tonight. Molly leaned against the sticky table, twirling a lock of her fire-engine-red hair around one finger while Bill squinted at a crumpled flyer like it held the secrets of the universe. “A *children’s party*?” Bill said, his voice cracking like a teenager’s. “Molls, we’re thieves, not monsters.” Molly snatched the flyer from his hands and smoothed it out. “Oh, relax. It’s not like we’re stealing from orphans. This is *Little Timmy Whitmore’s* sixth birthday. His dad’s a hedge fund guy. Trust me, the kid won’t even notice the presents are gone.” Bill rubbed his temples. “I don’t know. Kids *notice* things. They’ve got, like, *spidey-senses* or something.” “Spidey-senses?” Molly snorted. “Bill, we’re not robbing a bank. We’re swiping a bunch of overpriced plastic crap wrapped in shiny paper. The hardest part will be not laughing when some toddler starts crying because we took his *Frozen* Elsa doll.” Bill groaned. “I can already see the headlines: *‘Notorious Thieves Brought to Justice by a Five-Year-Old with a Whistle.’*” “Please. We’ll be in and out before the piñata even gets cracked. Besides, I’ve got a plan.” Molly leaned in, her grin sharp enough to cut glass. “We go in as *party entertainers*.” Bill blinked. “You mean like… clowns?” “No, you idiot. *Magicians.*” She pulled a deck of cards from her pocket and fanned them out with a flourish. “I’ve been practicing.” Bill stared at the cards like they might bite him. “You can’t even shuffle without dropping half the deck.” “Details.” Molly waved a hand. “The point is, we show up, do a few tricks, distract the parents, and while everyone’s watching the kid blow out the candles, we scoop up the loot. Easy.” Bill scratched his chin. “And what if someone *does* notice the presents are gone?” “Then we say it was part of the act. *‘Ta-da! The gifts have vanished!’*” She mimed pulling a rabbit out of a hat. “Parents eat that stuff up. They’ll think it’s *hilarious.*” Bill sighed, long and dramatic. “I don’t know, Molls. This feels… wrong.” Molly rolled her eyes. “Oh, come on. When have we ever let *morality* get in the way of a good score?” “That time with the nun’s retirement fund?” “Okay, *fine*, bad example. But this is different! We’re not stealing from *nuns.* We’re stealing from *rich people.* It’s practically a public service.” Bill pinched the bridge of his nose. “You’re impossible.” “And yet, you keep coming back.” Molly grinned and slapped the table. “So, are we doing this or not?” Bill hesitated, then sighed. “Fine. But if we get arrested, I’m telling the cops you made me do it.” Molly clapped her hands. “That’s the spirit! Now, first order of business—we need *costumes.*” She pulled out her phone and started scrolling. “I’m thinking top hats, capes, maybe some sequins…” Bill groaned. “I’d rather go to prison.”
created_at
"2026-02-20T14:28:49.664Z"
2/20/2026, 2:28:49 PM
model
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
392
moderation_latency
(null)
generation_time
16219
tokens_prompt
77
tokens_completion
853
native_tokens_prompt
69
native_tokens_completion
775
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001197
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"5fdd2fa7993c45cc83a071e73f0774e5"
is_byok
false
latency
392
model_permaslug
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597729-72RU9LrkKo9eN5MKNbu5"
upstream_id
"5fdd2fa7993c45cc83a071e73f0774e5"
total_cost
0.001197
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
507
0.01%
Dialogue to Total Word Ratio
Ratio: 63.91%, Deviation: 13.91%
neededClean
false
wordsTotal
507
wordsDialogue
324
48.8170%