NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXQGXS4ME4RWCZS1W55MJH1
Run Details
Model:
Mistral Large 3
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
15.8s
Cost:
$0.001173
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
51.93%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
25.9628%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of the Tipsy Turtle Pub smelled like stale beer and regret, which was fitting, since that’s exactly what Molly and Bill were serving up tonight. Molly leaned against the sticky table, her fingers drumming a nervous rhythm on the chipped wood. Across from her, Bill slouched in his chair, squinting at a crumpled flyer like it held the secrets of the universe. “Okay, let’s go over it again,” Molly said, tapping the flyer. “‘Princess Penelope’s Magical Tea Party.’ Six o’clock. Backyard of the McAllister mansion. RSVP required.” Bill groaned, rubbing his temples. “I still don’t see why we can’t just rob a bank like normal people.” “Because banks have vaults, Bill. And guards. And those little ink things that explode all over you.” “And children’s parties don’t?” Molly sighed. “Look, the McAllisters are loaded. Their kid’s birthday presents are probably solid gold sippy cups. And the security? A couple of nannies scrolling through TikTok and a golden retriever named Mr. Waffles.” Bill perked up. “Mr. Waffles? That’s the best name I’ve ever heard.” “Focus, Bill. We’re thieves, not dog enthusiasts.” “Fine, fine.” He scratched his chin. “So, what’s the play? We just waltz in, grab the loot, and waltz out?” Molly grinned. “Not exactly. We’re going in as entertainers.” Bill blinked. “Entertainers.” “Yes. Specifically, as *magicians*.” “Molly, I can’t even shuffle a deck of cards without dropping half of them.” “Which is why I’ll be the magician, and you’ll be my *lovely assistant*.” Bill’s face went pale. “Oh, hell no. I am *not* wearing tights.” “It’s a children’s party, Bill. You’ll be wearing a top hat and a cape. Very dignified.” “Dignified my ass. What if someone recognizes me?” Molly waved a hand. “Please. You’re a white guy with a mustache. You’re basically invisible.” Bill touched his upper lip defensively. “This mustache is a work of art.” “It’s a crime against facial hair, but that’s beside the point. We get in, we distract the kids with some *amazing* magic tricks—” “Which you can’t actually do.” “—and while they’re all gasping at the floating bunny, we scoop up the presents and slip out the back.” Bill frowned. “What floating bunny?” Molly’s smile faltered. “I was hoping you’d have one in your pocket.” “Molly.” “Fine, fine. We’ll improvise. The point is, we get in, we get out, and we never have to see another glitter-covered unicorn again.” Bill stared at the flyer for a long moment. Then, with a sigh, he reached into his jacket and pulled out a pair of novelty glasses with a fake nose and mustache attached. “Alright. But if I get arrested, I’m telling them you made me do it.” Molly clapped her hands. “That’s the spirit! Now, let’s go over the tricks. First, the classic *disappearing coin*—” “Molly.” “—which, full disclosure, I’ve never actually pulled off.” Bill groaned and buried his face in his hands. “We’re doomed.”
created_at
"2026-02-20T14:33:38.351Z"
2/20/2026, 2:33:38 PM
model
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
728
moderation_latency
(null)
generation_time
15707
tokens_prompt
77
tokens_completion
853
native_tokens_prompt
69
native_tokens_completion
759
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001173
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"5044aac4210e4e7b8abb77a2454c5988"
is_byok
false
latency
728
model_permaslug
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771598018-ivTy6oDKoFvnlPMm9wf8"
upstream_id
"5044aac4210e4e7b8abb77a2454c5988"
total_cost
0.001173
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
51.93%
Matches word count
n/a
neededClean
false
words
484
0.00%
Dialogue to Total Word Ratio
Ratio: 68.39%, Deviation: 38.39%
neededClean
false
wordsTotal
484
wordsDialogue
331
25.9628%