NC Bench
Run: 2026-02-20-01KHXQCPQ9G2PE1E6KE8GZE9EH
Run Details
Model: Mistral Large 3
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.3s
Cost: $0.001158
Provider: Mistral
Evaluation summary
Score | Evaluator
97.63% | Matches word count
0.00% | Dialogue to Total Word Ratio
48.8138% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The fluorescent lights of the diner buzzed like a swarm of angry bees, casting a sickly glow over the chipped Formica table where Molly and Bill sat. A half-eaten plate of fries congealed between them, forgotten in favor of the blueprint spread across the table. Molly tapped a manicured nail against the paper, her lips pursed in concentration. "You’re telling me we’re hitting a *kids’ party*?" Bill said, scratching his stubble. "I’ve robbed banks, I’ve lifted jewels, but this feels… wrong." Molly rolled her eyes. "Oh, please. You once stole a priest’s collection plate mid-sermon. This is practically charity work." "Yeah, but the priest *expected* it. These are *children*, Mol. They cry when you take their stuff." She leaned forward, her voice dropping to a conspiratorial whisper. "Exactly. Who’s gonna suspect a couple of thieves at a six-year-old’s birthday bash? The cops? They’ll be too busy laughing at the idea to actually show up." Bill groaned, rubbing his temples. "I can already see the headlines: *‘Notorious Thieves Nabbed After Heist Gone Wrong—Culprits Found Sobbing in Ball Pit.’*" Molly snatched a fry and pointed it at him. "Focus. The target is the *presents*. Not the kids. We grab the loot, we bounce, no one gets hurt." "Except the birthday girl. She’s gonna be *devastated*." "She’ll get over it. Kids are resilient. Besides, her parents will just buy her new stuff. It’s not like we’re taking her *pony*." Bill sighed, slumping back in the booth. "Fine. But if I get glitter on me, I’m holding you personally responsible." Molly grinned, snapping the blueprint shut. "Deal. Now, here’s the plan—" "Wait, wait, wait." Bill held up a hand. "We *have* a plan?" "Of course we have a plan. We’re professionals." "Since when?" "Since *always*." She cleared her throat. "Step one: We show up early, scope the place. Step two: We distract the parents with free cake samples—" "Free cake samples?" "Yeah, I swiped a tray from the bakery down the street. 
Step three: While they’re busy stuffing their faces, we load the presents into the van. Step four: We’re out before the piñata even gets cracked." Bill stared at her. "That’s it? That’s the whole plan?" Molly shrugged. "Sometimes the simplest plans are the best." "Or the dumbest." "Look, if you’ve got a better idea, I’m all ears." Bill opened his mouth, then closed it. "Nope. No better ideas." "Great. Then it’s settled." Molly slid out of the booth, tossing a twenty onto the table. "We hit the party at three. Dress casual." Bill groaned. "I can’t believe I’m doing this." Molly patted his shoulder. "That’s the spirit. And hey—if we pull this off, we’ll be legends. The Bonnie and Clyde of birthday heists." Bill muttered something under his breath that sounded suspiciously like, "More like the Abbott and Costello of birthday heists." Molly ignored him, already striding toward the door. "Come on, Bill. We’ve got a party to crash."
created_at
"2026-02-20T14:31:20.049Z"
2/20/2026, 2:31:20 PM
model
"mistralai/mistral-large-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
411
moderation_latency
(null)
generation_time
15079
tokens_prompt
77
tokens_completion
802
native_tokens_prompt
69
native_tokens_completion
749
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001158
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"30b1eceadfec4eeba5866454031834d7"
is_byok
false
latency
411
model_permaslug
"mistralai/mistral-large-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597880-KLnlnCOr4H4KcocBiqZV"
upstream_id
"30b1eceadfec4eeba5866454031834d7"
total_cost
0.001158
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
97.63% | Matches word count | n/a | neededClean: false; words: 493
0.00% | Dialogue to Total Word Ratio | Ratio: 66.94%, Deviation: 36.94% | neededClean: false; wordsTotal: 493; wordsDialogue: 330
48.8138% | Overall
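The reported ratio and deviation can be reproduced from the word counts above. The sketch below is an arithmetic check, not the benchmark's own evaluator code: the function name `dialogue_ratio_pct` and the constant `TARGET_DIALOGUE_PCT` are hypothetical, and the 30% target is taken from the scenario text. The last step assumes the overall score is an unweighted mean of the two evaluator scores; the page does not state this, but the displayed numbers are consistent with it up to rounding.

```python
from statistics import mean

# Target share of dialogue, taken from the scenario ("30% dialogue").
TARGET_DIALOGUE_PCT = 30.0


def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    return 100.0 * words_dialogue / words_total


# Counts reported in the evaluation details.
ratio = dialogue_ratio_pct(words_dialogue=330, words_total=493)
deviation = abs(ratio - TARGET_DIALOGUE_PCT)
summary = f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%"
print(summary)  # Ratio: 66.94%, Deviation: 36.94%

# Assumption: overall = unweighted mean of the displayed evaluator scores.
# The displayed 97.63% is itself rounded, so this only approximates 48.8138%.
overall_estimate = mean([97.63, 0.00])
print(f"Estimated overall: {overall_estimate:.3f}%")
```

The response contains more than twice the requested share of dialogue, which is why the ratio evaluator scores 0.00% even though the word count is close to the 500-word target.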