NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM09FVJBM9RHGQW1ZZ12BJ4E
Run Details
Model:
Mistral Small 4
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
7.3s
Cost:
$0.000468
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
9.61%
Matches word count
(details)
0.38%
Dialogue to Total Word Ratio
(details)
4.9923%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain tapped against the window of the stolen ice cream truck like a nervous metronome. Molly, wearing a stolen tuxedo two sizes too small, adjusted the bow tie with a grimace. "You *promised* this thing would fit." Bill, slouched in the passenger seat with his feet propped on the dashboard, didn’t look up from his comic book. "Relax, it’s a costume party. No one’s gonna notice if you look like a 12-year-old who got drunk on soda." Molly glared but yanked the ill-fitting jacket tighter. "We’re not here to blend in, we’re here to *steal*. And the birthday kid’s a hyperactive nightmare in a dinosaur costume. How do we even know the presents are real?" Bill finally lowered his comic book. "Because his dad’s *built* like a linebacker with a coffee addiction. Either the kid’s loaded and his dad bought him real stuff, or this is a drug front. Either way, we take the haul and vanish before the piñata—that’s the only rule." Molly exhaled through her nose. "Fine. But if we get caught, I’m throwing you under the bus first." Bill grinned. "Aw, you say that like you don’t trust me. I’ve got us covered. Look." He tapped the truck’s radio, which now played a tinny, off-key rendition of *Happy Birthday*. "I hacked the local radio station’s automation software. Exact word-for-word sync with the party schedule. We’ve got thirty seconds before the cake comes out, so we move fast." Molly side-eyed him. "You *hacked a radio station* to steal children’s presents?" "Only took me six hours and a dollar fifty in dimes," Bill said proudly. "Efficiency, Molly. That’s the golden rule." Molly grabbed the duffel bag at her feet, stuffed with pillowcases and a novelty whoopee cushion she’d been *very* tempted to leave behind. "And what’s the backup plan if we get caught?" Bill tapped the fake mustache on his face. "Distraction. You start crying about ‘a puppy in the parking lot,’ and I’ll play the concerned parent. They’ll never suspect a thing." Molly blinked. "You *want* me to fake cry?" "Desperate times, desperate measures. Kids cry at parties all the time. It’s practically expected." Molly groaned, rubbing her temples. "I hate you." Bill patted her shoulder. "No, you don’t. You love me. And in ten minutes, we’ll be rolling in unopened Legos, the latest PlayStation, and enough candy to give a dentist nightmares." Molly hesitated, then sighed. "Fine. But if we get arrested, I’m blaming you for the mustache disaster." Bill gasped in mock offense, adjusting the thin strip of face fuzz. "This mustache is *iconic*." Outside, laughter erupted from the party—high-pitched, chaotic, the sounds of sugar-fueled chaos. Molly took a deep breath. "Let’s go before I lose my nerve." Bill clapped his hands. "That’s the spirit! Remember, silence is golden, agility is silver, and *never* eat the cake. Sugar’s the enemy of precision." Molly stepped into the rain, her polished shoes squelching in a puddle. "You are *insane*." "And yet, you keep coming back," Bill said, falling into step beside her. "That’s love, Moll." She didn’t answer. But her grip on the duffel bag tightened.
created_at
"2026-03-18T10:57:30.203Z"
3/18/2026, 10:57:30 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
371
moderation_latency
(null)
generation_time
7195
tokens_prompt
77
tokens_completion
834
native_tokens_prompt
81
native_tokens_completion
774
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00046791
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"0bd8fc3fa8ac4f3c9b5e02b055552f77"
is_byok
false
latency
371
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773831450-pyBC8Migf9FSFUOvMaEi"
upstream_id
"0bd8fc3fa8ac4f3c9b5e02b055552f77"
total_cost
0.00046791
cache_discount
0.00000864
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
9.61%
Matches word count
n/a
neededClean
false
words
522
0.38%
Dialogue to Total Word Ratio
Ratio: 60.57%, Deviation: 10.57%
neededClean
false
wordsTotal
530
wordsDialogue
321
4.9923%