NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM07GDGA5NM4PRXYG7Q2WM1Z
Run Details
Model:
Mistral Small 4 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
9.1s
Cost:
$0.000640
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.02%
Dialogue to Total Word Ratio
(details)
0.0111%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the windshield of the van, casting wobbly reflections of Molly’s determined face on the glass. She jabbed a finger at the blueprint spread across the dashboard, her hwisdom tooth glinting under the flickering dome light. "The cake table’s right here. That’s where the presents’ll be. Piled up like a gold rush waiting to happen." Bill scratched his stubble, a crumb of stale donut clinging to his lip. He squinted at the diagram like it was a cryptic treasure map. "So we go in through the deliveries entrance. Got it." He tapped the page where a small red arrow marked a door. "But how do we not look suspicious carrying out armfuls of Barbies and remote-control cars?" Molly flicked the donut crumb off his shirt with a finger. "Disguises, Bill. We dress up like entertainers. You know, balloons, face paint, one of those stupid clown wigs—" "Wait, base clown wig?" Bill interrupted. "Because I draw the line at full Sideshow Bob gig. That’s a one-way ticket to prison yard infamy." She rolled her eyes so hard it looked painful. "Not base clown. Magician. With a top hat and everything. The kids’ll think we’re part of the show." Molly leaned back, grinning. "And when they’re distracted by your fake disappearing act, I scoop up the good stuff. Easy." Bill rubbed his chin. "What if they don’t fall for the trick? What if they see through it and start crying?" Molly waved a hand. "Kids cry over spilled juice, Bill. We’re in and out before anyone knows what hit ‘em." She reached under her seat and pulled out a jester’s hat. "Here. You can practice in this. Makes you look less like a felon and more like... a public menace." Bill plopped the hat on his head, the bells jingling sadly. "I look like a rejected Renaissance fair reject." Molly smirked. "Perfect. Now we just need to figure out the getaway." She pulled out a slightly crumpled map. "The alley out back has a dumpster big enough to hide you if things go south." Bill’s face fell. "Hide me? You mean you're not going to hide with me?" She tossed a glow stick at his chest. "Nope. I’m the distraction. You’re the muscle. If anything goes wrong, you chuck the toys into the van while I hold them off with the dramatic monologue." Bill sighed, the bells on the hat flopping as he nodded. "Fine. But when the cops show up and it’s all ‘Why yes officer, we were hired clown assassins for the NSA,’ I’m blaming you." Molly clapped her hands. "Deal. Now help me break into that supply closet. We need more balloons for our alibis."
created_at
"2026-03-18T10:22:51.413Z"
3/18/2026, 10:22:51 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
377
moderation_latency
(null)
generation_time
9003
tokens_prompt
77
tokens_completion
1135
native_tokens_prompt
81
native_tokens_completion
1047
native_tokens_completion_images
(null)
native_tokens_reasoning
469
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00064035
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"f0044a2f090d4529804f2eaadbd851ae"
is_byok
false
latency
373
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829371-VnL0VTAFvzQ9rnzfqPn6"
upstream_id
"f0044a2f090d4529804f2eaadbd851ae"
total_cost
0.00064035
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
449
0.02%
Dialogue to Total Word Ratio
Ratio: 62.97%, Deviation: 12.97%
neededClean
false
wordsTotal
451
wordsDialogue
284
0.0111%