NC Bench
Run: 2026-03-18-01KM0AN5M5SJF6VBEQ2S0YBRJV
Run Details
Model: Mistral Small 4 (Reasoning)
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 14.6s
Cost: $0.001410
Provider: Mistral
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 0.00% | Matches word count |
| 0.71% | Dialogue to Total Word Ratio |
| 0.3565% | Overall |
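The page does not say how the Overall score is derived from the two evaluator scores. The short check below is only a sketch under the assumption that Overall is an unweighted mean of the two scores; the variable names are illustrative and not taken from NC Bench. It shows that the displayed values are consistent with that assumption once rounding of the ratio score is taken into account.

```python
# Scores as displayed in the evaluation summary, in percent.
word_count_score = 0.00
ratio_score_displayed = 0.71
overall_displayed = 0.3565

# ASSUMPTION: Overall is the unweighted mean of the two evaluator scores.
# Under that assumption, the unrounded ratio score would be:
implied_ratio_score = 2 * overall_displayed - word_count_score

# Compare the implied score, rounded to two decimals, with the displayed one.
consistent = f"{implied_ratio_score:.2f}" == f"{ratio_score_displayed:.2f}"
print(f"implied ratio score: {implied_ratio_score:.3f}%, consistent: {consistent}")
```

If the benchmark weights its evaluators differently, this check would not apply as written.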
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his fake mustache in the reflection of a parked car window. "You sure about this, Molly? The last time we pulled a job dressed as clowns, you tripped over your own feet." Molly rolled her eyes, adjusting her curly wig. "That was one time, and the cake was a total loss. Today, we're in and out before the piñata even hits the floor." Bill scratched his chin. "And the presents? They're all going to be in one big pile, right? Easy grab?" "Not necessarily," Molly said, pulling out a floor plan of the venue. "Apparently, they're doing this 'piñata-free' thing for some reason. All the presents are lined up on tables along the wall. No central pile." Bill groaned: "So we have to walk among a room full of sugar-crazed children just to nick loot from tables?" Molly grinned: "Exactly. And here's the best part—we go in as the entertainment." Bill blinked. "We?" "You and I are going to be the magicians. Remember when you pulled that rabbit out of your hat last Thanksgiving?" Bill's face fell. "That rabbit was a stress ball I found under the couch." Molly patted his shoulder. "Details, details. The kids won't know the difference. Just wave your wand, say some nonsense about 'abracadabra,' and while they're distracted, I'll scoop up the presents." Bill sighed: "So I'm the fall guy, and you're the brains. Sounds about right." Molly winked: "And after this, we split the haul. No splitting headaches this time." Bill adjusted his wand—a glorified stick with a star on the end. "One job. In. Out. No drama." "Exactly," Molly said, checking her watch. "Time to dazzle some kids."
created_at
"2026-03-18T11:17:52.916Z"
3/18/2026, 11:17:52 AM
model
"mistralai/mistral-small-2603"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
409
moderation_latency
(null)
generation_time
14485
tokens_prompt
77
tokens_completion
2456
native_tokens_prompt
81
native_tokens_completion
2330
native_tokens_completion_images
(null)
native_tokens_reasoning
2031
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00141015
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"d82d3b03f822431f92e99f7a3d34c983"
is_byok
false
latency
401
model_permaslug
"mistralai/mistral-small-2603"
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773832672-zkv3INh77HZX2MK1bdXN"
upstream_id
"d82d3b03f822431f92e99f7a3d34c983"
total_cost
0.00141015
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.00% | Matches word count | n/a | neededClean: false; words: 279 |
| 0.71% | Dialogue to Total Word Ratio | Ratio: 75.09%, Deviation: 14.91% | neededClean: false; wordsTotal: 281; wordsDialogue: 211 |
| 0.3565% | Overall | | |
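The reported ratio and deviation can be reproduced from the word counts. The sketch below is not the evaluator's actual implementation, which this page does not show. `dialogue_ratio` is a hypothetical helper that assumes dialogue is whatever sits between straight double quotes and that words are whitespace-separated. The arithmetic at the end uses only the reported `wordsDialogue` and `wordsTotal` values and the 90% target from the scenario.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    ASSUMPTION: dialogue is text enclosed in straight double quotes and
    words are whitespace-separated; the real evaluator may count differently.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    return dialogue, total, (dialogue / total if total else 0.0)


# Counts reported in the evaluation details for this run.
words_dialogue, words_total = 211, 281
target = 0.90  # the scenario asks for 90% dialogue

ratio = words_dialogue / words_total
deviation = target - ratio
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 75.09%, Deviation: 14.91%"
```

For example, `dialogue_ratio('Bill sighed. "Fine, let us go."')` counts 4 dialogue words out of 6. Note that the two evaluators report different totals for the same response (279 and 281 words), which suggests they tokenize slightly differently, so a helper like this one may not match either count exactly.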