NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM07TBNJCH46HRN0VGXCF741
Run Details
Model:
Mistral Small 4
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7.0s
Cost:
$0.000365
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0007%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly zipped her leather jacket up to her chin and adjusted her ski mask in the rearview mirror of the stolen ice cream truck. Bill was already inside, tapping his fingers impatiently against the dashboard. "You’re late," Bill said. "I thought you got caught in traffic or something." Molly slammed the door. "Traffic? In this neighborhood? You try driving a vehicle made entirely of frozen dairy products through a tunnel, then we’ll talk." She pulled a handful of multicolored beads from her pocket and shook them like dice. "Besides, I brought entertainment." Bill squinted at the bead necklace. "That’s a Mardi Gras accessory, not a getaway plan." "Details." Molly threw the necklace around her neck. "It’s camouflage. Kids see colors, their brains short-circuit. They won’t even look at our faces." Bill rubbed his temples. "We’re breaking into a child’s birthday party, not infiltrating a clown convention." "Semantics." Molly popped open the back hatch. Inside lay a duffel bag, two pairs of elbow-length gardening gloves, and a roll of duct tape labeled “For Emergencies (this is an emergency).” "I did recon. Front door’s got a teddy bear balloon arch—classic alarm trigger. Side gate’s got a flimsy padlock. Side gate it is." Bill lifted the roll of duct tape. "Last time we used duct tape, we got caught in a ball pit. Remember the time-out corner?" Molly snatched the tape back. "This isn’t last time. This is next time, smarter. I brought industrial-grade adhesive—sticks to fur, plastic, parental panic." She pulled out a small teddy bear from her bag and squeezed its belly. A tiny alarm chirped. Bill groaned. "You brought a booby-trapped teddy bear to a birthday party heist? Who does that?" Molly stuffed the bear back into her bag. "It was free at the thrift store. Also, it proves my point. The house is rigged for kids, which means the adults are half-asleep. Perfect target." She tossed Bill a stuffed unicorn wearing sunglasses. "Your costume." Bill held the unicorn at arm’s length like it might bite. "I’m not wearing this." "You’ll wear it or you’ll go as the Grinch yourself. Your choice." Molly yanked the unicorn sunglasses from Bill’s hand and plopped them on his head. "Now you look like a disgruntled mascot who got lost on the way to a rave." Bill sighed, looked at himself in the side mirror, and said, "I look like a nightmare Disney rejects hired." Molly zipped the duffel bag shut. "Exactly. Let’s move."
created_at
"2026-03-18T10:28:17.216Z"
3/18/2026, 10:28:17 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1842
moderation_latency
(null)
generation_time
6863
tokens_prompt
77
tokens_completion
650
native_tokens_prompt
81
native_tokens_completion
592
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
16
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00036519
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"b534036e943144e3ac250672adfe761f"
is_byok
false
latency
1842
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829697-fHsKf4WeOz6bcYmTjeyq"
upstream_id
"b534036e943144e3ac250672adfe761f"
total_cost
0.00036519
cache_discount
0.00000216
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
412
0.00%
Dialogue to Total Word Ratio
Ratio: 55.02%, Deviation: 14.98%
neededClean
false
wordsTotal
418
wordsDialogue
230
0.0007%