NC Bench
Run: 2026-03-18-01KM08J9Q5NNEQTPEKCKVAHA6X
Run Details
Model:
Mistral Small 4 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
42.9s
Cost:
$0.002619
Provider:
Mistral
Evaluation summary
Score
Evaluator
99.38%
Matches word count
18.92%
Dialogue to Total Word Ratio
59.1487%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched in the alley behind the house, its windows pulsing with neon letters spelling *"HAPPY 5TH BIRTHDAY, TIMMY!"* Inside, a kazoo orchestra massacred "The Wheels on the Bus," and the air smelled of sugar and desperation. A deflated Elsa balloon rolled past them like a tumbleweed from a heist gone wrong. Molly adjusted her ski mask, her breath fogging the fabric. “Back door only. Quiet in and out.” Bill scratched his chin. “Or we just walk in as clowns.” “That’s your big idea?” “Works for birthday parties.” The backyard stank of spilled fruit punch and broken crayons. Their lockpicks clicked against the patio door like a metronome set to chaos. Inside, a mountain of presents glittered under strobing disco lights—a Lego Death Star, a My Little Pony castle, and enough stuffed animals to start their own zoo. Molly stuffed a duffel bag, shoving aside a Tutankhamun action figure. “Three minutes. Then we bolt.” Bill nodded, counting down on his grubby fingers. “One. Two. Three.” A child’s voice shrieked from inside. “THE CAKE IS GONE!” Molly yanked the wagon’s handle. “RUN.” The presents rattled behind them like an avalanche of guilt as they vanished into the night, whistling the birthday song off-key.
created_at
"2026-03-18T10:41:21.648Z"
3/18/2026, 10:41:21 AM
model
"mistralai/mistral-small-2603"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
441
moderation_latency
(null)
generation_time
42851
tokens_prompt
77
tokens_completion
4554
native_tokens_prompt
81
native_tokens_completion
4345
native_tokens_completion_images
(null)
native_tokens_reasoning
4213
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00261915
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"dd040819393742e8a76a430a9d6648c0"
is_byok
false
latency
421
model_permaslug
"mistralai/mistral-small-2603"
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830481-FNw8BPXevUFWpZ5L9i2a"
upstream_id
"dd040819393742e8a76a430a9d6648c0"
total_cost
0.00261915
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
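A few derived figures can be read off the metadata above. The sketch below is a minimal illustration, not part of the benchmark itself. It assumes that generation_time is in milliseconds (which agrees with the 42.9s Duration shown in Run Details) and that native_tokens_reasoning is counted inside native_tokens_completion; the second assumption is not confirmed by the page.

```python
# Values copied from the run metadata above.
meta = {
    "generation_time": 42851,          # assumed to be milliseconds
    "native_tokens_completion": 4345,
    "native_tokens_reasoning": 4213,   # assumed to be a subset of completion tokens
}

# Generation time in seconds; rounds to the 42.9s shown as Duration.
duration_s = meta["generation_time"] / 1000

# Completion tokens left over after reasoning, under the subset assumption.
non_reasoning_tokens = (
    meta["native_tokens_completion"] - meta["native_tokens_reasoning"]
)

# Share of completion tokens spent on reasoning.
reasoning_share = (
    meta["native_tokens_reasoning"] / meta["native_tokens_completion"]
)

# Rough throughput over the whole generation.
tokens_per_second = meta["native_tokens_completion"] / duration_s

print(f"duration: {duration_s:.1f}s")
print(f"non-reasoning completion tokens: {non_reasoning_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions, nearly all of the completion tokens went to reasoning and only a small remainder to the 200-word scene itself.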
Evaluation details
Result
Evaluator
Details
Meta Data
99.38%
Matches word count
n/a
neededClean
false
words
205
18.92%
Dialogue to Total Word Ratio
Ratio: 21.36%, Deviation: 11.36%
neededClean
false
wordsTotal
206
wordsDialogue
44
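The ratio and deviation reported for this evaluator can be reproduced from wordsDialogue and wordsTotal. The sketch below assumes the target is the 10% requested in the scenario and that the deviation is the absolute difference in percentage points. The function names are hypothetical, and how the 18.92% score is derived from the deviation is not shown on this page, so that step is left out.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Dialogue words as a percentage of all words."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float) -> float:
    """Absolute distance from the target, in percentage points (assumed)."""
    return abs(ratio_pct - target_pct)


# wordsDialogue and wordsTotal from the evaluation metadata above.
ratio = dialogue_ratio(44, 206)
# 10% is the target stated in the scenario.
deviation = deviation_from_target(ratio, 10.0)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 21.36%, Deviation: 11.36%"
```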
59.1487%
Overall
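The overall figure is consistent with an unweighted mean of the two evaluator scores. This is an assumption inferred from the numbers, not something the page states. A quick check using the rounded scores shown above:

```python
from statistics import mean

# Evaluator scores as displayed on the page (already rounded to two decimals).
scores = {
    "Matches word count": 99.38,
    "Dialogue to Total Word Ratio": 18.92,
}

# Assumed aggregation: a plain average with equal weights.
overall = mean(scores.values())

print(f"Overall (from rounded scores): {overall:.2f}%")
```

The result differs from the reported 59.1487% only in the third decimal place, which is what one would expect if the benchmark averages the unrounded scores.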