NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM08N5AT8234G2QM148DJM55
Run Details
Model:
Mistral Small 4
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
4.3s
Cost:
$0.000208
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
29.48%
Dialogue to Total Word Ratio
(details)
14.7404%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the roof of the parked van, turning the windows into a warped mirror of the empty playground outside. Molly adjusted her ski mask in the rearview, her breath fogging the plastic lenses of her sunglasses. Bill, hunched over a ratty notebook, traced the blueprint of the community center with a grimy finger. The presents inside the multipurpose room were lined up neatly—plastic dinosaurs, glittery wands, a Remote Control car that looked suspiciously expensive. "You sure about the vent?" Bill muttered, tapping the notebook. "Last week’s ‘Toddler Terrors’ fiasco had us prying open a goddamn clown cake instead of the—" "Relax," Molly interrupted, already tossing a grappling hook toward the downspout. "This one’s not rigged. Unlike that thing." She jerked her chin toward the vent Bill had nearly face-planted through three months ago. Bill grumbled but secured the other end of the rope to the van’s tow hitch. "If I get stuck in another restricted airflow maze, I’m blaming you. And also quitting the life of crime." Molly smirked, already halfway up the side of the building. The presents glittered under the flickering fluorescent light like stolen treasure in an elementary school Indiana Jones. She wriggled her way through the vent with the practiced ease of someone who’d practiced wriggling through vents a lot. Somewhere beneath her, Bill’s heavy breathing echoed through the school corridors like a disapproving Yeti.
created_at
"2026-03-18T10:42:55.461Z"
3/18/2026, 10:42:55 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
412
moderation_latency
(null)
generation_time
4204
tokens_prompt
77
tokens_completion
379
native_tokens_prompt
81
native_tokens_completion
326
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00020775
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"eba57aa49be54634b15567e0255fbe9a"
is_byok
false
latency
412
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830575-fsmJzFn28cnGKA0lta2U"
upstream_id
"eba57aa49be54634b15567e0255fbe9a"
total_cost
0.00020775
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
233
29.48%
Dialogue to Total Word Ratio
Ratio: 20.51%, Deviation: 10.51%
neededClean
false
wordsTotal
234
wordsDialogue
48
14.7404%