NC Bench
Run: 2026-03-18-01KM0902VFDNC9VF59WR08DS6K
Run Details
Model:
Mistral Small 4 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
12.7s
Cost:
$0.001113
Provider:
Mistral
Evaluation summary
Matches word count: 0.01%
Dialogue to Total Word Ratio: 99.95%
Overall: 49.9794%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The diner’s neon sign buzzed like a dying wasp as Molly spread out a crumpled blueprint across their sticky table. Bill slurped his milkshake noisily, still half-asleep at two in the afternoon. "You sure this is the right place?" Bill asked, licking whipped cream off his thumb with the enthusiasm of a man who had never considered consequences. Molly adjusted her glasses, squinting at the blueprint. "Positive. Local moms’ Facebook group posted about the party an hour ago. ‘Little Timmy’s 5th Birthday Bash,’ complete with a treasure hunt and a clown named Bumbles." Bill grinned, nearly knocking over his shake. "Kids and loot—best day ever." Molly’s eye twitched. "What if there are glitter cannons?" "Distraction’s key," Bill said, tossing a fry into his mouth. "You start a food fight. I grab the presents and meet you at the getaway car. Vroom vroom." She frowned. "And if someone cries? Because kids cry. It’s what they do." Bill adopted a solemn tone, adjusting an imaginary tie. "Then we’re professionals who care about the children’s emotional well-being. We offer juice boxes and a consoling pat on the back before vanishing into the night." Molly groaned, rubbing her temples. "We’re going to jail." Bill pulled two ski masks from his jacket—one with a poorly stitched smiley face. "Ready?" "Not even a little," she muttered, eyeing the door like it was rigged with landmines.
created_at
"2026-03-18T10:48:53.373Z"
3/18/2026, 10:48:53 AM
model
"mistralai/mistral-small-2603"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
400
moderation_latency
(null)
generation_time
12600
tokens_prompt
77
tokens_completion
1977
native_tokens_prompt
81
native_tokens_completion
1835
native_tokens_completion_images
(null)
native_tokens_reasoning
1602
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00111315
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"19fcb4c881a54ef680edfc1a07a1a989"
is_byok
false
latency
387
model_permaslug
"mistralai/mistral-small-2603"
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830933-gcMcmKIpsCZG6kvjOXfY"
upstream_id
"19fcb4c881a54ef680edfc1a07a1a989"
total_cost
0.00111315
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
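A few derived figures make the token metadata above easier to read. The sketch below is illustrative only and rests on two assumptions that the page does not confirm: that `generation_time` is in milliseconds, and that `native_tokens_reasoning` is already included in `native_tokens_completion`. The `summarize` helper is a hypothetical name, not part of NC Bench.

```python
# Values copied from the run metadata above.
meta = {
    "generation_time": 12600,          # assumed to be milliseconds
    "native_tokens_prompt": 81,
    "native_tokens_completion": 1835,  # assumed to include reasoning tokens
    "native_tokens_reasoning": 1602,
}


def summarize(meta: dict) -> dict:
    """Derive visible-token count, reasoning share, and throughput."""
    completion = meta["native_tokens_completion"]
    reasoning = meta["native_tokens_reasoning"]
    seconds = meta["generation_time"] / 1000
    return {
        "visible_tokens": completion - reasoning,
        "reasoning_share": reasoning / completion,
        "tokens_per_second": completion / seconds,
    }


summary = summarize(meta)
print(summary["visible_tokens"])               # 233
print(f"{summary['reasoning_share']:.1%}")     # 87.3%
print(f"{summary['tokens_per_second']:.1f}")   # 145.6
```

Under these assumptions, most of the completion budget went to reasoning, and only a small remainder made up the visible scene.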
Evaluation details
Matches word count
Result: 0.01%
Details: n/a
Meta Data: neededClean = false, words = 231

Dialogue to Total Word Ratio
Result: 99.95%
Details: Ratio: 48.50%, Deviation: 1.50%
Meta Data: neededClean = false, wordsTotal = 233, wordsDialogue = 113

Overall: 49.9794%
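The reported ratio and deviation can be reproduced from the word counts above. The following is a minimal sketch, not the evaluator's actual implementation: it assumes dialogue is any text between straight double quotes, that words are whitespace-separated, and that the Overall figure is an unweighted mean of the two evaluator scores. The function names are hypothetical.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is whatever sits between straight double quotes.
    """
    total = len(text.split())
    spoken = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(s.split()) for s in spoken)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target share."""
    return abs(ratio - target)


# Figures reported in the evaluation details: 113 dialogue words of 233.
ratio = 113 / 233
print(f"{ratio:.2%}")                         # 48.50%
print(f"{deviation_from_target(ratio):.2%}")  # 1.50%

# Unweighted mean of the two rounded evaluator scores (an assumption).
print(f"{(0.01 + 99.95) / 2:.2f}")            # 49.98
```

The mean of the rounded scores, 49.98, is close to the reported 49.9794%, which suggests the page averages unrounded values. Note that the two evaluators also count words differently (231 versus 233), so a simple whitespace split will not necessarily match either.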