NC Bench
Run: 2026-03-18-01KM096FQ4ZVTSSSN556NHETE0
Run Details
Model: Mistral Small 4 (Reasoning)
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 49.2s
Cost: $0.005277
Provider: Mistral
Evaluation summary
Score      Evaluator
0.00%      Matches word count
99.27%     Dialogue to Total Word Ratio
49.6340%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the fake mustache for the third time that evening, her eyes locked onto the blueprints of the McAllister mansion penthouse. The layout was simple enough—front door, narrow hallway, and a party room bursting with games, cake, and, most importantly, presents wrapped in enough glitter to require a hazmat suit. The penthouse above the party was a glitter bomb waiting to detonate at any second—metaphorically, of course. Bill, now a clown for the third time that week (after the pirate costume that made him look seasick and the failed mime routine where he tripped into the cake), gnawed nervously on a licorice whip stolen from the craft services table. The shoes, two sizes too small and squeaking like a teakettle with every step, announced his presence long before he arrived, like a foghorn in a library or the world’s most awkward dinner guest. 'Madness.' Molly didn’t look up from the blueprints, where she’d circled the escape route in what she called 'strategic pink highlighter.' 'Cynicism?' Bill groaned. 'If caught—' 'Whining noted.' She tossed him a pillowcase that had seen better days and smelled faintly of apple juice. 'Fast steals, slow exits. And no singing.' 'No balloons. Or noses.' 'Deal.' They stared at each other, two disgraced professionals reduced to breaking into a party full of sugar-fueled toddlers wearing plastic tiaras and superhero capes. Some were probably plotting their first bank heist already. This was either the heist of the century or a one-way ticket to the principal’s office—assuming the parents didn’t tackle them to the ground first.
created_at
"2026-03-18T10:52:23.15Z"
3/18/2026, 10:52:23 AM
model
"mistralai/mistral-small-2603"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
396
moderation_latency
(null)
generation_time
49114
tokens_prompt
77
tokens_completion
8806
native_tokens_prompt
81
native_tokens_completion
8789
native_tokens_completion_images
(null)
native_tokens_reasoning
8388
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00527691
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"55dd83a3a93248f0ae05217fd7680dfd"
is_byok
false
latency
396
model_permaslug
"mistralai/mistral-small-2603"
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773831143-vq3pgwDMxi4vyAPF5xPS"
upstream_id
"55dd83a3a93248f0ae05217fd7680dfd"
total_cost
0.00527691
cache_discount
0.00000864
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result     Evaluator                      Details                           Meta Data
0.00%      Matches word count             n/a                               neededClean: false, words: 261
99.27%     Dialogue to Total Word Ratio   Ratio: 12.93%, Deviation: 2.93%   neededClean: false, wordsTotal: 263, wordsDialogue: 34
49.6340%   Overall
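
The ratio and deviation reported for the dialogue evaluator can be reproduced from the wordsDialogue and wordsTotal figures. The sketch below is only an illustration: the 10% target is taken from the scenario description, while the function name and the unweighted averaging of the two evaluator scores are assumptions, not the benchmark's documented method. How the 99.27% score itself is derived from the deviation is not stated on this page, so it is not modelled here.

```python
# Hypothetical helper; NC Bench's actual evaluator code is not shown in the source.
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of the response's words that fall inside dialogue."""
    return words_dialogue / words_total


TARGET_RATIO = 0.10  # "10% dialogue", from the scenario description

# Figures reported in the evaluation details.
ratio = dialogue_ratio(words_dialogue=34, words_total=263)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 12.93%, Deviation: 2.93%"

# Assumption: the overall score is an unweighted mean of the evaluator scores.
# Using the rounded scores shown on the page gives a value close to, but not
# exactly, the reported 49.6340%.
evaluator_scores = [0.0, 0.9927]
mean_score = sum(evaluator_scores) / len(evaluator_scores)
print(f"Mean of rounded scores: {mean_score:.4%}")
```

The small gap between the mean of the rounded scores and the reported overall figure would be expected if the site averages unrounded values, though that is a guess.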