NC Bench
Run: 2026-03-18-01KM07CPM48DKP1WZT8W8C2NJ2
Run Details
Model: Mistral Small 4
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 3.6s
Cost: $0.000211
Provider: Mistral
Evaluation summary
Score | Evaluator
0.00% | Matches word count
8.08% | Dialogue to Total Word Ratio
4.0400% | Overall
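The Overall figure is consistent with an unweighted mean of the two evaluator scores. The sketch below reproduces that arithmetic under this assumption; the page does not state how NC Bench aggregates scores, and the function name `overall_score` is hypothetical.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores in percent (assumed aggregation)."""
    if not scores:
        raise ValueError("at least one evaluator score is required")
    return sum(scores) / len(scores)


# Scores taken from the evaluation summary for this run.
evaluator_scores = [0.00, 8.08]
print(f"{overall_score(evaluator_scores):.4f}%")
```

With the two scores from this run, the mean works out to 4.04, matching the reported Overall value.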
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the straps of her backpack, the zipper threatening to burst under the weight of stolen candy and a single, suspiciously large inflatable T-Rex. Bill flicked his lockpicks into place with the grace of a man who had recently given up on monogamy, then grunted as the blast door groaned open. The hallway beyond smelled like juice boxes and betrayed parental promises. "You really think they keep the good stuff in the main hall?" Molly whispered, her boots squeaking against the balloon-strewn carpet. Balloons that, upon closer inspection, had been aggressively licked by something small and peanut-brained. Bill didn’t answer. He was too busy admiring the cake table, a fortress of frosting and glitter with a single, naive police officer figurine standing guard. Molly elbowed him in the ribs. "Focus. We’re here for the presents, not your midlife crisis." Beneath the glittering wreckage of a piñata battle, they found the treasure chest—a locked plastic bin labeled with a Sharpie's desperate hope: "GIFTS FOR THE CHILDREN, UNLESS THEY CRY TOO MUCH, THEN NOT." Bill produced a pair of novelty scissors, the kind with plastic eyeballs that popped out and screamed, and wiggled them meaningfully. Molly sighed just as the fire alarm began its cheerful chirp, likely triggered by the officer figurine’s violent overthrow. "You triggered it?" Molly hissed. "I thought that was part of the plan." "It was the part where we left before getting caught."
created_at
"2026-03-18T10:20:49.683Z"
3/18/2026, 10:20:49 AM
model
"mistralai/mistral-small-2603"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
365
moderation_latency
(null)
generation_time
3498
tokens_prompt
77
tokens_completion
381
native_tokens_prompt
81
native_tokens_completion
332
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00021135
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"c16cbe38c5e145dda24c5b973e352946"
is_byok
false
latency
365
model_permaslug
"mistralai/mistral-small-2603"
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829249-BBGthdlNeKZHwuM6YeLX"
upstream_id
"c16cbe38c5e145dda24c5b973e352946"
total_cost
0.00021135
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 237
8.08% | Dialogue to Total Word Ratio | Ratio: 22.59%, Deviation: 12.59% | neededClean: false, wordsTotal: 239, wordsDialogue: 54
4.0400% | Overall
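The reported ratio and deviation follow from the word counts in the metadata: 54 dialogue words out of 239 total, measured against the 10% target in the scenario. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words, by summing the words inside straight double quotes. The quote-based counter, the whitespace tokenization, and the helper names are assumptions, not the evaluator's actual implementation.

```python
import re

TARGET_RATIO = 0.10  # from the scenario: "10% dialogue"


def count_words(text):
    """Count whitespace-separated tokens (assumed tokenization)."""
    return len(text.split())


def dialogue_words(text):
    """Count words inside straight double quotes (assumed dialogue rule)."""
    quoted = re.findall(r'"([^"]*)"', text)
    return sum(count_words(q) for q in quoted)


def ratio_and_deviation(words_dialogue, words_total, target=TARGET_RATIO):
    """Return the dialogue ratio and its absolute deviation from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation details for this run.
ratio, deviation = ratio_and_deviation(54, 239)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Hypothetical short sample to exercise the quote-based counter.
sample = 'Molly sighed. "You triggered it?" she hissed. "I thought that was part of the plan."'
print(dialogue_words(sample), count_words(sample))
```

Counting the quoted spans in the model response by this rule gives 54 words, which includes the quoted label on the plastic bin, so the evaluator appears to treat any quoted text as dialogue. How the 12.59% deviation maps to the 8.08% score is not shown on the page, so that step is left out.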