NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9WQT0NT085D7CMG159ZM4
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
1.9s
Cost:
$0.000060
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
1.04%
Matches word count
(details)
13.04%
Dialogue to Total Word Ratio
(details)
7.0381%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a birthday balloon arch pulsed like a drunk firefly outside the party house, its reflection dancing on the rain-slicked pavement. Molly adjusted her gloves, her breath fogging in the cold night air. "Alright, Bill. We’re stealing a room full of toys like it’s a buffet, and no one’s gonna notice." Bill wiped his brow with a bandana, his eyes locked on the security camera mounted above the front door. "Easy for you to say. You’ve got the charm of a chihuahua in a clown suit. I’m the one who’s gotta distract the bouncer." "Oh, *please*. You’re the one who got caught stealing a piñata last year. Remember? The one with the *screaming* mariachi band inside?" Molly cackled, nudging him with her elbow. "I’m the brains, you’re the *brawn*—but right now, you’re the guy who’s gonna trip over his own shoelaces and wake up the whole block." Bill groaned, rubbing his temples. "Fine. But if I have to pretend to be a lost uncle, I’m charging you extra for the emotional damage." Molly grinned, tossing him a fake mustache. "Deal. Just don’t start crying when you see all those *unopened* action figures. We’re not here to *play* with them." She adjusted her mask, her voice dropping to a conspiratorial whisper. "But if you *do* want to keep one… I won’t tell anyone."
created_at
"2026-02-20T19:54:39.822Z"
2/20/2026, 7:54:39 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
206
moderation_latency
(null)
generation_time
1634
tokens_prompt
77
tokens_completion
342
native_tokens_prompt
69
native_tokens_completion
330
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00005985
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"fb88ec6b99f64b9399356dc904f71e50"
is_byok
false
latency
206
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771617279-CsiTrG7tugm7ZRozGRyW"
upstream_id
"fb88ec6b99f64b9399356dc904f71e50"
total_cost
0.00005985
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
1.04%
Matches word count
n/a
neededClean
false
words
226
13.04%
Dialogue to Total Word Ratio
Ratio: 61.95%, Deviation: 11.95%
neededClean
false
wordsTotal
226
wordsDialogue
140
7.0381%