NC Bench
Run: 2026-02-20-01KHYAT6F6ME5EHXJ4VF135ADF
Run Details
Model: Ministral 3 8B
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 3.3s
Cost: $0.000083
Provider: Mistral
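As a rough sanity check on the reported cost and speed, the sketch below recomputes both from the native token counts and generation time listed in this run's metadata. The flat price of $0.15 per million tokens is an assumption (no price is stated on this page), as is reading `generation_time` as milliseconds. The function names are hypothetical.

```python
# Assumed flat price in USD per million tokens (not stated on this page).
ASSUMED_PRICE_PER_MILLION_USD = 0.15


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  price_per_million: float = ASSUMED_PRICE_PER_MILLION_USD) -> float:
    """Estimate the cost in USD, charging prompt and completion tokens at one rate."""
    return (prompt_tokens + completion_tokens) * price_per_million / 1_000_000


def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Completion throughput, assuming generation_time is given in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


if __name__ == "__main__":
    # native_tokens_prompt = 69, native_tokens_completion = 483, generation_time = 2487
    print(f"estimated cost: ${estimate_cost(69, 483):.7f}")
    print(f"throughput: {tokens_per_second(483, 2487):.1f} tokens/s")
```

Under these assumptions the estimate agrees with the reported `usage` value of 0.0000828, which suggests billing is based on the native token counts rather than the normalized ones.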
Evaluation summary
Score    Evaluator
0.00%    Matches word count
0.00%    Dialogue to Total Word Ratio
0.0000%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering neon sign cast long shadows over the alley as Molly adjusted her gloves, her breath fogging in the cold night air. "Alright, Bill, we’re stealing a kid’s birthday party. That’s either the dumbest or smartest thing we’ve ever done." Bill leaned against a dumpster, cracking his knuckles. "Smartest. Kids don’t even notice when you’re there. They’re too busy screaming about the cake and the clown." "Or the clown *is* the problem," Molly muttered, eyeing the inflatable palm tree outside the venue. "What if the security’s a guy in a rubber nose?" Bill grinned. "Then we’re stealing from a guy in a rubber nose. That’s a win." "Fine. But if we get caught, I’m blaming you for not checking the guest list for any ex-convicts." "Relax. The only ex-con here is the guy who runs the bounce house. Trust me, he’s harmless." Molly crossed her arms. "You’re saying we’re breaking into a party where the biggest threat is a guy who’s probably just really bad at his job?" "Exactly. And if we’re quick, we’ll be in and out before the parents even realize their kids have turned into tiny, sugar-fueled tornadoes." Molly sighed, pulling out a tiny notepad. "Alright, new plan. We go in through the kitchen—" "—where the food’s the best," Bill interrupted. "—and grab the presents from under the tree. But we *do not* touch the piñata. That thing’s got a history." Bill raised an eyebrow. "A history?" Molly nodded. "Last time I saw that thing, it was in a dumpster behind a carnival. I think it’s cursed." Bill burst out laughing. "You’re the one who stole it from the carnival." "Details!" Molly snapped. "Just don’t let it drop on your head." Bill wiped his eyes. "Deal. But if we get caught, I’m telling everyone you were the one who suggested this." Molly smirked. "And if we get caught, you’re the one who suggested we wear matching Santa hats." Bill groaned. "I *hate* you."
created_at
"2026-02-20T20:10:45.107Z"
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
267
moderation_latency
(null)
generation_time
2487
tokens_prompt
77
tokens_completion
526
native_tokens_prompt
69
native_tokens_completion
483
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000828
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"ca066899af5e4fa0a884e34dc6c91a4c"
is_byok
false
latency
266
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618245-T5Seotmnb4BZwL7K0Sub"
upstream_id
"ca066899af5e4fa0a884e34dc6c91a4c"
total_cost
0.0000828
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result   Evaluator                     Details                            Meta Data
0.00%    Matches word count            n/a                                neededClean: false, words: 334
0.00%    Dialogue to Total Word Ratio  Ratio: 69.16%, Deviation: 20.84%   neededClean: false, wordsTotal: 334, wordsDialogue: 231
0.0000%  Overall
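The ratio and deviation figures in the evaluation details can be reproduced with a short calculation. The sketch below is one plausible way to measure them, assuming dialogue means any text between double quotation marks and words are whitespace-separated tokens. The benchmark's actual evaluator is not shown on this page, so the function names and the counting rule are hypothetical.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_stats(text: str) -> tuple[int, int, float]:
    """Return (dialogue words, total words, dialogue ratio) for a scene.

    Assumption: dialogue is whatever sits between straight or curly double quotes.
    """
    spoken = re.findall(r'["“]([^"”]*)["”]', text)
    words_dialogue = sum(count_words(s) for s in spoken)
    words_total = count_words(text)
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_dialogue, words_total, ratio


def deviation_from_target(ratio: float, target: float = 0.90) -> float:
    """Absolute distance between the measured ratio and the requested one."""
    return abs(target - ratio)


if __name__ == "__main__":
    # Counts reported for this run: wordsDialogue = 231, wordsTotal = 334.
    ratio = 231 / 334
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation_from_target(ratio):.2%}")
```

With the reported counts this gives a ratio of 69.16% and a deviation of 20.84% from the 90% target, matching the details row. The response also runs to 334 words against a requested 200, which is consistent with the 0.00% word-count score.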