NC Bench
Run: 2026-02-20-01KHYB03S3GJ3PMWG0NQRYD3E6
Run Details
Model: Ministral 3 8B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3.2s
Cost: $0.000093
Provider: Mistral
Evaluation summary
Score | Evaluator
0.00% | Matches word count
42.22% | Dialogue to Total Word Ratio
21.1080% | Overall
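The overall figure appears to be the unweighted mean of the two evaluator scores. That is an assumption inferred from the numbers shown here, not something the page states, and the function name below is hypothetical. A minimal sketch:

```python
def overall_score(scores):
    """Unweighted mean of per-evaluator scores, in percent (assumed aggregation)."""
    return sum(scores) / len(scores)

# Scores as displayed in the evaluation summary.
displayed_scores = [0.00, 42.22]
print(f"{overall_score(displayed_scores):.2f}%")
```

With the displayed scores this gives 21.11%, which agrees with the reported 21.1080% to within the rounding of the 42.22% figure.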
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering "Happy Birthday!" sign cast long shadows across the backyard as Molly adjusted the strap of her duffel bag, her breath fogging in the crisp autumn air. Bill leaned against the fence, twirling a candy cane between his fingers like it was a sword. "You sure about this?" Bill asked, his voice dripping with skepticism. "Last time we hit a kid’s party, we got chased by a piñata full of marbles." Molly shot him a look. "That was *one* time, and it was *their* fault for leaving the piñata unlocked. Besides, this place is *packed* with presents. You ever seen a warehouse full of toys? This is like a heist *and* a toy drive." Bill scoffed. "You’re comparing stealing from a bunch of kids to, like, robbing a bank?" "Exactly! Banks have guards, but kids have *no* sense of self-preservation. They’ll just hand you the presents like it’s a game of pass-the-parcel." Bill crossed his arms. "And what’s the play? We just walk in, grab everything, and—what, *wave* at the parents?" Molly grinned. "No, we *distract* them. You’re the lookout. I’ll be the one in the clown suit." Bill’s eyes widened. "*Clown?* Molly, I *hate* clowns." "Perfect. Everyone hates clowns. They’ll *beg* you to leave. Meanwhile, I’ll be in there like a furry little tornado, stuffing presents into this bag while the parents are too busy screaming to notice." Bill groaned. "You realize if we get caught, we’re not just stealing presents—we’re stealing *childhoods*." Molly waved a hand. "Relax. Worst case, we get arrested. Best case, we’re rich and famous. Either way, we win." Bill sighed, but there was a smirk tugging at his lips. "Fine. But if I see a single balloon animal, I’m throwing it back at you." Molly clapped her hands. "Deal. Now, let’s go make some kids *very* happy—*temporarily*." Bill adjusted his hat, muttering, "I’m gonna need a therapist after this." Molly laughed, already slipping on her oversized clown shoes. 
"Just think of it as a *public service*—free toys for the needy." Bill groaned again, but he followed her toward the back gate, already plotting how to explain this to the police.
created_at: "2026-02-20T20:13:58.957Z" (2/20/2026, 8:13:58 PM)
model: "mistralai/ministral-8b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 192
moderation_latency: (null)
generation_time: 2743
tokens_prompt: 77
tokens_completion: 573
native_tokens_prompt: 69
native_tokens_completion: 554
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00009345
router: (null)
provider_responses:
  0:
    endpoint_id: "c85d26e1-1e0c-449b-9775-2afce7ae510b"
    id: "ebc3c109ba1343a98cbac525bbac5534"
    is_byok: false
    latency: 192
    model_permaslug: "mistralai/ministral-8b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771618438-kdPYBzjASz6NHEecyZIK"
upstream_id: "ebc3c109ba1343a98cbac525bbac5534"
total_cost: 0.00009345
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 366
42.22% | Dialogue to Total Word Ratio | Ratio: 65.85%, Deviation: 4.15% | neededClean: false, wordsTotal: 366, wordsDialogue: 241
21.1080% | Overall
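The reported ratio and deviation can be reproduced from the word counts in the meta data. The sketch below is illustrative only: the function names are hypothetical, the 70% target is taken from the scenario, and counting dialogue as the words inside straight double quotes is an assumption about how the evaluator works. The page does not show how a 4.15% deviation maps to the 42.22% score, so that step is left out.

```python
import re

TARGET_RATIO = 70.0  # percent, from the scenario "Write 500 words with 70% dialogue"

def dialogue_ratio(words_dialogue, words_total):
    """Share of dialogue words among all words, in percent."""
    return 100.0 * words_dialogue / words_total

def count_dialogue_words(text):
    """Naive count of words inside straight double quotes (assumed method)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in spans)

# Word counts as reported in the evaluation details.
ratio = dialogue_ratio(241, 366)
deviation = abs(TARGET_RATIO - ratio)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

A quote-based counter like this would also count quoted non-dialogue text, such as the "Happy Birthday!" sign in the response, so its totals may differ from the evaluator's.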