NC Bench
Run: 2026-02-20-01KHY90TS4P3EB4YW6X72CJFX2
Run Details
Model: Ministral 3 3B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 2.2s
Cost: $0.000069
Provider: Mistral
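The cost shown above is consistent with simple arithmetic on the native token counts recorded in this run's metadata (69 prompt, 619 completion): 688 tokens at 0.10 USD per million tokens gives 0.0000688, which displays as $0.000069. The sketch below checks that arithmetic and derives a rough generation throughput. The flat per-token rate is inferred from these numbers alone and is an assumption, not a price stated on this page. The function name is hypothetical, and `generation_time` is assumed to be in milliseconds.

```python
def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  usd_per_million: float = 0.10) -> float:
    """Cost in USD under an ASSUMED flat rate for prompt and completion tokens."""
    return (prompt_tokens + completion_tokens) * usd_per_million / 1_000_000


# Values copied from the run metadata (native_tokens_prompt, native_tokens_completion).
cost = estimate_cost(69, 619)

# generation_time is assumed to be milliseconds; 619 completion tokens in 1945 ms.
tokens_per_second = 619 / (1945 / 1000)

print(f"estimated cost: ${cost:.7f}")
print(f"throughput: {tokens_per_second:.0f} tokens/s")
```

If the provider actually bills prompt and completion tokens at different rates, the function would need two rate parameters. The single rate is used here only because it reproduces the reported total.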
Evaluation summary
Score | Evaluator
0.00% | Matches word count
44.09% | Dialogue to Total Word Ratio
22.0456% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
**Title: *The Great Toy Heist*** The back door of the McGinty mansion creaked open with a sound like a sleepy toddler’s sneeze. Molly adjusted her wire-rimmed glasses and gave Bill a once-over. "You’re *still* wearing those flip-flops? This is a *war*, Bill. We’re not sneaking in like a couple of kindergarteners." Bill grinned, flipping his own flip-flops around his ankles. "Relax, Molly. I’ve got the *look*. The *mood*. I’m basically a pirate who just forgot his sword." Molly rolled her eyes but couldn’t help laughing. "You’re a menace. But fine, let’s do this. The party’s in the basement, right? And the security system’s just a kid with a fire alarm?" Bill nodded, already peeking around the corner. "Yep. And the presents are stacked in the corner like a giant, glittery Christmas tree. Perfect." A loud *BANG* echoed from upstairs. Molly gasped. "Oh no. The parents heard us." Bill groaned. "We’re doomed." Molly grabbed her bag of tools—mostly a butter knife and a hairpin—and zipped it shut. "Then we improvise. We’ll just… *blend in*." Bill groaned again. "You’re worse than a toddler with a toy gun." They crept down the stairs, Molly’s sneakers making *thud-thud* noises that sounded suspiciously like a marching band. The basement door was ajar, and inside, the air smelled like cotton candy and regret. "Okay," Bill whispered, "this is it. The presents are *right there*—" A shadow loomed. A kid, maybe eight years old, stood in the doorway, holding a fire extinguisher like it was a sword. "Who’s in there?" Bill froze. Molly’s eyes widened. "Oh no. *Not* the kid with the fire extinguisher." The boy stepped forward, arms crossed. "You two look like you’re up to no good. What’s your game?" Molly sighed. "We’re just… *borrowing* some toys." The kid smirked. "Borrowing? That’s a fancy word for *stealing*." Bill gulped. "We’re *not* stealing. We’re *helping*." The kid tilted his head. "How?" Molly pulled out her butter knife. "We’re gonna *swap* presents. 
You give us yours, we give you *ours*." The kid’s eyes lit up. "You’re *geniuses*." Bill groaned. "We’re *terrible* at this." The kid grinned. "Then let’s make it *fun*." And just like that, the Great Toy Heist turned into the *Great Toy Swap*—with a kid who just wanted to be a hero.
created_at: "2026-02-20T19:39:25.36Z" (2/20/2026, 7:39:25 PM)
model: "mistralai/ministral-3b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 190
moderation_latency: (null)
generation_time: 1945
tokens_prompt: 77
tokens_completion: 630
native_tokens_prompt: 69
native_tokens_completion: 619
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0000688
router: (null)
provider_responses:
  0:
    endpoint_id: "b9f6828f-aef9-498a-91f5-6816fbf72420"
    id: "8c03132b349b4bf188475a0e229a8fa7"
    is_byok: false
    latency: 190
    model_permaslug: "mistralai/ministral-3b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771616365-TN7RcwHWcG7nZXaIWWGZ"
upstream_id: "8c03132b349b4bf188475a0e229a8fa7"
total_cost: 0.0000688
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 383
44.09% | Dialogue to Total Word Ratio | Ratio: 45.95%, Deviation: 4.05% | neededClean: false, wordsTotal: 383, wordsDialogue: 176
22.0456% | Overall
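The figures in the evaluation details follow from simple arithmetic. With 176 dialogue words out of 383 total, the ratio is 45.95%, which is 4.05 percentage points from the 50% target. The overall 22.0456% agrees, up to rounding of the displayed 44.09%, with the mean of the two evaluator scores. The sketch below reproduces those numbers. The function names are hypothetical, and averaging is inferred from the reported values rather than from a documented rule. The mapping from the 4.05% deviation to the 44.09% score is not shown on this page, so it is not reconstructed here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the requested one."""
    return abs(ratio - target)


def overall_score(scores: list[float]) -> float:
    """ASSUMED aggregation: unweighted mean of the evaluator scores (in percent)."""
    return sum(scores) / len(scores)


# Values copied from the evaluation details above.
ratio = dialogue_ratio(words_dialogue=176, words_total=383)
deviation = deviation_from_target(ratio)
overall = overall_score([0.0, 44.09])

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
print(f"Overall (mean of displayed scores): {overall:.3f}%")
```

The same arithmetic shows why the word-count evaluator is the dominant penalty in this run: a 0.00% score on one of two equally weighted evaluators caps the overall result at half of the other score.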