NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9BVX1ARERHKBA96HB772T
Run Details
Model:
Ministral 3 3B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
1.4s
Cost:
$0.000043
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her sunglasses, squinting at the front door. "Bill, we’re gonna need a distraction. Like, *really* good." Bill wiped his hands on his jeans, grinning. "I’ve got it. We’ll just—what? Pretend to be lost parents?" Molly rolled her eyes. "Too obvious. We need something *funny*." "Okay, okay. What if we—" He pulled out his phone. "I’ve got a TikTok trend. ‘The Great Cookie Heist.’" Molly gasped. "You’re insane. That’s *not* how we do this." "Relax, it’s just a distraction. We’ll be gone before anyone notices." He tapped the screen. "See? Already got 12K views." Molly groaned. "We’re not doing that. We’re doing *nothing*. We just—" "Wait, wait. What if we *actually* get caught?" Bill leaned in. "We’ll just say it was an accident. ‘Oh, we were just—’" Molly pinched the bridge of her nose. "Bill, we’re *not* turning this into a comedy sketch." "Fine, fine. But what if we—" He pulled out a toy gun. "A *real* one." Molly stared. "Bill, that’s a toy. That’s *not* how we do this." "Relax, it’s just for show. We’ll—" He fumbled with the safety. "—bluff our way out." Molly sighed. "Fine. But if we get caught, we *both* go to jail." Bill grinned. "Deal. Now, where’s the party?"
created_at
"2026-02-20T19:45:26.957Z"
2/20/2026, 7:45:26 PM
model
"mistralai/ministral-3b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
179
moderation_latency
(null)
generation_time
952
tokens_prompt
77
tokens_completion
343
native_tokens_prompt
69
native_tokens_completion
365
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000434
router
(null)
provider_responses
0
endpoint_id
"b9f6828f-aef9-498a-91f5-6816fbf72420"
id
"5cf96f168951494896362fe69ace74ee"
is_byok
false
latency
179
model_permaslug
"mistralai/ministral-3b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771616726-a2AGHIhyZb3QIVFDBJMR"
upstream_id
"5cf96f168951494896362fe69ace74ee"
total_cost
0.0000434
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
208
0.00%
Dialogue to Total Word Ratio
Ratio: 68.75%, Deviation: 21.25%
neededClean
false
wordsTotal
208
wordsDialogue
143
47.9934%