NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAXR7YNAY2EPRG03FNH574
Run Details
Model:
Ministral 3 3B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2.5s
Cost:
$0.000076
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
**Title: *The Great Toy Heist*** The back door of the McGinty mansion creaked open with a sound like a sleepy toddler’s sneeze. Molly adjusted her scarf, ensuring it covered every inch of her face except her eyes—because, as Bill had reminded her, *"If they see your pupils, they’ll see you’re not a ghost."* "Okay, Bill," Molly whispered, "I’m ready. You’re the one who said we needed a distraction." Bill grinned, already pulling out his smoke bomb. "Distraction? More like *chaos*. You think they’ll just *let* us in?" "Relax, Bill. We’re not here to start a riot. We’re here to *steal* presents." A sharp *pop* filled the air as Bill lit the smoke bomb. Instantly, the backyard was shrouded in thick, acrid fog. Molly grabbed her bag—filled with lockpicks, a stolen key, and a single, very suspicious-looking spoon—and ducked behind a bush. "Whoa," Bill said, holding up a half-eaten cupcake. "This place is *literal gold.*" Molly rolled her eyes. "We’re not here for the snacks. We’re here for the *toys.*" A loud *BANG* echoed from inside the house. Molly and Bill froze. "Oh, come on," Bill muttered. "They’re probably just playing with the fire alarm." But then another *CRASH*—this time, from the kitchen. Molly’s stomach dropped. "Bill, we’re *not* getting caught." "Relax, Molly. We’re *professionals.*" He pulled out his phone. "I’ll text the cops. They’ll be here in five minutes to *help* us." Molly groaned. "You’re *insufferable.*" "Yeah, well, you’re the one who insisted on this *daring* plan." Bill tossed her a flashlight. "Here. Use this. If they see light, they’ll think we’re *vandals*." Molly took it, squinting into the fog. "Fine. But if we get caught, I’m blaming *you.*" "Deal." Bill grinned. "Now, let’s *steal* some toys." The front door burst open. A group of giggling kids spilled out, followed by a very confused-looking security guard. "Uh… guys?" the guard said. "You okay?" Molly and Bill exchanged a look. Then, with a dramatic sigh, Bill said, "We’re just here to *borrow* some toys. For our *very important*… uh… *art project.*" The kids cheered. The guard blinked. Molly and Bill bolted. "Run!" Bill yelled over their shoulders. Molly didn’t stop until they were halfway down the street, panting. "That was *terrible.*" "Yeah, but we got away with it!" Bill said, already pulling out his next smoke bomb. "Next time, we’re *stealing* the entire party." Molly groaned. "We’re *never* doing that again." Bill laughed. "Too late. You’re *hooked.*"
created_at
"2026-02-20T20:12:41.608Z"
2/20/2026, 8:12:41 PM
model
"mistralai/ministral-3b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
228
moderation_latency
(null)
generation_time
1880
tokens_prompt
77
tokens_completion
689
native_tokens_prompt
69
native_tokens_completion
686
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000755
router
(null)
provider_responses
0
endpoint_id
"b9f6828f-aef9-498a-91f5-6816fbf72420"
id
"033a937d64c249f28735d2033550da6e"
is_byok
false
latency
225
model_permaslug
"mistralai/ministral-3b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618361-fKetcWli8UNhbpY3qiKW"
upstream_id
"033a937d64c249f28735d2033550da6e"
total_cost
0.0000755
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
412
0.00%
Dialogue to Total Word Ratio
Ratio: 44.42%, Deviation: 25.58%
neededClean
false
wordsTotal
412
wordsDialogue
183
0.0000%