NC Bench
Run: 2026-02-20-01KHYA9176JZ0V0GEPDNQT3CDS
Run Details
Model: Ministral 3 14B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 6.5s
Cost: $0.000183
Provider: Mistral
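The reported duration can be cross-checked against the `latency` and `generation_time` fields in the response metadata further down this page. The following is a minimal sketch, assuming both fields are in milliseconds and that the duration is roughly their sum; the page confirms neither assumption, and the helper names are hypothetical.

```python
# Values copied from the response metadata on this page.
# Assumption: both timing fields are expressed in milliseconds.
latency_ms = 233
generation_time_ms = 6197
native_tokens_completion = 847


def total_seconds(latency_ms, generation_time_ms):
    """Elapsed time in seconds, assuming duration = latency + generation time."""
    return (latency_ms + generation_time_ms) / 1000


def tokens_per_second(tokens, generation_time_ms):
    """Completion throughput over the generation phase only."""
    return tokens / (generation_time_ms / 1000)


elapsed = total_seconds(latency_ms, generation_time_ms)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"elapsed: {elapsed:.2f}s, throughput: {throughput:.1f} tokens/s")
```

Under these assumptions the elapsed time comes to 6.43s, which is consistent with the 6.5s shown above if the page rounds up to one decimal place.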
Evaluation summary
Score | Evaluator
0.00% | Matches word count
13.48% | Dialogue to Total Word Ratio
6.7423% | Overall
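The overall figure appears consistent with an unweighted mean of the two evaluator scores. The sketch below assumes that aggregation, which the page does not state, and uses the rounded scores as displayed; `overall_score` is a hypothetical helper name.

```python
from statistics import mean


def overall_score(evaluator_scores):
    """Unweighted mean of per-evaluator scores in percent (assumed aggregation)."""
    return mean(evaluator_scores)


# Rounded scores as displayed in the evaluation summary.
scores = {
    "Matches word count": 0.00,
    "Dialogue to Total Word Ratio": 13.48,
}

overall = overall_score(list(scores.values()))
print(f"{overall:.2f}%")
```

With the rounded inputs this gives 6.74%, which agrees with the reported 6.7423% to two decimal places; the extra digits on the page suggest the evaluator works from unrounded scores.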
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The backseat of Bill’s rusted-out van smelled like stale coffee and regret, which, honestly, was an improvement over the last job where it had smelled like regret and *also* regret. Molly adjusted her ski mask—black, because subtlety was the name of the game—and leaned back against the headrest, kicking her feet up on the dashboard. “You’re sure this is the right house?” she asked, peeling a gum wrapper off her finger and flicking it at Bill’s shoulder. It stuck. Bill didn’t even flinch. “Positive. Mrs. Henderson’s kid’s birthday. *‘A Magical Unicorn Adventure Party,’* according to the invite. Which, by the way, is the most ridiculous thing I’ve ever heard.” “Better than last time when we hit the *‘Gentlemen’s Poker Night’* at the country club,” Molly said, nodding. “At least here, the security’s gonna be a bunch of parents too busy herding toddlers to notice us.” Bill grinned, adjusting the fake mustache he’d taped over his upper lip. “Exactly. And the presents? Top-tier loot. Dolls, action figures, *a pony.*” Molly gasped. “A *pony?*” “Not a real one, obviously. But the kid’s gonna cry when we take it.” “Good. Kids are dramatic. Makes our job easier.” Bill pulled out a folded map and smoothed it on his lap. “Alright, here’s the plan. We go in as *‘aunties’*—” “Aunties?” Molly scoffed. “We look like we just robbed a bank.” “We *did* rob a bank,” Bill pointed out. “But we’re gonna play it off like we’re here for the *‘special cupcake delivery.’*” Molly groaned. “There’s no such thing.” “Exactly! So if anyone asks, we’re *very* confused but *very* committed to the bit.” She considered this. “Fine. But if I have to explain why we’re carrying crowbars in our purses, I’m throwing you under the bus.” Bill clapped his hands together. “Perfect. And while the parents are distracted by the *‘pin the tail on the donkey’* game—” “Which, by the way, is a *terrible* security measure,” Molly interrupted. 
“—we slip into the gift room, grab everything, and make our escape before the piñata gets smashed.” Molly raised an eyebrow. “You’re assuming the piñata *won’t* get smashed.” “Fair. But if it does, we’ll use that as a distraction. Chaos is our friend.” She nodded, then suddenly sat up straight. “Wait. What if the kid’s *allergic to something?*” Bill blinked. “Like what?” “Like, I don’t know, *balloons?*” “Balloons don’t kill people.” “You sure?” “Positive.” Molly sighed. “Fine. But if we accidentally murder a seven-year-old, I’m blaming you.” Bill laughed, revving the engine. “Relax. We’re professionals. This’ll be a walk in the park.” Molly eyed the crowbar in her lap. “Says the guy who once got caught stealing a single cookie from a gas station.” “That was *one time!* And the cookie was *stale!*” She smirked. “Alright, let’s do this. But if I get arrested, I’m telling them you made me.” Bill grinned, pulling the van into the street. “Deal. But if *you* get arrested, I’m telling them you *planned the whole thing.*” Molly groaned as they rolled up to the house, where a rainbow balloon arch stretched over the front door and the sound of children screaming in delight filled the air. “God help us,” she muttered. Bill adjusted his mustache. “No, *God help the presents.*”
created_at: "2026-02-20T20:01:22.674Z" (2/20/2026, 8:01:22 PM)
model: "mistralai/ministral-14b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 233
moderation_latency: (null)
generation_time: 6197
tokens_prompt: 77
tokens_completion: 949
native_tokens_prompt: 69
native_tokens_completion: 847
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0001832
router: (null)
provider_responses:
  0:
    endpoint_id: "0b53717b-9af5-4de3-80af-ebedd2adf981"
    id: "551fd6d6236e49ec8277bebdd685df7c"
    is_byok: false
    latency: 233
    model_permaslug: "mistralai/ministral-14b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771617682-16FihupoardBK1tyUqsA"
upstream_id: "551fd6d6236e49ec8277bebdd685df7c"
total_cost: 0.0001832
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 545
13.48% | Dialogue to Total Word Ratio | Ratio: 63.67%, Deviation: 6.33% | neededClean: false, wordsTotal: 545, wordsDialogue: 347
6.7423% | Overall
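The ratio and deviation reported for the dialogue evaluator can be reproduced from the word counts in its meta data. The sketch below is illustrative only: the quote-matching rule in `count_dialogue_words` is an assumption about how dialogue might be detected, not the evaluator's actual method, and all function names are hypothetical. How the 6.33% deviation maps to the 13.48% score is not shown on the page, so that step is not modelled.

```python
import re

TARGET_DIALOGUE_PERCENT = 70.0  # from the scenario: "70% dialogue"

# Assumption: dialogue is any text enclosed in straight or curly double quotes.
DIALOGUE_PATTERN = re.compile(r'[“"]([^”"]*)[”"]')


def count_words(text):
    """Count whitespace-separated tokens."""
    return len(text.split())


def count_dialogue_words(text):
    """Count the words that fall inside double-quoted spans."""
    return sum(count_words(span) for span in DIALOGUE_PATTERN.findall(text))


def dialogue_ratio_percent(words_dialogue, words_total):
    """Share of dialogue words in the total word count, in percent."""
    return 100.0 * words_dialogue / words_total


def deviation_percent(ratio_percent, target_percent=TARGET_DIALOGUE_PERCENT):
    """Absolute distance between the measured ratio and the target."""
    return abs(target_percent - ratio_percent)


# Word counts taken from the evaluator meta data (wordsDialogue, wordsTotal).
ratio = dialogue_ratio_percent(347, 545)
deviation = deviation_percent(ratio)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

For comparison, the word-count evaluator saw 545 words against the requested 500, an overshoot of 9%; the tolerance that turns this into a 0.00% score is not stated.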