NC Bench
Run: 2026-02-20-01KHYB6B3SDPPVRQBZ4SZD6V0M
Run Details
Model: Ministral 3 8B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3.4s
Cost: $0.000100
Provider: Mistral
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
58.66% | Dialogue to Total Word Ratio | (details)
29.3290% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering neon sign outside *The Sugarplum Ballroom* cast long, wobbly shadows across the cracked sidewalk. Molly adjusted the wide-brimmed hat perched atop her head, squinting at the building through the mesh of her sunglasses. Inside, the sound of squealing laughter and the occasional crash of a falling cup drifted out like a siren song for the desperate and the daring. Bill sidled up beside her, his breath smelling faintly of pickled herring and bad decisions. "You sure this is the place?" he asked, poking at the door with a butter knife he’d "borrowed" from a diner earlier that morning. "Last I checked, kids don’t throw parties in places that smell like a dumpster fire." Molly swatted his hand away. "It’s a *children’s* party, Bill. They’re not exactly known for their air quality standards." She leaned in, lowering her voice to a conspiratorial whisper. "Besides, the security’s a pair of overgrown babysitters with more energy drinks than brains. Piece of cake." Bill groaned. "You keep saying that. What if the cake’s got, like, *sprinkles*? What’s the plan for sprinkles?" "Bill, focus." Molly pulled a crumpled map from her pocket, unfolding it with dramatic flair. "We go in through the kitchen—there’s a back door, barely latched. Once inside, we split up. You handle the presents under the tree while I deal with the piñata. Then we’re out before the parents even realize their kids have been replaced by tiny, confused ninjas." Bill’s eyes widened. "You’re stealing *both* the presents *and* the piñata?" "Of course. Why leave anything behind? Unless you’ve got a sudden urge to start a piñata-based side hustle." Bill rubbed his temples. "You’re impossible. What if the piñata’s one of those *surprise* ones? Like, what if it’s full of *dirt*?" Molly sighed, pinching the bridge of her nose. "Bill, if it’s full of dirt, we’re getting *paid* in metaphorical gold. Now move. The bouncer’s starting to look suspicious." Bill hesitated, then nodded. 
"Alright, but if we get caught, I’m blaming you for the piñata." Molly grinned. "Deal. And Bill?" "Yeah?" "Don’t forget the glitter. They always have glitter." Bill groaned again, but this time, it was with amusement. "You’re a menace." He adjusted the bag slung over his shoulder—filled with what Molly insisted were "essential tools"—and followed her toward the back alley, where the night’s real work began.
created_at
"2026-02-20T20:17:23.076Z"
2/20/2026, 8:17:23 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
172
moderation_latency
(null)
generation_time
3112
tokens_prompt
77
tokens_completion
647
native_tokens_prompt
69
native_tokens_completion
600
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00010035
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"aa73ffea41fb49f99004ff4561539ffa"
is_byok
false
latency
172
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618643-2wITWdZX8x2lpWuYJmwo"
upstream_id
"aa73ffea41fb49f99004ff4561539ffa"
total_cost
0.00010035
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
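Two derived figures follow from the metadata above. This is a rough sketch: it assumes `generation_time` is reported in milliseconds and that prompt and completion tokens are billed at a single flat rate. Neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the run metadata above.
native_tokens_prompt = 69
native_tokens_completion = 600
generation_time_ms = 3112  # assumption: milliseconds
total_cost_usd = 0.00010035

# Completion throughput in tokens per second.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Implied blended price, assuming one flat rate for prompt and completion tokens.
total_native_tokens = native_tokens_prompt + native_tokens_completion
implied_usd_per_million = total_cost_usd / total_native_tokens * 1_000_000

print(f"{tokens_per_second:.1f} tokens/s")
print(f"${implied_usd_per_million:.4f} per million native tokens")
```

The implied rate is only a back-calculation from this one run, not a quoted price.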
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 398
58.66% | Dialogue to Total Word Ratio | Ratio: 53.27%, Deviation: 3.27% | neededClean: false, wordsTotal: 398, wordsDialogue: 212
29.3290%
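The reported ratio, deviation, and overall figure can be checked from the counts above. This is a minimal sketch, not the benchmark's own scoring code: it assumes the 50% target comes from the scenario text and that the overall score is the unweighted mean of the two evaluator scores. How a 3.27% deviation maps to a 58.66% score is not shown on this page, so that step is not reconstructed here.

```python
# Counts copied from the evaluation details above.
words_total = 398
words_dialogue = 212
target_words = 500    # from the scenario: "Write 500 words"
target_ratio = 0.50   # from the scenario: "50% dialogue"

# Dialogue share and its absolute distance from the target.
ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)

# How far the response fell short of the requested length.
words_short = target_words - words_total

# Evaluator scores as reported on this page, in percent.
word_count_score = 0.00
dialogue_ratio_score = 58.66

# Assumption: overall is the unweighted mean of the evaluator scores.
overall = (word_count_score + dialogue_ratio_score) / 2

print(f"ratio={ratio:.2%} deviation={deviation:.2%}")
print(f"short by {words_short} words, overall={overall:.2f}%")
```

The mean of the rounded scores, 29.33%, agrees with the reported 29.3290% to within rounding.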