NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8SS17QYR0D68NQ9MG2B6D
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
2.0s
Cost:
$0.000058
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
17.5011%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party balloon arch pulsed like a drunk firefly outside the window, its cheerful chaos a siren call to Molly’s greed. She leaned against the peeling wallpaper of their safehouse—a former laundromat with the scent of bleach and regret—while Bill adjusted the strap of his duffel bag, which was already stuffed with a stolen fire extinguisher (for "emergencies") and a bag of stolen candy (for "moral support"). "Ten thousand reasons why this is a terrible idea," Bill muttered, poking at a half-melted lollipop with a toothpick. "Eleven thousand if we count the fact that we’ll be arrested by a guy who still believes in Santa." Molly smirked, twirling a lock of her hair around her finger like a thief’s charm. "Bill, my love, we’re not stealing from the kids. We’re liberating *their* parents’ hard-earned cash from under the tree. It’s like robbing a bank, but with more glitter and less paperwork." Bill groaned, rubbing his temples. "And what, exactly, is our exit strategy when we’re surrounded by a hundred hyperactive five-year-olds wielding sparklers like flamethrowers?" She grinned. "We don’t have one. That’s the fun part." Bill sighed, but his eyes lit up. "You’re impossible. And I love you." Molly winked. "Good. Because in thirty minutes, we’re stealing Christmas."
created_at
"2026-02-20T19:35:34.194Z"
2/20/2026, 7:35:34 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
231
moderation_latency
(null)
generation_time
1736
tokens_prompt
77
tokens_completion
346
native_tokens_prompt
69
native_tokens_completion
319
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000582
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"c8f86fea195a4da79f0dd2bf890d674a"
is_byok
false
latency
231
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771616134-5GVcA6BmgwwkPMLsiknw"
upstream_id
"c8f86fea195a4da79f0dd2bf890d674a"
total_cost
0.0000582
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
218
0.00%
Dialogue to Total Word Ratio
Ratio: 48.62%, Deviation: 38.62%
neededClean
false
wordsTotal
218
wordsDialogue
106
17.5011%