NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYASY3ZSD0XQNW6BHKBXWMT
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
2.5s
Cost:
$0.000072
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party balloon arch pulsed like a drunk firefly outside the window, its reflection dancing across the sticky floorboards of Molly’s hideout—a converted storage closet beneath a toy store. Bill adjusted his wire-rimmed glasses, squinting at the blueprint he’d “borrowed” from the security guard’s lunchbox. “Ten p.m.,” he muttered, tracing the outline of the sliding glass door with a stubby finger. “That’s when the parents will be too busy Googling ‘how to stop kids from eating glitter’ to notice us slipping in like ghosts.” Molly leaned against the wall, arms crossed, her boot scuffing a loose floorboard. “Ghosts?” She scoffed, tossing a stolen candy cane between her fingers. “We’re more like *sneaky raccoons*—except raccoons have the decency to actually eat the food they steal.” She nudged a duffel bag overflowing with lockpicks and a hairdryer (for “distraction purposes”). “Also, I checked the security footage. There’s a golden retriever named Biscuit who thinks he’s a guard dog. He’s got the energy of a caffeinated squirrel.” Bill groaned, rubbing his temples. “Biscuit’s the reason we’re doing this at night. Daytime he’s a menace, nighttime he’s a *shadow*—just like us.” He pulled a tiny bell from his pocket, jingling it ominously. “This’ll be our *distraction bell*. Ring it, and all the kids’ll come running, leaving the presents wide open.” Molly grinned, snatching the bell. “And if Biscuit barks?” She asked, already imagining the chaos. Bill shrugged. “Then we improvise. Maybe we bribe him with a whole pizza.” He paused, then added, “Or we don’t. We’ll cross that bridge when we’re drowning in presents.”
created_at
"2026-02-20T20:10:36.554Z"
2/20/2026, 8:10:36 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
227
moderation_latency
(null)
generation_time
1895
tokens_prompt
77
tokens_completion
455
native_tokens_prompt
69
native_tokens_completion
414
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00007245
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"f42e1765ea714cb3a947e94e03197aa5"
is_byok
false
latency
227
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618236-DffDcHqq9lUfPuiKg91R"
upstream_id
"f42e1765ea714cb3a947e94e03197aa5"
total_cost
0.00007245
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
266
0.00%
Dialogue to Total Word Ratio
Ratio: 50.38%, Deviation: 40.38%
neededClean
false
wordsTotal
266
wordsDialogue
134
0.0000%