NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8YW8BG9SPCBAZY6Q0CP0J
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
2.2s
Cost:
$0.000070
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
13.60%
Dialogue to Total Word Ratio
(details)
6.7979%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the neon sign outside the *Sugar Rush Surprise Party* flickered like a drunk firefly. Molly adjusted her sunglasses—inside, she’d look like a normal partygoer, not the woman who’d just spent three hours rigging the place with her own brand of "safety features." Bill leaned against the fence, chewing on a stolen lollipop. "You sure about this?" he asked, wiping his mouth. "Kids scream. Parents scream. The cops scream. It’s a full-circle disaster." Molly tossed him a tiny velvet pouch. "Relax, genius. We’re not here to *steal from* kids. We’re here to *steal for* kids. Ever seen a kid’s allowance? It’s like stealing from a goldfish—except the goldfish has more dignity." Bill groaned. "You’re insane. Also, you just compared me to a goldfish." "Fair. But at least goldfish don’t complain about the *lack of a plan*." She pointed at the house. "That’s the cake table. That’s the piñata. And that’s—" She squinted. "Oh no. That’s *Santa’s sleigh*." Bill’s eyes widened. "You’re not suggesting—" "Of course I am. It’s a *sleigh*. It’s got wheels. It’s got *magic*." She grinned. "We hotwire it, drive it straight to the toy store, and come back with a truckload of *actual* presents. Then we sell them to the parents who *definitely* won’t ask questions." Bill stared. "You’re evil." Molly patted his arm. "No, darling. I’m *efficient*." She pulled out a tiny remote. "Now, let’s see if Santa’s sleigh still has its original battery."
created_at
"2026-02-20T19:38:21.335Z"
2/20/2026, 7:38:21 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
184
moderation_latency
(null)
generation_time
1980
tokens_prompt
77
tokens_completion
400
native_tokens_prompt
69
native_tokens_completion
396
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00006975
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"7206e0a19c1f4784913df03ba6fd79d8"
is_byok
false
latency
184
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771616301-aSYp8gPFDpDuBpaBjKCq"
upstream_id
"7206e0a19c1f4784913df03ba6fd79d8"
total_cost
0.00006975
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
244
13.60%
Dialogue to Total Word Ratio
Ratio: 61.89%, Deviation: 11.89%
neededClean
false
wordsTotal
244
wordsDialogue
151
6.7979%