NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY986SSRYZJWYTJ795BJKZG
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
2.6s
Cost:
$0.000069
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering "Happy Birthday!" balloon cast long shadows across the table as Molly swirled her coffee, her fingers tapping impatiently against the chipped mug. "Alright, Bill. We’re in. Tonight’s the night." Bill leaned back in his chair, arms crossed, grinning like a man who’d just been handed a secret recipe for world peace. "Oh, we’re in, all right. But let’s not get ahead of ourselves. First rule of heisting a kids’ party: *no* loud noises. Second rule: *no* candy bar fingerprints on the door handle. And third—" "Third rule is we don’t get caught," Molly interrupted, deadpan. "Which, by the way, is the same as the first rule." Bill waved a dismissive hand. "Details. So, the plan’s simple. We slip in through the back door—" "The back door’s got a *security camera*," Molly cut in, already shaking her head. "And a *doorman* who looks like he bench-presses toddlers for fun." "Ah, but we’re not using the back door." Bill leaned in, eyes gleaming. "We’re using the *bouncy castle*." Molly blinked. "The *what* now?" "The bouncy castle," Bill repeated, as if she’d just asked him to explain gravity. "We climb in, wait for the parents to get distracted, and *bounce* right into the party like we’re part of the fun. Kids love us. Parents? They’ll be too busy watching their little monsters to notice two adults in a castle full of sugar-rushed chaos." Molly considered this, then sighed. "And if someone *does* notice?" Bill grinned. "Then we tell them we’re the new clowns. Works every time."
created_at
"2026-02-20T19:43:27.045Z"
2/20/2026, 7:43:27 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
150
moderation_latency
(null)
generation_time
2333
tokens_prompt
77
tokens_completion
409
native_tokens_prompt
69
native_tokens_completion
390
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00006885
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"7dc13a5564fe40d8baf01d9884911010"
is_byok
false
latency
150
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771616607-HE5loeFzo1V7O1uu1chs"
upstream_id
"7dc13a5564fe40d8baf01d9884911010"
total_cost
0.00006885
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
261
0.00%
Dialogue to Total Word Ratio
Ratio: 66.28%, Deviation: 23.72%
neededClean
false
wordsTotal
261
wordsDialogue
173
0.0000%