NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYASGK7V18BBVET0KKAZXK2
Run Details
Model:
Ministral 3 8B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3.8s
Cost:
$0.000119
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
0.20%
Dialogue to Total Word Ratio
(details)
43.2906%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of a flickering "Happy Birthday, Timmy!" balloon cast long shadows across the living room floor as Molly adjusted the strap of her satchel, her fingers twitching with the kind of restless energy that only comes from plotting a crime against joy. Bill leaned against the doorframe, arms crossed, his reflection in the window looking suspiciously like a disgruntled raccoon who’d just been denied a trash can. "Alright, listen," Molly said, tapping her chin with a pen that was *definitely* not a pen—it was a lockpick disguised as one. "We’re not just stealing presents. We’re stealing *the spirit* of childhood. Every wrapped box is a tiny, unsuspecting heart, and we’re going to yank it right out." Bill groaned, rubbing his temples. "Molly, if we get caught, I’m telling the judge about the time you tried to sell your own teeth to a dentist for ‘emergency cash.’" "That was a misunderstanding!" she snapped. "He said ‘fillings’ and I said ‘filling’—it was a language barrier!" Bill sighed, rolling his eyes so hard they nearly popped out. "Fine. But we’re doing this *my* way. No loud noises, no fancy disguises—just a simple, elegant heist. We walk in, grab the presents, and walk out before the parents even realize their kid’s birthday is a lie." Molly grinned, pulling a tiny top hat from her pocket. "Oh, *now* you’re talking. Because I *love* a good disguise. I’m going in as the birthday clown. You’re the ice cream truck driver. Classic. Irresistible." Bill stared at her like she’d just suggested they rob a bakery by pretending to be a loaf of bread. "You want me to drive around in a truck that *sells ice cream* while you juggle balloons and tell bad jokes? Molly, I’m not a *performance artist*." "Relax," she said, tossing him a whistle. "You just honk when I say ‘go.’ Easy. Now, what’s the security like?" Bill checked his notes. "One security guard. Ex-bouncer. Name’s Gary. He’s got a mustache that looks like it was drawn on with a crayon and a temper that could curdle milk." Molly whistled. "Perfect. I’ll distract him with a ‘magic trick’—turns out, my ‘disappearing act’ involves making his wallet vanish. You take the presents, we split, and by the time Gary realizes his wallet is gone, we’ll be long gone with the toys." Bill hesitated. "What if he *doesn’t* have a wallet?" Molly waved a hand. "Then we improvise. Steal his *soul* instead." Bill groaned again, but this time, there was a hint of a smile. "You’re *insane*." "And you love me," she said, nudging him with her elbow. "Now, let’s go ruin a kid’s birthday before he even gets to open one present." Bill sighed, but he was already pulling out his own satchel. "Fine. But if we get caught, I’m blaming you for this." Molly grinned, adjusting her clown nose. "Deal. Now let’s go make some memories—*the bad kind*."
created_at
"2026-02-20T20:10:22.705Z"
2/20/2026, 8:10:22 PM
model
"mistralai/ministral-8b-2512"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
248
moderation_latency
(null)
generation_time
3591
tokens_prompt
77
tokens_completion
761
native_tokens_prompt
69
native_tokens_completion
726
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00011925
router
(null)
provider_responses
0
endpoint_id
"c85d26e1-1e0c-449b-9775-2afce7ae510b"
id
"10eae0b9f0574a7691068986603268e9"
is_byok
false
latency
248
model_permaslug
"mistralai/ministral-8b-2512"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771618222-vrV2vjGYfvqyGRrPqXxs"
upstream_id
"10eae0b9f0574a7691068986603268e9"
total_cost
0.00011925
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
489
0.20%
Dialogue to Total Word Ratio
Ratio: 61.15%, Deviation: 11.15%
neededClean
false
wordsTotal
489
wordsDialogue
299
43.2906%