NC Bench
Run: 2026-02-20-01KHYAD68VJKW0WA4TA5TF0JR8
Run Details
Model: Ministral 3 3B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 2.6s
Cost: $0.000076
Provider: Mistral
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 0.00% | Matches word count |
| 0.09% | Dialogue to Total Word Ratio |
| 0.0459% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the party hall pulsed like a bioluminescent warning sign, casting jagged shadows across the polished floor. Molly adjusted her gloves—too tight, but better than nothing—and peered through the window, her breath fogging the glass. "Bill, you’re *not* wearing a mask," she muttered, though she knew he’d already forgotten. Bill grinned, his teeth too white, his eyes too bright. "Relax, Molly. I’m the *face* of this operation. The kids will love me." He flexed his fingers, already unzipping his jacket. "Besides, if they see a masked guy, they’ll start screaming. We need *charisma*." Molly rolled her eyes. "You’re a disaster. What if they ask for your name?" "Then I’ll say, ‘Bill the Brave.’" He clapped his hands together. "And if they ask for my age? ‘Oh, I’m *very* old.’" Molly groaned. "You’re impossible." "Nah, I’m *perfect*." Bill pulled out his phone, scrolling through a list of partygoers. "First target: the guy with the gold watch. He’s got a safe. Probably full of candy bars." Molly scoffed. "You’re *not* counting on that." "Counting on what?" Bill asked, already mid-stride toward the door. "On you getting caught." She grabbed his arm. "We’re not just stealing presents. We’re *stealing* the *experience* of childhood. The joy. The *sugar rush*." Bill laughed, a sound like a wind chime in a hurricane. "Oh, Molly, you’re *so* dramatic." He yanked her along. "Come on, or I’ll have to *kidnap* you first." The door creaked open, and they slipped inside, the scent of cotton candy and laughter thick in the air. Molly’s stomach twisted. "We’re *doomed*." Bill grinned. "Nah. We’re *legendary*." He pointed at a table laden with wrapped gifts. "First move: distract the parents. Then we *steal*." Molly hesitated. "What if someone sees us?" "Then we’ll say we’re *volunteers*." Bill’s eyes gleamed. "We’ll have a *story*." Molly sighed. "You’re *insane*." "Nah, I’m *brilliant*." He grabbed a nearby chair, flipping it over with a dramatic flourish. 
"Let’s go, *thieves*." The partygoers turned, confused, as the chair crashed to the floor. Molly’s heart pounded. "We’re *not* doing this." Bill winked. "Too late." He grabbed a gift bag, already yanking it open. "First present—*guaranteed* to be a *hit*." Molly lunged, but it was too late. The bag was already in his pocket. "Bill!" "Shhh!" He pressed a finger to his lips. "We’re *in*." And just like that, the greatest heist of all time began—one that would leave the kids with nothing but empty boxes and a very confused teacher.
created_at: "2026-02-20T20:03:38.923Z" (2/20/2026, 8:03:38 PM)
model: "mistralai/ministral-3b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 209
moderation_latency: (null)
generation_time: 2030
tokens_prompt: 77
tokens_completion: 692
native_tokens_prompt: 69
native_tokens_completion: 693
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0000762
router: (null)
provider_responses:
  0:
    endpoint_id: "b9f6828f-aef9-498a-91f5-6816fbf72420"
    id: "ac25412fb784418d85122f8a55fd9168"
    is_byok: false
    latency: 209
    model_permaslug: "mistralai/ministral-3b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771617818-wrvc6B9HS9wuGrUPj4nM"
upstream_id: "ac25412fb784418d85122f8a55fd9168"
total_cost: 0.0000762
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.00% | Matches word count | n/a | neededClean: false, words: 416 |
| 0.09% | Dialogue to Total Word Ratio | Ratio: 41.83%, Deviation: 11.83% | neededClean: false, wordsTotal: 416, wordsDialogue: 174 |
| 0.0459% | Overall | | |
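
The Ratio and Deviation figures reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the wordsDialogue and wordsTotal values above. The following is a minimal Python sketch, not the benchmark's actual evaluator: the function names are hypothetical, the 30% target is taken from the scenario description, and counting dialogue as whitespace-separated words inside straight double quotes is an assumption. It does not attempt to reproduce how the 0.09% score is derived from the deviation, since the page does not state that.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated words (assumed tokenization)."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.30) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Values reported in the evaluation details for this run.
ratio, deviation = ratio_and_deviation(words_dialogue=174, words_total=416)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With the reported counts, 174 / 416 is about 41.83%, which is 11.83 percentage points above the 30% target, matching the Details column.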