NC Bench
Run: 2026-02-20-01KHY9XY7AN6KTTEEBHDTPVZ56
Run Details
Model: Ministral 3 14B
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 3.0s
Cost: $0.000093
Provider: Mistral
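The Duration and Cost shown here can be cross-checked against the raw response metadata further down the page. The sketch below is a minimal consistency check, not the site's actual logic. It assumes that latency and generation_time are in milliseconds, that Duration is their sum, and that Cost is the usage value rounded to six decimals. None of these rules is stated on the page. The tokens-per-second figure is a derived illustration only.

```python
# Values copied from this run's metadata.
latency_ms = 207            # "latency"
generation_time_ms = 2778   # "generation_time"
usage_usd = 0.0000928       # "usage" / "total_cost"
native_tokens_completion = 395

# Assumption: Duration = latency + generation time, in seconds.
duration_s = (latency_ms + generation_time_ms) / 1000
duration_display = f"{duration_s:.1f}s"

# Assumption: Cost is the usage value rounded to six decimal places.
cost_display = f"${usage_usd:.6f}"

# Derived throughput over the generation phase only (illustrative).
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(duration_display)  # 3.0s
print(cost_display)      # $0.000093
print(f"{tokens_per_second:.1f} tokens/s")
```

Both displayed values agree with the metadata under these assumptions, which suggests the page rounds for display rather than reporting separate measurements.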
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 96.17%
Overall: 48.0866%
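The page does not show how the "Dialogue to Total Word Ratio" evaluator counts words. As a rough illustration only, the sketch below assumes a word is any whitespace-separated token and that dialogue is any text between curly double quotes. The names `count_words` and `dialogue_ratio` are hypothetical, and the benchmark's real rules may differ, so this sketch is not expected to reproduce the reported counts exactly.

```python
import re

# Assumption: dialogue is the text between curly double quotes.
# Straight quotes and other quotation styles are ignored in this sketch.
DIALOGUE_SPAN = re.compile(r"“([^”]*)”")


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(scene: str) -> tuple[int, int, float]:
    """Return (dialogue words, total words, dialogue / total)."""
    total = count_words(scene)
    spoken = sum(count_words(span) for span in DIALOGUE_SPAN.findall(scene))
    ratio = spoken / total if total else 0.0
    return spoken, total, ratio


sample = "“You sure about this?” Molly whispered. “Relax, Mol,” Bill said."
spoken, total, ratio = dialogue_ratio(sample)
print(spoken, total, f"{ratio:.2%}")  # 6 10 60.00%
```

Nested single quotes inside a spoken line, which appear in the model response for this run, stay inside the enclosing double-quoted span and are counted as dialogue.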
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse smelled like stale popcorn and regret, the kind of regret that comes from realizing you’ve just agreed to rob a *children’s party*. Molly adjusted her ski mask, the elastic digging into her forehead like a guilty conscience. Bill leaned against a crate of deflated balloons, twirling a candy cane between his fingers like it was a baton. “You sure about this?” Molly whispered, eyeing the security camera feed on her tablet. A blur of giggling kids and a piñata shaped like a dragon filled the screen. “Last time we hit a party, we got chased by a *goldfish*.” Bill grinned, his breath smelling faintly of caramel. “Relax, Mol. This one’s different. The parents are all too busy stuffing their faces at the buffet to notice a couple of ‘lost’ partygoers. And if they do? We’ll just play along. ‘Oh, hi! We’re here for the *free cake*!’” Molly crossed her arms. “And if someone asks why we’re wearing ski masks?” “Then we’ll say we’re *undercover detectives*.” He mimed adjusting invisible glasses. “‘We’re here to solve the mystery of the *missing cupcakes*.’” She groaned. “We’re gonna get caught.” “Nah,” Bill said, popping the candy cane into his mouth. “Worst case, we blend in. Hand out some lollipops, tell a few jokes—kids love that. By the time they figure out we’re not here for the juice boxes, we’ll be long gone with enough presents to open our own *Toys R Us*.” Molly sighed, already imagining the look on the parents’ faces when they realized their carefully wrapped gifts were now “borrowed.” “You’re a monster.” Bill winked. “And you love me.”
created_at: "2026-02-20T19:55:19.156Z" (2/20/2026, 7:55:19 PM)
model: "mistralai/ministral-14b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 207
moderation_latency: (null)
generation_time: 2778
tokens_prompt: 77
tokens_completion: 442
native_tokens_prompt: 69
native_tokens_completion: 395
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0000928
router: (null)
provider_responses:
  0:
    endpoint_id: "0b53717b-9af5-4de3-80af-ebedd2adf981"
    id: "d6c8b16610b44fe388b8bdcc23edf86c"
    is_byok: false
    latency: 207
    model_permaslug: "mistralai/ministral-14b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771617319-YXjXe7czOPvAyK8EqRtw"
upstream_id: "d6c8b16610b44fe388b8bdcc23edf86c"
total_cost: 0.0000928
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Matches word count: 0.00%
  Details: n/a
  Meta Data: neededClean: false; words: 270
Dialogue to Total Word Ratio: 96.17%
  Details: Ratio: 54.44%, Deviation: 4.44%
  Meta Data: neededClean: false; wordsTotal: 270; wordsDialogue: 147
Overall: 48.0866%
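The ratio and deviation reported for the dialogue evaluator follow directly from the two word counts and the 50% target in the scenario. The sketch below reproduces that arithmetic. It also checks that the overall figure is consistent with an unweighted mean of the two evaluator scores, which is an assumption: the page does not state the aggregation rule, nor how a 4.44% deviation maps to a 96.17% score.

```python
# Values copied from the evaluation details for this run.
words_total = 270
words_dialogue = 147
target_ratio = 0.50  # the scenario asks for 50% dialogue

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
details = f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}"
print(details)  # Ratio: 54.44%, Deviation: 4.44%

# Assumption: Overall is the unweighted mean of the two evaluator scores.
word_count_score = 0.00
overall = 48.0866
implied_ratio_score = 2 * overall - word_count_score
print(f"{implied_ratio_score:.2f}")  # 96.17
```

Under that assumption the unrounded dialogue score would be about 96.1732%, which rounds to the 96.17% displayed. The word count evaluator scores 0.00% because the response has 270 words against a requested 200.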