NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYXEE54XEK7PKGVFK0QPPAF
Run Details
Model:
Mistral Small Creative
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
2.2s
Cost:
$0.000129
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
0.0182%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The backseat of Bill’s stolen ice cream truck smelled like expired sprinkles and regret. Molly adjusted her glittery party hat—stolen from a yard sale—and squinted at the blueprint of *Chuckie Cheerful’s Mega-Birthday Extravaganza*. “You’re sure this place has *good* loot?” Bill asked, wiping chocolate syrup off his sleeve. “Last time we hit a kid’s party, all we got was a My Little Pony and a half-eaten cupcake.” Molly tapped the blueprint. “This isn’t some back-alley piñata smash-and-grab. This is *high-end* toddler wealth. We’re talking iPads, unopened LEGO sets, and—get this—a *limited-edition* Baby Yoda action figure.” Bill’s eyes widened. “The one with the *glow-in-the-dark* lightsaber?” “Bingo. And the security? A single rent-a-cop named *Gary* who’s probably napping in the balloon animal station.” Bill scratched his chin. “What’s the play? We waltz in, grab the goodies, and bolt?” Molly grinned. “Better. We *blend in*. I’ve got a fake mustache, a fanny pack full of juice boxes, and a *very* convincing ‘Auntie Molly’ backstory. You’re my *emotionally unstable* uncle who ‘just loves kids.’” Bill groaned. “I *hate* kids.” “Exactly. You’ll be *too* convincing.” She tossed him a clown nose. “Put this on. And try not to scare any of them into therapy.” Bill sighed, shoving the nose onto his face. “If I get finger-painted on, I’m billing you.” Molly revved the engine. “Deal. Now let’s go rob some *innocence*.”
created_at
"2026-02-08T15:20:43.792188+00:00"
2/8/2026, 3:20:43 PM
model
"mistralai/mistral-small-creative-20251216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
217
moderation_latency
(null)
generation_time
2074
tokens_prompt
61
tokens_completion
381
native_tokens_prompt
69
native_tokens_completion
407
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000129
router
(null)
provider_responses
0
id
"2e81352a26b9431fb2bdeb65396598f1"
status
200
is_byok
false
latency
217
endpoint_id
"0118a65e-79f9-48b0-924e-38c4a41378e1"
provider_name
"Mistral"
model_permaslug
"mistralai/mistral-small-creative-20251216"
api_type
"completions"
id
"gen-1770564040-gNpAISWbdGEunjBDv7FQ"
upstream_id
"2e81352a26b9431fb2bdeb65396598f1"
total_cost
0.000129
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
230
0.01%
Dialogue to Total Word Ratio
Ratio: 67.65%, Deviation: 17.65%
neededClean
false
wordsTotal
238
wordsDialogue
161
0.0182%