NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPQ9JFHD4ED62ZQAXZNEZP
Run Details
Model:
Mistral Large 3
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
10.7s
Cost:
$0.000795
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her glittery unicorn horn headband and squinted at the balloon arch sagging over the backyard gate. "You’re sure this is the right place? Because I’m pretty sure we just walked into a toddler’s fever dream." Bill scratched his chin, where a temporary tattoo of a dinosaur was peeling off. "Positive. Intel says the birthday kid’s dad is some tech bro who ‘disrupts’ birthday parties by giving out gold-plated action figures. We’re talking *heist of the century* here, Mol." "Or the most humiliating arrest record in history. ‘Suspect apprehended while attempting to flee on a tricycle.’" She plucked a cupcake from a nearby table and took a bite. "Also, why are we dressed like we’re in a *My Little Pony* spin-off?" "Camouflage. No one suspects the pony." Bill adjusted his rainbow wig, which had slipped over one eye. "Besides, the security’s just a guy in a clown suit. How hard can it be?" Molly pointed as the clown turned, revealing a walkie-talkie clipped to his polka-dot suspenders. "Oh, you *jinxed* us." Bill groaned. "Okay, new plan. You distract him with your *sparkle magic*, and I’ll—" "Absolutely not. Last time I used my ‘sparkle magic,’ I got glitter in my *eyebrows* for a week." "Fine. I’ll distract him. You grab the loot." Bill cleared his throat and waved at the clown. "Hey, uh… *Bozo*! Nice nose! You ever think about branching out into, like, *abstract expressionism*?" The clown’s smile didn’t waver, but his hand drifted toward his walkie-talkie. "Sir, this is a private event." Molly muttered, "We’re so going to jail." She grabbed a gift bag shaped like a rocket ship and shoved it into Bill’s arms. "Run." Bill clutched the bag like a football. "Which way?" "Any way that isn’t *toward the clown with a radio*!" Molly hissed, already sprinting toward the fence. Behind them, the clown’s voice crackled over the walkie-talkie: "Security to all units, we’ve got a code *Rainbow Dash* in progress."
created_at
"2026-02-20T14:19:38.46Z"
2/20/2026, 2:19:38 PM
model
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
402
moderation_latency
(null)
generation_time
10602
tokens_prompt
77
tokens_completion
523
native_tokens_prompt
69
native_tokens_completion
507
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000795
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"c4f6ec57c019404195eaa2de82c9b280"
is_byok
false
latency
402
model_permaslug
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597178-TDksM7bRFBFDxgU0oM1u"
upstream_id
"c4f6ec57c019404195eaa2de82c9b280"
total_cost
0.000795
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
331
0.00%
Dialogue to Total Word Ratio
Ratio: 58.61%, Deviation: 31.39%
neededClean
false
wordsTotal
331
wordsDialogue
194
0.0000%