NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXQ1EXR99R93ARFQ08KNT8H
Run Details
Model:
Mistral Medium 3.1
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
10.2s
Cost:
$0.001424
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0012%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the *Dizzy Dinosaur Bounce House* flickered through the minivan’s windshield as Molly adjusted her oversized clown wig in the rearview mirror. Beside her, Bill fiddled with a walkie-talkie, his fake mustache already peeling at the edges. “Remind me again why we’re robbing a six-year-old’s birthday party?” Bill muttered, tugging at the itchy fabric of his rented elf costume. Molly shot him a look. “Because, genius, rich kids get *good* presents. We’re talking iPads, limited-edition sneakers, maybe even a Bitcoin wallet from some tech-bro dad trying to look cool.” Bill squinted at the chaos unfolding beyond the windshield—sugar-high children screaming, parents chugging wine from plastic cups, a harried magician dropping his rabbits. “Yeah, but have you *seen* these kids? They’re like feral raccoons. One wrong move and we’re getting tackled by a swarm of sticky-fingered maniacs.” Molly tossed him a bag of glitter bombs. “That’s what these are for. Distraction and misdirection. Classic heist tactics.” “Glitter bombs?” Bill held one up like it was a live grenade. “You want to *glitter-bomb* a bunch of kids?” “Relax, it’s non-toxic. Mostly.” She grinned. “Besides, have you ever tried to clean glitter out of carpet? Parents will be so busy vacuuming for the next decade, they won’t even notice the PlayStation’s missing.” Bill groaned. “I knew we should’ve stuck to banks.” “Banks have *security*. This place has a piñata and a guy in a mascot suit who looks like he’s one tequila shot away from quitting.” Molly checked her watch. “Alright, Phase One: you cause a diversion by ‘accidentally’ setting off the confetti cannon near the cake table. I’ll slip into the present pile, swap the good stuff with the decoy gifts we wrapped—” “Hold on.” Bill held up a finger. “We *wrapped* decoy presents?” Molly rolled her eyes. “Yes, Bill. Because nothing says ‘innocent birthday guest’ like showing up with a suspiciously unlabeled box. Ours say *‘To Timmy, Love Aunt Carol’*—generic, forgettable, and full of rocks.” Bill peered at the gift pile, where a towering box labeled *‘XBOX – DO NOT SHAKE’* glinted under the party lights. “What if the rocks tip someone off?” “Please. Half these parents are too drunk to notice if we replaced the presents with actual rocks.” Molly cracked her knuckles. “Now, you ready?” Bill took a deep breath, adjusted his elf hat, and muttered, “I went to college for this.” Molly clapped him on the back. “Cheer up. If we get caught, we’ll just say we’re the entertainment. ‘Surprise! It’s a *heist*-themed party!’” Bill opened the van door, then paused. “…That’s actually not a bad idea.” Molly smirked. “Told you. Now go make it rain.”
created_at
"2026-02-20T14:25:11.618Z"
2/20/2026, 2:25:11 PM
model
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1020
moderation_latency
(null)
generation_time
10080
tokens_prompt
77
tokens_completion
758
native_tokens_prompt
69
native_tokens_completion
698
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
48
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0014236
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"0822713bf6514f73b86e2702fbf62517"
is_byok
false
latency
1019
model_permaslug
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597511-lziubt6iSBiGO3gJCWIw"
upstream_id
"0822713bf6514f73b86e2702fbf62517"
total_cost
0.0014236
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
449
0.00%
Dialogue to Total Word Ratio
Ratio: 64.59%, Deviation: 14.59%
neededClean
false
wordsTotal
449
wordsDialogue
290
0.0012%