NC Bench
Run: 2026-02-20-01KHXQF9BDQK6E7J7TQM7FF4TK
Run Details
Model: Mistral Medium 3.1
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 13.7s
Cost: $0.001594
Provider: Mistral
Evaluation summary
Score | Evaluator
100.00% | Matches word count
0.33% | Dialogue to Total Word Ratio
50.1665% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the *Dazzle & Delight Party Emporium* sign flickered like a dying firefly as Molly adjusted her oversized clown wig in the rearview mirror. Bill sat beside her, methodically peeling the wrapper off a stolen lollipop he’d swiped from the last job—a baptism, if anyone asked. “You sure this is the right move?” Bill sucked on the lollipop, the sticky sound making Molly’s eye twitch. “I mean, kids, Molly. *Kids.*” Molly shot him a look. “Kids are the easiest marks. They’re distracted by cake, they cry if you look at them wrong, and their parents are too busy Instagramming the ‘perfect moment’ to notice a couple of professionals doing their thing.” Bill turned the lollipop stick between his fingers. “Yeah, but last time we hit a kid’s place, that little demon with the glue stick nearly took my eyebrow off.” “That was a *craft fair*, Bill. Totally different energy.” Molly pulled a crumpled flyer from her pocket and smoothed it out. *“Little Timmy’s 6th Birthday Extravaganza—Pin the Tail on the Unicorn, Magical Pony Rides, and a Mountain of Presents!”* She grinned. “We’re talking high-end LEGO, Bill. Limited edition. The kind of stuff that resells for triple retail.” Bill squinted at the flyer. “Pony rides? You ever seen a pony up close? They’re judgmental. They *know* things.” Molly ignored him. “We go in as the entertainment. I’ll be *Sparkles the Clown*—” She gestured to her wig. “—and you’ll be *Bubbles the Balloon Artist*.” Bill held up a sad, deflated balloon animal he’d been fiddling with. “I can’t make balloon animals.” “You don’t have to. You just have to *look* like you can.” She tossed him a name tag that read *BUBBLES* in glittery letters. “Wear that and carry a pump. If anyone asks, you’re ‘between masterpieces.’” Bill sighed, sticking the tag to his shirt. “What’s the play once we’re inside?” Molly ticked points off on her fingers. “Phase one: I distract the parents with a ‘complimentary’ face-painting station. You scope the present table. 
Phase two: We swap the good stuff with the decoy gifts we brought—” She patted a duffel bag full of suspiciously light, rewrapped boxes. “—then we ‘accidentally’ knock over the cake. Chaos ensues. Phase three: We vanish into the night, richer and slightly more traumatized.” Bill chewed his lollipop thoughtfully. “What if the cake’s already been cut?” Molly blinked. “Then we knock over *Timmy*.” A beat. Bill nodded. “Fair.” Molly checked her watch. “We’ve got twenty minutes till showtime. Remember—no eye contact with the kids. They sense fear.” Bill adjusted his name tag. “What if one of them asks for a balloon sword?” “Tell them you’re *allergic to latex*.” “What if they ask why I’m sweating?” Molly grabbed his face and smeared his cheeks with greasepaint. “Now you’re *Bubbles the Overly Invested Balloon Artist*.” Bill stared at his reflection in the side mirror. “I look like a serial killer.” Molly clapped him on the back. “Perfect. Let’s go ruin a childhood.”
created_at
"2026-02-20T14:32:44.665Z"
2/20/2026, 2:32:44 PM
model
"mistralai/mistral-medium-3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
313
moderation_latency
(null)
generation_time
13590
tokens_prompt
77
tokens_completion
844
native_tokens_prompt
69
native_tokens_completion
783
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015936
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"706c98873316461b804a4f2f3a136de2"
is_byok
false
latency
313
model_permaslug
"mistralai/mistral-medium-3.1"
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597964-lBl50GTM6wjYv6pciuYd"
upstream_id
"706c98873316461b804a4f2f3a136de2"
total_cost
0.0015936
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
100.00% | Matches word count | n/a | neededClean: false, words: 501
0.33% | Dialogue to Total Word Ratio | Ratio: 60.68%, Deviation: 10.68% | neededClean: false, wordsTotal: 501, wordsDialogue: 304
50.1665% | Overall
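The ratio and deviation reported for the "Dialogue to Total Word Ratio" evaluator can be reproduced from wordsDialogue and wordsTotal. The sketch below is not the benchmark's actual evaluator. The function `dialogue_ratio` is a hypothetical helper that assumes dialogue is everything inside curly quotation marks and that words are whitespace-separated. The 50% target comes from the scenario. Reading the overall score as an unweighted mean of the two evaluator scores is an assumption that happens to fit the reported numbers. How the 0.33% score is derived from the deviation is not stated on this page, so it is not modeled here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Hypothetical helper: count words inside curly quotes vs. all words."""
    total = len(text.split())
    # Assumption: dialogue is delimited by “ and ” and never nests.
    quoted = re.findall(r"“([^”]*)”", text)
    dialogue = sum(len(q.split()) for q in quoted)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


# Values reported in the evaluation details of this run.
words_dialogue = 304
words_total = 501
target = 0.50  # the scenario asks for 50% dialogue

ratio = words_dialogue / words_total
deviation = abs(ratio - target)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Assumption: overall = unweighted mean of the two evaluator scores.
# Solving for the dialogue score that would give the reported overall:
overall_reported = 50.1665
word_count_score = 100.0
implied_dialogue_score = 2 * overall_reported - word_count_score
print(f"Implied dialogue score: {implied_dialogue_score:.3f}%")
```

Under that averaging assumption, the implied dialogue score is about 0.333%, which matches the 0.33% shown once rounded to two decimals.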