NC Bench
Run: 2026-02-20-01KHXQE4J9R2KA4SD9F19KVTH1
Run Details
Model: Mistral Medium 3.1
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 7.9s
Cost: $0.001278
Provider: Mistral
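The cost and duration shown here can be cross-checked against the token counts and timings in the Meta listing further down this page (`native_tokens_prompt`, `native_tokens_completion`, `generation_time`, `total_cost`). The sketch below is only a sanity check: the per-million-token prices are an assumption that does not appear anywhere on this page, and the function names are hypothetical.

```python
# Values copied from the Meta listing of this run.
NATIVE_TOKENS_PROMPT = 69
NATIVE_TOKENS_COMPLETION = 625
GENERATION_TIME_MS = 7743
TOTAL_COST_USD = 0.0012776

# ASSUMPTION: prices in USD per million tokens. They are not stated on this
# page and are used only to see whether the reported cost is plausible.
PRICE_IN_PER_M = 0.40
PRICE_OUT_PER_M = 2.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimated cost in USD under the assumed per-million-token prices."""
    micro_usd = prompt_tokens * PRICE_IN_PER_M + completion_tokens * PRICE_OUT_PER_M
    return micro_usd / 1_000_000


def tokens_per_second(completion_tokens: int, generation_ms: int) -> float:
    """Completion throughput over the reported generation time."""
    return completion_tokens / (generation_ms / 1000)


if __name__ == "__main__":
    est = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
    print(f"estimated cost: ${est:.7f} (reported: ${TOTAL_COST_USD:.7f})")
    rate = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
    print(f"throughput: {rate:.1f} tokens/s")
```

Under these assumed prices the estimate agrees with the reported `total_cost`; with different prices it would not, so treat the match as suggestive rather than as confirmation of the billing formula.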
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.86% | Dialogue to Total Word Ratio
0.4312% | Overall
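The overall figure looks consistent with an unweighted mean of the two evaluator scores. That is an inference from the numbers displayed, not something the page states, and `overall_score` is a hypothetical name. A minimal sketch:

```python
from statistics import mean


def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (ASSUMED aggregation rule)."""
    return mean(scores)


if __name__ == "__main__":
    # Scores as displayed in the summary; the dialogue score is shown
    # rounded to two decimals, so the result is only approximate.
    displayed = [0.00, 0.86]
    print(round(overall_score(displayed), 4))  # prints 0.43
```

The result, 0.43, is close to the displayed 0.4312; the small gap would be explained if the dialogue score is rounded for display before being averaged.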
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon glow of the *Dazzle & Delight Party Emporium* sign flickered like a dying firefly as Molly adjusted her oversized clown wig in the rearview mirror. Bill sat beside her, methodically peeling the wrapper off a stolen lollipop he’d swiped from the last job—a baptism, if memory served. “You realize we’re about to rob a bunch of six-year-olds, right?” Bill said, sucking the lollipop thoughtfully. “Morally, that’s like kicking a puppy. But with more glitter.” Molly shot him a look. “First of all, we’re not *robbing* them. We’re *redistributing* unearned wealth. Second, have you *seen* the price of a decent Lego set these days? Those kids are sitting on a gold mine of plastic bricks and sugar-induced chaos.” Bill squinted at the party through the windshield. A horde of tiny humans in paper crowns screamed as a man in a dragon costume belched confetti from a cannon. “Fair point. But what’s the play? We can’t just waltz in there with a sack and a ‘thanks, kids.’” Molly pulled a folded piece of paper from her pocket—an invitation she’d “borrowed” from a mailbox. “*‘Bring your favorite stuffed animal for the Pin the Tail on the Unicorn game!’*” she read. “We go in as Mr. Snuggles and Lady Whiskers, ‘accidentally’ knock over the present table, and *poof*—Santa’s little helpers just became Santa’s little *thieves*.” Bill stared. “You want us to dress as stuffed animals.” “Tactical stuffed animals.” “Molly, I have *standards*.” “You had standards before you ate a lollipop you found in a church donation basket.” Bill wiped his mouth. “That was a *dark* time.” A woman in a bedazzled “Party Mom” sash herded the children toward a bounce house. Molly adjusted her fake nose. “Clock’s ticking. You take the high road—distract the parents with your ‘charming’ small talk. I’ll hit the presents, stash ‘em in the diaper bag—” “We’re using a *diaper bag*?” “It’s got *pockets*, Bill.” He sighed. “Fine. 
But if I have to pretend to be a teddy bear, I’m drawing the line at hugs.” Molly grinned. “Too late. You *are* the hugs.” Bill groaned as she tossed him a pair of googly-eyed bear ears. “I hate my life.” “No, you don’t.” Molly fluffed her wig. “You love it. Almost as much as you love free cake.” Bill perked up. “There’s cake?” Molly opened the car door. “Let’s go earn it.”
created_at
"2026-02-20T14:32:06.997Z"
2/20/2026, 2:32:06 PM
model
"mistralai/mistral-medium-3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
311
moderation_latency
(null)
generation_time
7743
tokens_prompt
77
tokens_completion
664
native_tokens_prompt
69
native_tokens_completion
625
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0012776
router
(null)
provider_responses
0
  endpoint_id
  "13c5002b-8c5b-490c-8a5f-52ec0242804f"
  id
  "ff6c0b41e5da4de08a9d0383adcffaa4"
  is_byok
  false
  latency
  311
  model_permaslug
  "mistralai/mistral-medium-3.1"
  provider_name
  "Mistral"
  status
  200
api_type
"completions"
id
"gen-1771597927-xQAy4RwdiFBiwKOpZtwx"
upstream_id
"ff6c0b41e5da4de08a9d0383adcffaa4"
total_cost
0.0012776
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 400
0.86% | Dialogue to Total Word Ratio | Ratio: 59.75%, Deviation: 9.75% | neededClean: false, wordsTotal: 400, wordsDialogue: 239
0.4312% | Overall
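The ratio and deviation reported above follow directly from the two word counts and the 50% target in the scenario. The sketch below reproduces that arithmetic. It also includes one possible way to count dialogue words (text inside curly double quotes, split on whitespace); that counting rule is an assumption, the function names are hypothetical, and the sketch is not claimed to reproduce the evaluator's count of 239.

```python
import re

TARGET_RATIO = 0.50  # "50% dialogue" from the scenario


def dialogue_word_count(text: str) -> int:
    """Count whitespace-separated words inside curly double quotes.

    ASSUMPTION: the real evaluator's tokenisation and quote handling
    are not documented on this page.
    """
    quoted_spans = re.findall(r"“([^”]*)”", text)
    return sum(len(span.split()) for span in quoted_spans)


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of all words that are dialogue."""
    return words_dialogue / words_total


if __name__ == "__main__":
    # Counts taken from the Meta Data column above.
    ratio = dialogue_ratio(239, 400)
    deviation = abs(ratio - TARGET_RATIO)
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
    # prints: Ratio: 59.75%, Deviation: 9.75%
```

How the 9.75% deviation is mapped to the 0.86 score is not shown on this page, so the sketch stops at the deviation.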