NC Bench
Run: 2026-02-20-01KHXQ6V15838CYNF6NRP8BEPP
Run Details
Model: Mistral Large 3
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.6s
Cost: $0.001156
Provider: Mistral
Evaluation summary
Matches word count: 51.93%
Dialogue to Total Word Ratio: 0.00%
Overall: 25.9628%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of the Tipsy Turtle Pub smelled like stale beer and regret, which was fitting, since Molly and Bill were knee-deep in both. Molly swirled a lukewarm gin and tonic in her glass, the ice long since melted into sad little ghosts of their former selves. Across the table, Bill was hunched over a crumpled napkin, scribbling what he insisted was a "foolproof plan" but looked more like a toddler’s first attempt at hieroglyphics.
"You’re telling me," Molly said, squinting at the napkin, "that the grand scheme involves us dressing up as clowns?"
Bill didn’t look up. "Not just clowns. *Beloved* clowns. The kind kids trust. The kind parents hire without checking references."
Molly took a long sip. "I once saw a clown at a birthday party when I was six. He tripped over his own shoes, knocked over the cake, and then cried. I still have nightmares."
"Exactly!" Bill jabbed the napkin with his pen. "That’s the energy we’re going for. Authentic. Relatable. The kind of clown who makes parents think, *‘Well, at least he’s not a mime.’*"
Molly sighed. "And how, pray tell, do we get hired for this party? Last I checked, clowns don’t just materialize out of thin air like some kind of sad, balloon-animal-wielding specters."
Bill grinned. "We don’t get hired. We *become* the party."
Molly blinked. "I’m going to need you to explain that one like I’m five."
"Simple. We show up early, set up the decorations, inflate the bouncy castle, maybe even do a little face painting. By the time the parents realize we’re not the actual entertainment, we’ll already be halfway to the getaway van with a sack full of presents."
Molly rubbed her temples. "So your plan is to commit grand larceny by pretending to be the world’s worst party planners?"
"World’s *most* forgettable party planners," Bill corrected. "The kind no one remembers until they’re filing a police report."
Molly took another sip. "And what happens when the real clowns show up?"
Bill waved a hand. "We’ll be long gone by then. Besides, have you ever met a clown who wasn’t at least a little unhinged? They’ll probably just assume we’re part of the act."
Molly stared at him. "You’re serious about this."
"Dead serious. Also, I already bought the wigs."
Molly groaned. "You bought the wigs *before* running this by me?"
Bill shrugged. "I had a coupon."
Molly pinched the bridge of her nose. "Fine. But if I get arrested, I’m telling the cops you forced me into this."
Bill clapped his hands together. "That’s the spirit! Now, let’s talk about the van. I was thinking we go with something nondescript. Maybe a white panel van with no windows."
Molly raised an eyebrow. "You mean the universal symbol for ‘I am definitely not up to something shady’?"
Bill grinned. "Exactly. No one suspects the obvious."
Molly sighed and downed the rest of her drink. "I need another gin."
Bill slid his glass toward her. "Take mine. I’ve got a feeling we’re going to need all the liquid courage we can get."
created_at: "2026-02-20T14:28:07.857Z" (2/20/2026, 2:28:07 PM)
model: "mistralai/mistral-large-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 382
moderation_latency: (null)
generation_time: 15405
tokens_prompt: 77
tokens_completion: 800
native_tokens_prompt: 69
native_tokens_completion: 748
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0011565
router: (null)
provider_responses:
  0:
    endpoint_id: "fa718841-07ea-4b7e-b8a3-6069545c7e6a"
    id: "049ff3a5c7414ae89ba8741f112bf58d"
    is_byok: false
    latency: 382
    model_permaslug: "mistralai/mistral-large-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771597687-q4uTQjgAew56COL9gnck"
upstream_id: "049ff3a5c7414ae89ba8741f112bf58d"
total_cost: 0.0011565
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
Evaluation details
Matches word count: 51.93%
  Details: n/a
  Meta data: neededClean = false, words = 516
Dialogue to Total Word Ratio: 0.00%
  Details: Ratio: 69.38%, Deviation: 39.38%
  Meta data: neededClean = false, wordsTotal = 516, wordsDialogue = 358
Overall: 25.9628%
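The ratio and deviation reported above can be reproduced from the two word counts. The following is a minimal sketch, not the benchmark's own evaluator code: the function names are hypothetical, the 30% target is taken from the scenario text, and treating the overall score as an unweighted mean of the two evaluator scores is an assumption that happens to agree with the reported 25.9628% up to rounding of the displayed 51.93%. How the evaluators turn a deviation into a score is not shown on this page, so that step is left out.

```python
# Values copied from the evaluation details of this run.
TARGET_DIALOGUE_RATIO = 0.30  # from the scenario: "30% dialogue"
words_total = 516
words_dialogue = 358


def dialogue_ratio(dialogue_words: int, total_words: int) -> float:
    """Share of all words that fall inside dialogue."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return dialogue_words / total_words


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(scores) / len(scores)


ratio = dialogue_ratio(words_dialogue, words_total)
deviation = abs(ratio - TARGET_DIALOGUE_RATIO)

# Scores as displayed on the page, in percent.
overall = mean_score([51.93, 0.00])

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
print(f"Overall (assumed mean): {overall:.3f}%")
```

The first printed line matches the "Ratio: 69.38%, Deviation: 39.38%" detail: the response is roughly 69% dialogue against a 30% target, which is consistent with the 0.00% score for that evaluator.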