NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM07GKVH0EE33D1KSQPGHJ1E
Run Details
Model:
Mistral Small 4
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
6.3s
Cost:
$0.000393
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the fake mustache in the mirror, flicking an imaginary speck of lint from Bill’s borrowed blazer. The costume shop had smelled like old burritos, but the ensemble was convincing—if you squinted. Bill tugged at his collar, which felt like it was strangling a family of moths. "You look ridiculous," Molly said, flipping her wig to hide a smirk. "Better ridiculous than jail," Bill grunted, straightening a tie that screamed "desperate office temp." He glanced at the party flyer taped to the glove compartment. "Sunshine Bounce Festival. Ten kids. Thirty adults. Cake towers. Probably one of those tiny cupcakes I can’t resist." Molly smacked his wrist with the back of her hand. "Focus. Presents first. Birthday money is the easiest score. No alarms, no vaults, just a room full of loot in shiny paper." Bill scoffed, pulling out a folded map of the rec center. "You realize we’re dressed like pervy clowns disguised as dad at a soccer game, right?" "Performers. We’re performers. I’m Mrs. Whiskerstein the Magical Unicorn Lady. You’re Big Bill the Juggling Thunderbolt." She gestured grandly, nearly taking out a stack of parking permits. "Besides, who’s gonna question a group of adults playing dress-up at a children’s party?" Bill rubbed his temples. "The kids. The kids will question it. You ever see a five-year-old call out a scam?" Molly waved a dismissive hand. "Kids see what we want them to see. We hand out a few cheap rings, a couple of balloon animals, and they’re too busy licking frosting off their fingers to notice we’re emptying their goodie bags." Bill exhaled sharply through his nose. "You said this was clean. No people harmed. No witnesses." "Witnesses? They’ll throw us a thank-you cake and ask for an encore. Now come on, the parents are pooling their cars in the drop-off zone. All you gotta do is stroll in, look like you’ve adopted a child you’ve never met before, and slip into the gift room." Bill glared at the rec center sign glowing in the rearview like a warning beacon. "I hate this plan." Molly clapped him on the shoulder. "You hate every plan until you’re two minutes from the score and sweating less than a bacon-wrapped stripper at a health seminar." Bill checked the duffel bag at his feet. Inside, nestled beside zip ties and lock picks, was a half-eaten protein bar and a crumpled napkin that said "Win" in Sharpie. He sighed. "One hour. Tops. Then we’re gone before the piñata makes an appearance." Molly grinned, adjusting her horned headband. "One hour? I don’t know, Bill. Maybe we stay for the cake. Maybe we stay for the bounce house. Maybe we become the life of the party."
created_at
"2026-03-18T10:22:57.917Z"
3/18/2026, 10:22:57 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
430
moderation_latency
(null)
generation_time
6184
tokens_prompt
77
tokens_completion
696
native_tokens_prompt
81
native_tokens_completion
649
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00039291
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"6eee433f7e914f3bb7531e9de486d2cb"
is_byok
false
latency
430
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829377-8XVMQcRN47jnPZWP5IdY"
upstream_id
"6eee433f7e914f3bb7531e9de486d2cb"
total_cost
0.00039291
cache_discount
0.00000864
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
452
0.00%
Dialogue to Total Word Ratio
Ratio: 60.35%, Deviation: 30.35%
neededClean
false
wordsTotal
459
wordsDialogue
277
0.0000%