NC Bench
Run: 2026-03-18-01KM07QX2EN6GXTGYF8KQY1EXX
Run Details
Model: Mistral Small 4
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 8.5s
Cost: $0.000510
Provider: Mistral
Evaluation summary
Score      Evaluator
0.00%      Matches word count
65.59%     Dialogue to Total Word Ratio
32.7939%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain tapped against the window of the stolen minivan like tiny, impatient fingers. Molly shifted in the passenger seat, her neon-pink ski mask crinkling with every move. Bill drummed his gloved fingers on the steering wheel, the other hand tapping away at a GPS screen that read *10:47 PM—Right on Schedule*. "You sure this place is a goldmine?" Molly asked, peeling back her mask just enough to take a sip from the flask she kept in her coat. "I mean, it's a kids' party. Not exactly Fort Knox." Bill exhaled through his nose, the sound equal parts amusement and exasperation. "Molls, Molls, Molls. You’re thinking too small. These aren’t just kids. These are *rich* kids. You ever seen a toddler’s birthday party in the Hamptons? It’s not streamers and juice boxes. It’s caviar cucumber sandwiches and *actual* gold-wrapped presents." Molly squinted out the window. "So we’re hitting a party where the kids get better snacks than we do?" "Exactly. And while those little rich brats are eating those little caviar things, we’re walking out with, what, twenty grand in toys?" Bill leaned back, grinning. "I did recon. Security’s some old guy who fell asleep by the punch bowl two years ago. Kid’s magician’s got a crush on the nanny. The caterer’s sneaking champagne to the valets. This place is softer than a marshmallow." Molly pulled the flask away and wiped her mouth. "And the presents?" "Scattered like breadcrumbs. All in one room. Unsupervised. They trust the parents *and* the kids to keep their grubby mitts off until they’re ‘ready.’" He air-quoted the last word. "Spoiler: They’re never ready. That’s our window." Molly nodded, then frowned. "What if there’s, like, a *My Little Pony* collection in there?" Bill waved a hand. "We’re not looting sentimental value. We’re hitting the *big* items. That Lego set that costs more than my first car. The stuffed unicorn with the diamond-studded horn. The *actual* diamond-studded horn, Molly." She groaned. "That’s so specific." 
"Priorities. But the key is we’re in and out before the cops even think about ‘pedestrian safety.’" He tapped the GPS again. "Party ends at eleven. We hit at eleven-oh-five. Tops." Molly finally pulled her mask down, her dark eyes sharp. "What if a kid sees us?" Bill barked a laugh. "A kid? That’s perfect. Kid yells ‘STRANGER DANGER!’—we’re already gone. Fastest getaway route’s through the backyard, down the alley, and into the sewer system via the broken manhole cover I *definitely* checked earlier." Molly blinked. "Wait, you *checked* the sewer?" "Scout’s honor." He made a vague gesture. "Besides, worst-case scenario, we fake a medical emergency. Nothing cuts through suspicion like a guy clutching his chest and a woman screaming ‘HELP, HE’S DYING!’ Turns out rich parents would rather surrender their own keys than risk a liability lawsuit." Molly sighed, pulling her mask back up. "You’re a monster." "And you’re the one who suggested robbing a *children’s party* first." Bill grinned, then turned the key in the ignition. "Ready?" Molly adjusted her gloves, the latex snapping against her fingers. "Let’s go steal Christmas. Or whatever this is." "Holiday of capitalism achieves maximum irony," Bill said, shifting the van into drive. "Lights out, presents out, *rollicking good time*."
created_at
"2026-03-18T10:26:56.732Z"
3/18/2026, 10:26:56 AM
model
"mistralai/mistral-small-2603"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
401
moderation_latency
(null)
generation_time
8404
tokens_prompt
77
tokens_completion
883
native_tokens_prompt
81
native_tokens_completion
830
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00051015
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"03eb2b2c9e284d6787c94e1a00972dac"
is_byok
false
latency
401
model_permaslug
"mistralai/mistral-small-2603"
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829616-79LwP4mj49xqZH67Sc0U"
upstream_id
"03eb2b2c9e284d6787c94e1a00972dac"
total_cost
0.00051015
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
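The timing and token fields above can be combined into a rough throughput figure. This is a minimal sketch under a stated assumption: `latency` and `generation_time` are taken to be milliseconds, which the page does not say explicitly but which is roughly consistent with the 8.5s Duration. The function name is hypothetical.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Completion tokens generated per second of generation time.

    Assumes generation_time_ms is in milliseconds (an assumption, not
    something the run metadata states).
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the run metadata above.
native_tokens_completion = 830
generation_time_ms = 8404
latency_ms = 401

throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
end_to_end_ms = latency_ms + generation_time_ms  # time to first token + generation

print(f"{throughput:.1f} tokens/s over {end_to_end_ms} ms end to end")
```

The native token count is used rather than `tokens_completion` because it is the count reported for the provider's own tokenizer; either choice is a judgment call for a rough estimate like this.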
Evaluation details
Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 536

Result: 65.59%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 67.10%, Deviation: 2.90%
Meta Data: neededClean: false, wordsTotal: 544, wordsDialogue: 365

Overall: 32.7939%
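The ratio and deviation reported above can be reproduced from the two word counts. The following is a minimal sketch, not NC Bench's evaluator: the function names are hypothetical, the 70% target is taken from the scenario, and treating the overall figure as an unweighted mean of the two evaluator scores is an assumption that happens to fit the numbers on this page. How the 2.90% deviation maps to the 65.59% score is not shown on the page, so the sketch does not attempt it.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Absolute distance, in percentage points, from the target ratio."""
    return abs(ratio_pct - target_pct)


def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation)."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(scores) / len(scores)


# Values copied from the evaluation details above.
ratio = dialogue_ratio(words_dialogue=365, words_total=544)
deviation = deviation_from_target(ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints: Ratio: 67.10%, Deviation: 2.90%

overall = overall_score([0.0, 65.59])
```

Averaging the displayed 0.00% and 65.59% gives 32.795%, which agrees with the reported 32.7939% to within the rounding of the displayed ratio score.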