NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPJ51AJ9EWXV3HB4WX7C21
Run Details
Model:
Mistral Large 3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
14.1s
Cost:
$0.001083
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
10.1000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of the Tipsy Turtle pub smelled like stale beer and regret, which was fitting, since that’s exactly what Molly and Bill were serving up tonight. Molly leaned against the sticky table, spinning a butter knife between her fingers like it was a lockpick. Bill, meanwhile, was hunched over a crumpled napkin, scribbling what he swore was a "foolproof plan" but looked more like a toddler’s finger-painting. "You’re telling me," Molly said, eyeing the napkin, "that our grand strategy involves dressing up as clowns?" Bill didn’t look up. "Not just clowns. *Beloved* clowns. The kind kids trust. The kind parents hire to make their spawn forget they’re being raised by screens and sugar." Molly sighed. "I hate clowns. They’re just adults in face paint who’ve given up on dignity." "Exactly! It’s perfect. No one suspects the guy who’s already lost his soul." Bill finally met her gaze, his eyes alight with the kind of misplaced confidence that usually preceded disaster. "We roll in, we distract, we grab the loot. Easy." Molly flicked the butter knife. It clattered against a half-empty pint glass. "And how, pray tell, do we distract a room full of children? Juggle? Balloon animals? Because I draw the line at making a giraffe out of latex." Bill grinned. "Better. We tell them there’s a *piñata* in the back." Molly blinked. "That’s… actually not terrible." "See? I *do* have good ideas." Bill tapped the napkin. "We say it’s a *magic* piñata. One that only appears if they all close their eyes and count to ten. Then—bam—we’re out the back door with the presents before they even finish screaming ‘ten!’" Molly considered this. "And what if some kid *doesn’t* close their eyes? What if little Timmy McTrustissues over there peeks?" Bill waved a hand. "Then we improvise. I’ll trip over my own giant shoes, you’ll honk the horn on your nose, and we’ll make a run for it. Classic clown chaos." Molly groaned. "I can’t believe I’m agreeing to this." "Because you love me," Bill said, batting his eyelashes. "Because I love *money*," Molly corrected. "And because if we pull this off, we can finally afford to rob a place that doesn’t smell like juice boxes and despair." Bill clapped his hands. "Then it’s settled! Tomorrow, we become the most terrifying thing a child can imagine: *fun*." Molly picked up the butter knife again. "I swear to God, Bill, if I end up in jail because of a five-year-old’s birthday party, I’m taking you down with me." Bill grinned. "Wouldn’t have it any other way, partner." He raised his pint. "To the heist of the century." Molly clinked her glass against his. "To not getting caught by a mob of sugar-crazed kindergarteners." Outside, the pub’s neon sign buzzed and flickered, as if even it knew this was a terrible idea.
created_at
"2026-02-20T14:16:49.975Z"
2/20/2026, 2:16:49 PM
model
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
381
moderation_latency
(null)
generation_time
14081
tokens_prompt
77
tokens_completion
748
native_tokens_prompt
69
native_tokens_completion
699
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001083
router
(null)
provider_responses
0
endpoint_id
"fa718841-07ea-4b7e-b8a3-6069545c7e6a"
id
"8fc764fd203249d086a37e850646e28e"
is_byok
false
latency
381
model_permaslug
"mistralai/mistral-large-2512"
1/1/2512, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597009-nIZTXELMzwtUJrhmGhH3"
upstream_id
"8fc764fd203249d086a37e850646e28e"
total_cost
0.001083
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
480
0.01%
Dialogue to Total Word Ratio
Ratio: 63.54%, Deviation: 13.54%
neededClean
false
wordsTotal
480
wordsDialogue
305
10.1000%