NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXPT53G1VHT7RAGSDEV3T12
Run Details
Model:
Mistral Medium 3.1
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
16.5s
Cost:
$0.000726
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.8117%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rainbow-bright piñata swayed ominously in the breeze, its papier-mâché grin taunting Molly from across the backyard. She adjusted her oversized "Party Mom" name tag—*Hi! I’m Carol!*—and sidled up to Bill, who was currently pretending to be a face-painting artist with all the subtlety of a bull in a tutu. His attempt at a butterfly on a six-year-old’s cheek looked suspiciously like a Rorschach test gone wrong. “You’re going to traumatize that kid,” Molly muttered, nudging a plate of half-eaten cupcakes aside with her hip. “Also, the clown’s watching you. The *real* one.” Bill wiped his hands on his apron, leaving streaks of glittery blue across his already questionable mustache. “Relax. I’m building trust. Parents love an artist who *commits*.” “You’re committing a crime against art.” She nodded toward the gift table, a fortress of wrapping paper and ribbon. “Three minutes till the cake. That’s our window.” “You sure the motion sensor’s disabled?” “Unless someone’s hiding in the sprinkler system, yeah.” Bill cracked his knuckles. “Then let’s make these kids the most disappointed generation since the ones who got coal.” Molly sighed. “You’re *supposed* to be the heartless one. I’m the one with the tragic backstory.” “Fine.” He grabbed a handful of balloons. “I’ll cry on cue. *‘Woe is me, my goldfish died…’*” “Your goldfish *was* a cracker.”
created_at
"2026-02-20T14:21:12.188Z"
2/20/2026, 2:21:12 PM
model
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2051
moderation_latency
(null)
generation_time
16465
tokens_prompt
77
tokens_completion
391
native_tokens_prompt
69
native_tokens_completion
349
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0007256
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"4c3ce62ee4c9471ab209f6df238b5d44"
is_byok
false
latency
2051
model_permaslug
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597272-bzKPfjWrBRxFpAsHW3ZO"
upstream_id
"4c3ce62ee4c9471ab209f6df238b5d44"
total_cost
0.0007256
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
224
0.00%
Dialogue to Total Word Ratio
Ratio: 44.20%, Deviation: 34.20%
neededClean
false
wordsTotal
224
wordsDialogue
99
1.8117%