NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHXQ9QWAVXX0EAZ8RPR9DF2D
Run Details
Model:
Mistral Medium 3.1
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
6.1s
Cost:
$0.000698
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
68.10%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
34.0511%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rainbow-bright piñata swayed ominously in the breeze, its papier-mâché grin taunting Molly from across the backyard. She adjusted her oversized "Party Mom" name tag—*Glenda*—and sidled up to Bill, who was currently failing to blend in while aggressively clutching a plate of deviled eggs. "Stop eating like a man who’s never seen food," Molly muttered, nudging his elbow as a cluster of sugar-high six-year-olds shrieked past. "We’re here for the *haul*, not the hors d’oeuvres." Bill wiped mayo from his chin with the back of his sleeve. "These eggs are *art*. Also, I’m method acting. Glenda’s husband, *Bret*, is a man of simple pleasures." Molly pinched the bridge of her nose. "Bret doesn’t exist. And neither will our freedom if you keep calling the birthday boy ‘boss.’" A tiny pirate barged into Bill’s shins, demanding to know where the treasure chest was. Bill, ever the professional, dropped to one knee. "Ahoy, matey! The *real* treasure is the friendships we—" Molly yanked him up by his clip-on tie. "Focus. The gift table’s by the bounce house. Three minutes. In, out, no kids traumatized." Bill sighed, brushing egg crumbs off his "World’s Okayest Dad" shirt. "Fine. But if we’re doing this, I’m taking the unicorn plushie. It’s *looking* at me."
created_at
"2026-02-20T14:29:42.932Z"
2/20/2026, 2:29:42 PM
model
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
243
moderation_latency
(null)
generation_time
6051
tokens_prompt
77
tokens_completion
342
native_tokens_prompt
69
native_tokens_completion
335
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0006976
router
(null)
provider_responses
0
endpoint_id
"13c5002b-8c5b-490c-8a5f-52ec0242804f"
id
"f6e43a1b9dfb4a79b05b384be9f8f236"
is_byok
false
latency
240
model_permaslug
"mistralai/mistral-medium-3.1"
3/1/2001, 12:00:00 AM
provider_name
"Mistral"
status
200
api_type
"completions"
id
"gen-1771597782-oK7doNIjm8vMgpwYPQAA"
upstream_id
"f6e43a1b9dfb4a79b05b384be9f8f236"
total_cost
0.0006976
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
68.10%
Matches word count
n/a
neededClean
false
words
214
0.00%
Dialogue to Total Word Ratio
Ratio: 44.39%, Deviation: 34.39%
neededClean
false
wordsTotal
214
wordsDialogue
95
34.0511%