NC Bench
Run: 2026-02-20-01KHYBAK0A9PCGV8458MBP5GDT
Run Details
Model: Ministral 3 8B
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 2.0s
Cost: $0.000056
Provider: Mistral
Evaluation summary
Score: 60.65%
Evaluator: Count dialogue tags
Details Link: (details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning sun spilled through the bakery’s windows, casting golden dust over the counter where a fresh batch of cinnamon rolls cooled. The scent of vanilla and warm butter filled the air as the bell above the door chimed. "Three almond croissants, please," a voice said, smooth and practiced. "And a coffee—black, no sugar." The baker, arms laden with flour dust, wiped his hands on his apron. "You’re early. The usual?" "Not today. Something’s off." The voice paused, then added, "Did you hear about the old Miller place? Empty for weeks now, and last night, someone swore they saw a shadow moving behind the windows." The baker exhaled through his nose, stirring a pot of chai on the stove. "Rumors, that’s all. The town’s been jumpy since that letter showed up at the post office—signed with red ink, no return address." A pause. Then, quieter: "I found one in my mailbox yesterday. Just a single word: *Watch.*" The baker’s fingers stilled. "You didn’t tell the sheriff?" "No. Not yet." The voice lowered. "What if it’s not just a prank? What if someone’s trying to scare us out?" The baker’s gaze flicked to the bakery’s front window, where the glass fogged with condensation. "Then we’d better start watching *each other.*"
created_at: "2026-02-20T20:19:42.231Z" (2/20/2026, 8:19:42 PM)
model: "mistralai/ministral-8b-2512"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 242
moderation_latency: (null)
generation_time: 1771
tokens_prompt: 81
tokens_completion: 333
native_tokens_prompt: 68
native_tokens_completion: 306
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0000561
router: (null)
provider_responses:
  0:
    endpoint_id: "c85d26e1-1e0c-449b-9775-2afce7ae510b"
    id: "aabd704d9af94a2e92a1c71de885b7a1"
    is_byok: false
    latency: 242
    model_permaslug: "mistralai/ministral-8b-2512"
    provider_name: "Mistral"
    status: 200
api_type: "completions"
id: "gen-1771618782-1Q0OCSkBZFEvGZq4qCQ8"
upstream_id: "aabd704d9af94a2e92a1c71de885b7a1"
total_cost: 0.0000561
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Mistral"
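The metadata above allows a quick sanity check of the headline figures in Run Details. The sketch below assumes, without confirmation from the page, that latency and generation_time are in milliseconds and that the reported Duration is roughly their sum. The variable names are illustrative only and are not part of NC Bench.

```python
# Values copied from the run metadata above.
latency_ms = 242                 # assumed to be milliseconds
generation_time_ms = 1771        # assumed to be milliseconds
native_tokens_completion = 306
total_cost = 0.0000561           # assumed to be US dollars

# Assumption: the displayed Duration is latency plus generation time.
duration_s = (latency_ms + generation_time_ms) / 1000

# Completion throughput over the generation phase only.
tokens_per_s = native_tokens_completion / (generation_time_ms / 1000)

print(f"duration: {duration_s:.1f}s")
print(f"throughput: {tokens_per_s:.1f} tokens/s")
print(f"cost: ${total_cost:.6f}")
```

Under these assumptions the computed duration and cost round to the 2.0s and $0.000056 shown in Run Details, which suggests, but does not prove, that the page derives them this way.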
Evaluation details
Result: 60.65%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags:
    0: "a voice said"
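The page does not show how the "Count dialogue tags" evaluator works or how the 60.65% score is derived from the single tag it found. As a rough illustration only, the following sketch detects attributions that directly follow a closing quote. The function name `find_dialogue_tags`, the verb list, and the regular expression are all hypothetical and are not NC Bench's implementation.

```python
import re

# Hypothetical, deliberately small list of speech verbs.
SPEECH_VERBS = ("said", "asked", "replied", "whispered", "muttered", "answered")
VERB_ALTERNATION = "|".join(SPEECH_VERBS)

# Closing punctuation plus a closing quote, then one to three words
# followed by a speech verb, e.g.  ...please," a voice said
TAG_PATTERN = re.compile(
    r'[,?!.]["\u201d]\s+((?:[A-Za-z]+\s+){1,3}(?:' + VERB_ALTERNATION + r'))\b'
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return attribution phrases that directly follow a closing quote."""
    return TAG_PATTERN.findall(text)


# A few sentences excerpted (not contiguous) from the model response above.
SAMPLE = (
    '"Three almond croissants, please," a voice said, smooth and practiced. '
    'The baker’s fingers stilled. "You didn’t tell the sheriff?" "No. Not yet."'
)

print(find_dialogue_tags(SAMPLE))
```

On this excerpt the sketch flags the same phrase listed under dialogueTags. A pattern like this misses tags that precede the quote or use verbs outside the list, so it is a starting point and not a faithful reproduction of the evaluator.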