NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ08TKNC59CXCD0JVS3ANS1J
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
6.4s
Cost:
$0.000890
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon and yeast hung thick in the morning air, a comforting blanket over the clink of ceramic. Steam curled from two chipped mugs on the scarred wooden counter. “You’re using the good vanilla again.” It wasn’t a question. “The wedding cake samples need to taste like something special, don’t they? Unlike some of the… substitutions lately.” “Substitutions happen. Sugar was delivered late. The butter shipment looked a bit pale.” “Pale butter doesn’t keep me awake at night. Missing vanilla extract does. And your flour supplier’s been cutting corners for weeks. I can taste the difference.” A soft sigh blended with the hiss from the espresso machine. “What do you want me to say? Business is tight. Everyone’s looking to save a penny.” “Save a penny, maybe. But Mrs. Higgins’ blue ribbon apple pie won’t taste right with off-brand apples. It could cost you more than a penny. It could cost you the county fair prize.” A long pause stretched between them, filled only by the hum of the refrigerator. A flour-dusted finger tapped twice on the counter. “You notice everything, don’t you?” “Someone has to, around here. Pass the powdered sugar.”
created_at
"2026-04-24T17:32:59.445Z"
4/24/2026, 5:32:59 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
758
moderation_latency
(null)
generation_time
6248
tokens_prompt
81
tokens_completion
551
native_tokens_prompt
311
native_tokens_completion
424
native_tokens_completion_images
(null)
native_tokens_reasoning
169
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00089048
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"67a74a418b0147568b1335b6b0fa0cd0"
is_byok
false
latency
661
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777051979-joLkTpe9E4q17iFXFmQp"
session_id
(null)
api_type
"completions"
id
"gen-1777051979-I3QVxVixz2k7U5buCz8A"
upstream_id
"67a74a418b0147568b1335b6b0fa0cd0"
total_cost
0.00089048
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)