NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ02T64TRYTSHZH9MD8RQFZR
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
10.0s
Cost:
$0.001391
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
60.65%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The air in Rosie’s Sweet Spot always smelled of cinnamon and hope. Today, the scent of baking bread mingled with the damp wool and melting snow brought in by each customer. “The ginger snaps are gone again,”
she said
, not looking up from her dough. “Three times this week?” Her voice was low, incredulous. “No one in this town buys ginger snaps in that quantity.” “They don’t just buy them. They take them. The box is simply… empty when I check. Always just before the lunch rush.” A stool creaked. “Could it be Edna from the library? She’s had a sweet tooth since the holidays.” “No. She pays for her scones with exact change and a long story about her cat.” The baker finally looked up, her flour-dusted face worried. “Someone is helping themselves to the display case in the back. Nothing else is touched. Not the marzipan, not the financiers. Just those specific, crunchy little cookies.” The other woman leaned in, her voice dropping to a conspiratorial whisper. “Then we need to find out why. And more importantly… why they’re so afraid of getting caught paying for them.”
created_at
"2026-04-24T15:47:54.148Z"
4/24/2026, 3:47:54 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1474
moderation_latency
(null)
generation_time
9917
tokens_prompt
81
tokens_completion
538
native_tokens_prompt
315
native_tokens_completion
427
native_tokens_completion_images
(null)
native_tokens_reasoning
180
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0013912
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"2641cfe84e564210a799edf4f5d0f0c4"
is_byok
false
latency
1385
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777045674-UvCIViBo6aUscxxWzdaE"
session_id
(null)
api_type
"completions"
id
"gen-1777045674-Wks5ImGp6JzDujrYJw03"
upstream_id
"2641cfe84e564210a799edf4f5d0f0c4"
total_cost
0.0013912
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
60.65%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"she said"