NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ08FZ48Y2QR9HPDHR032PMN
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
12.8s
Cost:
$0.001634
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
92.99%
Dialogue to Total Word Ratio
(details)
95.8509%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars. “There. The bouncy castle is our entry point. Parents are distracted by the clown.” Bill squinted. “A clown? Molly, I have a phobia.” “You’ll have a phobia of prison if we don’t focus. You take the gift bag, I’ll handle the diversion.” “What’s the diversion?” “You scream ‘escaped pygmy goat’ and run clockwise.” “Why clockwise?” “Coriolis effect. It’ll confuse them. Look, just grab the biggest presents. The ones with the shiny paper.” “What if a kid tries to stop me?” “You use the secret password: ‘Pirate birthday code seven.’ They’ll back off.” “What’s code seven?” “It’s not real, Bill! It’s a children’s party! Just sound authoritative. And for God’s sake, stay away from the craft table. Last time you got covered in glitter.” “I glittered for a week. The dog wouldn’t stop following me.” “Exactly. So. Bouncy castle, gifts, goat panic, clockwise. Simple.” “You’re forgetting the pool noodle barrier.” “The what?” “Around the cake. It’s a defense perimeter. I saw it on the blueprints.” Molly sighed. “Fine. You disable the noodle. Now, any questions?” “Just one. Can we be back by six? My mother-in-law is coming for pot roast.” Molly stared. “We are stealing from children, Bill. We’ll be back by five.”
created_at
"2026-04-24T17:27:10.743Z"
4/24/2026, 5:27:10 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1754
moderation_latency
(null)
generation_time
12706
tokens_prompt
77
tokens_completion
623
native_tokens_prompt
315
native_tokens_completion
508
native_tokens_completion_images
(null)
native_tokens_reasoning
193
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0016342
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"8f4f9e5f6e704299825e4f8210fa88c3"
is_byok
false
latency
1672
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777051630-vgOv4k6m0Tv6MrUGgPzt"
session_id
(null)
api_type
"completions"
id
"gen-1777051630-csKNWwf77RL2mGLtCLkr"
upstream_id
"8f4f9e5f6e704299825e4f8210fa88c3"
total_cost
0.0016342
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
206
92.99%
Dialogue to Total Word Ratio
Ratio: 95.19%, Deviation: 5.19%
neededClean
false
wordsTotal
208
wordsDialogue
198
95.8509%