NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ06GH3X7YN1DY571NCK6MAN
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
11.2s
Cost:
$0.001622
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
51.93%
Matches word count
(details)
98.15%
Dialogue to Total Word Ratio
(details)
75.0358%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Molly, this is a terrible idea." Bill whispered from behind the enormous, rainbow-frosted cake. "Bill, *all* our ideas are terrible. That's our brand. Now, focus." Molly adjusted her oversized novelty glasses. "The presents are in the living room. The clown goes on his break at 3:15. That's our window." "But it's a *children's* party. There are games! Pinatas! Balloon animals!" Bill gestured wildly. "What if someone makes us play musical chairs? I can't handle that pressure." "You will not play musical chairs. You are a shadow. A silent, gift-swiping shadow." Molly pointed. "Your job is to create a diversion when I give the signal." "What's the signal?" "I will pretend to faint near the bouncy castle. You will, with great concern, announce that the ice cream is melting." "Molly, that's diabolical. But what about the loot? I don't want to steal a kid's toy horse." "We're not *monsters*, Bill. We're redistributors of joy. They have too much joy here. We're taking it to a less joyous location." She peered around the cake. "Okay, the clown is leaving. Get ready to faint. I mean, get ready to *notice* me fainting." Bill took a deep breath. "I still think we should have just robbed the bank." "The bank has security. This place has a face-painting station. Now go."
created_at
"2026-04-24T16:52:32.008Z"
4/24/2026, 4:52:32 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1095
moderation_latency
(null)
generation_time
11108
tokens_prompt
77
tokens_completion
541
native_tokens_prompt
315
native_tokens_completion
487
native_tokens_completion_images
(null)
native_tokens_reasoning
168
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0016224
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"cc4cf8bef5e34d37a2a1f53153edd31b"
is_byok
false
latency
1095
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777049552-Bsfzpof8wpZo3ATsf3ps"
session_id
(null)
api_type
"completions"
id
"gen-1777049552-b7VHHxglzEzw7N2m5tsn"
upstream_id
"cc4cf8bef5e34d37a2a1f53153edd31b"
total_cost
0.0016224
cache_discount
0.0001536
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
51.93%
Matches word count
n/a
neededClean
false
words
216
98.15%
Dialogue to Total Word Ratio
Ratio: 86.30%, Deviation: 3.70%
neededClean
false
wordsTotal
219
wordsDialogue
189
75.0358%