NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0368CNSQGM1HGWMTD3P4HS
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
12.1s
Cost:
$0.001855
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.59%
Dialogue to Total Word Ratio
(details)
0.2948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her face against the fence, squinting at the brightly coloured backyard. "Okay, the target is a 'Unicorn and Rainbows' themed fifth birthday party. Security is a bouncy castle and a man dressed as a banana." Bill jotted on a notepad. "Bouncy castle. So, a mobile, inflatable distraction. Got it. What's the haul?" "Presents, Bill. Mountains of them. Mostly shaped like cubes and rectangles. High-value plastic, potential for LEGO, maybe a bike. We're talking top-tier loot for a five-year-old." "The pay's not great for kid's stuff," Bill mused. "The pay is the principle of the thing," Molly corrected. "These parties have gone soft. Too much structure. We're providing a... chaotic critique. Now, entry point?" "The banana is our in," Bill said firmly. "I'll engage. I'll compliment the banana. I'll ask about the peel situation. I'll make him question his life choices." "While you're philosophizing with fruit, I'll slip around to the gift table. What about the parents? The 'Mummy juice' brigade near the cooler?" "Handled. I'll create a diversion. Something emotionally complex, like spilling a juice box on a critical play structure. They'll be busy with mediation and apologies." Molly nodded, pulling a small, rainbow-striped ladder from her bag. "Good. I've got the ladder. It's not subtle, but it's on theme." "We're thieves, Molly, not interior decorators. Though this does feel like decorating with crime." He peered closer. "Wait. What's that on the table? The big, shiny thing next to the cake?" Molly squinted. "It's the piñata. A glittery, cardboard donkey. Full of candy." Bill was silent for a long moment. "We're taking the piñata." "What? No! That's not the mission!" "Molly, think about it. We don't want candy. The kids want candy. We grab the piñata, they're all in tears. The parents have to scramble for replacements. Diversion complete. Simplicity itself." "That's evil," Molly said, horrified and impressed. "It's efficient. We take the presents *and* the symbol of their fleeting joy. It's poetic." He started sketching a new plan. "Okay. I distract the banana. You grab the piñata. On my signal, you shake it violently to simulate an attack." "And then?" "And then we run. They'll be too confused to chase a man in a banana suit and a woman wielding a glitter donkey." Molly slowly packed the ladder away. "You've thought about this." "I have a system," Bill said, grinning. "Now, about your costume. That black jumpsuit is a bit 'adult'. I've brought you something." He produced a floppy, sequined wizard hat. "For the element of surprise." She took it. "You're a terrible influence." "I'm a professional," he corrected, adjusting his own disguise—a slightly-too-large birthday crown. "Now, let's go steal some childhood."
created_at
"2026-04-24T15:54:29.663Z"
4/24/2026, 3:54:29 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1044
moderation_latency
(null)
generation_time
11966
tokens_prompt
77
tokens_completion
999
native_tokens_prompt
311
native_tokens_completion
896
native_tokens_completion_images
(null)
native_tokens_reasoning
233
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00185496
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"c4fc2cc401214fb18b7ce38327ad0692"
is_byok
false
latency
998
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777046069-OsGuZ6kupNnN3IJTN2jA"
session_id
(null)
api_type
"completions"
id
"gen-1777046069-SpqpjGgXquW0eC8NQpBN"
upstream_id
"c4fc2cc401214fb18b7ce38327ad0692"
total_cost
0.00185496
cache_discount
0.00006144
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
446
0.59%
Dialogue to Total Word Ratio
Ratio: 80.13%, Deviation: 10.13%
neededClean
false
wordsTotal
453
wordsDialogue
363
0.2948%