NC Bench
Run: 2026-04-24-01KQ016MMYWC8A3PRDDSMZNWWS
Run Details
Model: Xiaomi MIMO v2.5 Pro
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 22.6s
Cost: $0.003254
Provider: Xiaomi
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed the binoculars against the rain-streaked window of their rented bouncy castle van. “Target’s loaded. Must be thirty presents in there, minimum. A goldmine of stuffed animals and Lego sets.” Bill, adjusting his bright yellow hazard-vest, peered over her shoulder. “A Lego set? For the right collector, that’s six months rent. And stuffed animals? Please. The down payment on a submarine.” “It’s a metaphor, Bill.” “Sure, sure.” He rubbed his hands together. “So, the plan?” “The plan is elegant,” Molly said, not looking away from the suburban house where a ponytailed child was beating a piñata shaped like a star. “We roll up as the entertainment.” Bill blinked. “The entertainment? We don’t have a clown costume between us.” “Who needs a clown?” Molly scoffed. “We have an industrial-grade bouncy castle and a certified party planner.” She tapped her own chest. “You’ve never planned a party in your life.” “I’ve attended several. I observed the flow. The cake arrives, chaos peaks, the parents drink the wine out of the box. We set up the castle, the children are magnetically drawn to it, and while every single small human is airborne, we make the swap.” “Swap?” “Presents in, decoy presents out. We leave with the haul. They’ll just think some kid mixed them up during the sugar crash. By the time they unwrap the decoys—filled with glitter, it’s a fabulous idea—they’ll blame the clown who never showed.” Bill chewed his lip. “I like it. It’s diabolical. It’s also stupidly simple. What’s the catch?” “The catch,” Molly said, finally turning, “is that you have to run the bouncy castle. The noisy, distracting, children-entertaining bouncy castle. Can you handle that?” “Handle it? Molly, I was born for this. I’m a crowd-pleaser. I’m a vibes-coordinator.” “You’re a liability in a good mood,” she muttered, pulling out a map of the backyard. “Okay. We park here. You inflate and start the sales pitch. I’ll slip in through the back gate during the initial euphoria. 
Timing is everything.” “What if the parents ask questions?” Bill asked, now nervously adjusting his vest again. “You say you’re from ‘Fun-Zone Rentals.’ You say the booking was made by… Mrs. Henderson.” “Is there a Mrs. Henderson?” “There is now. She paid in cash and is tragically allergic to phones. No contact. See? Simple.” Bill grinned. “I love a tragic backstory. So, once I’ve got them all hyped up and defying gravity?” “Signal me with a specific party song.” “Which one?” “The one that goes ‘boom, boom, boom, boom, I want you in my room.’ That’s the all-clear.” “Classic,” Bill said. “And if something goes wrong? What’s the abort signal?” Molly met his gaze, dead serious. “If you hear ‘It’s a Small World’ played on a toy xylophone, you run. You don’t look back. You don’t ask questions. You leave the castle and you run.” Bill swallowed. “Got it. So, our success hinges on me, an inflated plastic palace, and avoiding copyright infringement.” “And glitter,” Molly added, hefting a duffel bag. “Lots of glitter. Let’s move.” The van rumbled to life, pulling away from the curb just as the sound of children’s laughter and the weak thwack of a baseball bat on papier-mâché filled the air. The heist was on.
created_at: "2026-04-24T15:19:45.064Z" (4/24/2026, 3:19:45 PM)
model: "xiaomi/mimo-v2.5-pro-20260422"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 751
moderation_latency: (null)
generation_time: 22543
tokens_prompt: 77
tokens_completion: 1241
native_tokens_prompt: 315
native_tokens_completion: 1031
native_tokens_completion_images: (null)
native_tokens_reasoning: 248
native_tokens_cached: 192
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0032544
router: (null)
provider_responses:
  0:
    endpoint_id: "5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
    id: "0d99024b3d5840edb8947fe504d334c8"
    is_byok: false
    latency: 661
    model_permaslug: "xiaomi/mimo-v2.5-pro-20260422"
    provider_name: "Xiaomi"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777043985-iixsfNrlTz8XBOrjjmzz"
session_id: (null)
api_type: "completions"
id: "gen-1777043985-GUoJuT5UVjJ3C2TH2YLX"
upstream_id: "0d99024b3d5840edb8947fe504d334c8"
total_cost: 0.0032544
cache_discount: 0.0001536
upstream_inference_cost: 0
provider_name: "Xiaomi"
response_cache_source_id: (null)
Evaluation details
Evaluator: Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 541
Evaluator: Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 74.68%, Deviation: 24.68%
Meta Data: neededClean: false, wordsTotal: 549, wordsDialogue: 410
Overall: 0.0000%
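The reported ratio and deviation follow directly from the two counts in the metadata: 410 dialogue words out of 549 total is 74.68%, which is 24.68 percentage points away from the 50% the scenario asks for. The sketch below reproduces that arithmetic. The page does not show how the evaluator counts words or identifies dialogue, so the counting helper is an assumption: it treats text inside curly quotes as dialogue and splits words on whitespace. The function names are hypothetical.

```python
import re


def count_words(text: str) -> int:
    # Assumption: a "word" is any whitespace-delimited token.
    return len(text.split())


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target_ratio: float = 0.5) -> tuple[float, float]:
    # Dialogue share of all words, and its absolute distance from the target.
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target_ratio)


def dialogue_counts(text: str) -> tuple[int, int]:
    # Assumption: dialogue is whatever sits between curly quotes.
    # The real evaluator may use different rules.
    spoken = re.findall(r"“([^”]*)”", text)
    words_dialogue = sum(count_words(s) for s in spoken)
    return words_dialogue, count_words(text)


# Counts taken from the "Meta Data" fields on this page.
ratio, deviation = ratio_and_deviation(410, 549)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# A tiny illustrative sample, not taken from the model response.
sample = "“Go now,” she said. “Fine.”"
print(dialogue_counts(sample))
```

Note that the two evaluators report different totals for the same response (words: 541 versus wordsTotal: 549), so they presumably tokenize differently. A simple whitespace split like the one above may not match either figure exactly.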