NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ066089MM2VBKMFPGX6DPMX
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
26.0s
Cost:
$0.003554
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.15%
Dialogue to Total Word Ratio
(details)
0.0773%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the binoculars, squinting through the drizzle at the sprawling suburban house ablaze with light. “Alright, Bill. Deep breaths. We are not panicking.” Bill was not deep breathing. He was hyperventilating into a paper bag he’d brought for the job. “I am panicking, Molly! Look at it! There are at least thirty kids in there! And a clown! Clowns are unpredictable!” “Every occupation has its hazards. The clown is a variable, not an obstacle. Focus on the objective: the present pile in the bay window. That’s our target.” Molly lowered the binoculars, her expression clinical. “I count roughly forty-two gifts. Mostly rectangular, suggesting books and board games. Standard haul for a children’s birthday.” “But it’s for a *child*, Molly. A child named… what did the invite say?” Bill frantically checked his pocket. “‘Liliana.’ It’s for Liliana. That’s just… that’s just mean.” “It’s business. Liliana has an overabundance of material wealth. We are redistributing. Think of it as a very late, very confusing tax.” She began unfolding a laminated party invitation she’d “borrowed” from a mail slot. “Phase one: infiltration. You’re the distraction.” Bill’s eyes widened. “Why am I always the distraction?” “You have a natural, harmless energy. People don’t suspect you. Go to the door, tell them you’re from the clown agency, that Bobo had a… a pie-related incident. Create a three-minute window of chaos. I’ll slip in the back door, which I’ve already propped open with a wedge of cheese from their picnic table.” “What if someone asks what kind of pie?” “Any pie, Bill! Cream pie! Then act flustered and leave. It’s foolproof.” She handed him a red, squeaky nose. “For authenticity.” Ten minutes later, Molly heard a commotion from the front. Bill’s voice carried, a masterpiece of faux-panic. “The pie! It was a catastrophic canary-in-the-coal-mine situation with the cream filling! He’s okay, but he’s… emotionally unavailable for the remainder of the evening!” Molly slipped through the back door into the cavernous kitchen. The noise from the party was a solid wall of sound—screaming, a chaotic rendition of “Happy Birthday.” She moved like a shadow towards the living room. The present pile was indeed magnificent, a glittering mountain of wrapping paper. She had her arms around three large boxes when a small voice said, “Are you taking my presents?” Molly froze. A tiny girl in a tiara stood in the doorway, holding a half-eaten cupcake. Liliana. Molly’s mind raced. This was the variable they hadn’t accounted for: the target herself. She dropped into a crouch. “Shh! No! I’m… I’m the present inspector. Your parents hired me to check for… uh… glitter bombs. From a rival party planner. It’s a nasty business.” Liliana took another bite of cupcake. “They don’t look like bombs.” “That’s what makes them so dangerous.” Molly tapped a box conspiratorially. “This one feels suspiciously heavy.” “My grandpa gave me that. It’s a big box of Lego.” “Ah. Classic misdirection.” Molly stood, shifting the boxes. “Well, all clear here. You’re safe. For now.” From the front, Bill’s voice rose again, now in full theatrical retreat. “The pie prognosis is grim! I must go! Tell Liliana… Bobo sends his regrets and a coupon for ten percent off next birthday!” Molly made her escape, heart pounding. Bill was waiting by the car, panting. “Did you get them?” “Most of them.” She dumped the boxes in the trunk. “The big one’s Lego. It’s a premium item.” “Did you see the clown?” “I spoke with the birthday girl.” Bill went pale. “Was she suspicious?” Molly looked back at the house, at the glow from the windows. “She was eating a cupcake. She’ll be fine.” She started the engine. “Now, about the clown agency cover story. We need a name.” Bill perked up. “I was thinking ‘Bobo’s Big Day Out’?” “Absolutely not.” She pulled away from the curb. “We’re calling it ‘The Pie Incident.’ And for the record, next time, we’re stealing from adults. They don’t have tiaras.”
created_at
"2026-04-24T16:46:47.062Z"
4/24/2026, 4:46:47 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1915
moderation_latency
(null)
generation_time
25924
tokens_prompt
77
tokens_completion
1343
native_tokens_prompt
315
native_tokens_completion
1131
native_tokens_completion_images
(null)
native_tokens_reasoning
168
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0035544
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"15eaee1f5e1845bd99ace74050504205"
is_byok
false
latency
1796
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777049207-z4eUolDxp4uR5csHhjzJ"
session_id
(null)
api_type
"completions"
id
"gen-1777049207-EUzjLrGdHj2PNfqw01Hj"
upstream_id
"15eaee1f5e1845bd99ace74050504205"
total_cost
0.0035544
cache_discount
0.0001536
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
659
0.15%
Dialogue to Total Word Ratio
Ratio: 61.38%, Deviation: 11.38%
neededClean
false
wordsTotal
668
wordsDialogue
410
0.0773%