NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0ACTPDQBSJZ11MTSN9T7FX
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
11.0s
Cost:
$0.001629
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"So here's the plan," Molly said, spreading a crayon-colored napkin across the table. "Saturday, 2 PM, the Henderson backyard. Forty kids, twenty-five parents, one bouncy castle." Bill squinted at the napkin. "Is this a children's menu from Applebee's?" "It's the only paper I had in my pocket. Focus." "Right. So we're hitting a six-year-old's birthday party." "Exactly. Think about it, Bill. The security is nonexistent. No guards, no cameras, no laser grids. Just a guy named Dave who grills burgers and gets way too competitive at Cornhole." Bill tapped his chin. "And the loot?" Molly grinned. "Sixteen wrapped presents under a plastic tree. I'm talking Barbies, LEGO sets, a remote-control helicopter. Do you know what a sealedLEGO Star Destroyer goes for online?" Bill's eyes widened. "Those go for like three hundred bucks." "Four hundred if it's the Collector's Edition." "Sweet mother of—" "The helicopter alone is worth a hundred and twenty. And that's before we even get to the gift cards. Every parent throws in a gift card, Bill. Five here, ten there. It adds up." "So what's the entry strategy?" Molly unrolled the napkin and pointed at a wobbly drawing. "Back fence, near the oak tree. The dog next door only barks on Tuesdays. Saturday, we're golden." "What about the kids?" "What about them? They're six. What are they gonna do, tackle us? They can't even tie their shoes." "Fair point. What about the piñata?" Molly paused. "What about it?" "That thing's full of candy. Are we taking the candy too?" "We're not animals, Bill. We leave the piñata." "You sure? Skittles go for—" "We leave. The piñata." Bill held up his hands. "Fine, fine. So when do we make the move?" "Right after cake. That's the golden window. Every adult is singing, all the kids are screaming, nobody's paying attention. We slip in through the side gate, fill two duffel bags, and we're out in ninety seconds." "Ninety seconds? That's tight." "You complaining?" "No, I'm impressed. You really thought this through." "I've been casing the Henderson house for three weeks. I know their garbage schedule, their Wi-Fi password, and their dog's name is Captain Biscuit." "That's disturbing." "That's professionalism." Bill nodded slowly. "Alright, I'm in. But I have one condition." "What?" "I want to eat a cupcake before we leave." Molly sighed. "You always do this. Every job, you want to eat something on the job." "The cupcakes at these parties are incredible, Molly. Little frosting roses on top. You can't buy that quality." "Fine. One cupcake. Then we move." "Deal." Bill extended his hand. "Partners?" Molly shook it. "Partners. Now let's go buy some duffel bags." "Should we use cash or—" "Definitely cash, Bill. Definitely cash."
created_at
"2026-04-24T18:00:25.05Z"
4/24/2026, 6:00:25 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1053
moderation_latency
(null)
generation_time
10648
tokens_prompt
77
tokens_completion
872
native_tokens_prompt
311
native_tokens_completion
783
native_tokens_completion_images
(null)
native_tokens_reasoning
105
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00162896
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"33ee42f710ab477ca00badc93a0bb6bb"
is_byok
false
latency
1003
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777053625-McRnVLtpxgWChUqoyIvK"
session_id
(null)
api_type
"completions"
id
"gen-1777053625-RgZ0PchRlLFm9clJtSoQ"
upstream_id
"33ee42f710ab477ca00badc93a0bb6bb"
total_cost
0.00162896
cache_discount
0.00006144
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
449
0.00%
Dialogue to Total Word Ratio
Ratio: 88.35%, Deviation: 18.35%
neededClean
false
wordsTotal
455
wordsDialogue
402
0.0000%