NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ03WWFKC5C42DPEDKQ6MWGP
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
13.7s
Cost:
$0.002039
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
43.1902%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The minivan's engine coughed into silence outside the suburban two-story. Molly peered through binoculars, her breath fogging the glass. "Target acquired. Single-story residential, reinforced door. Standard birthday defenses." Bill, slumped in the passenger seat, crunched on a bag of cheese puffs. "What's the security detail? An angry golden retriever?" "Worse," Molly whispered, her voice tight. "Multiple hostiles. High-pitched vocalizations. Unsynchronized movement patterns." She lowered the binoculars. "It's a six-year-old's party, Bill. In full swing." Bill squinted at the house. Through the picture window, a blur of tiny, sugar-fueled bodies ricocheted off furniture. A clown was inflating animals, his face a rictus of strained cheer. "So the score is... what, half-eaten cake? Sparkly stickers?" "You're not thinking tactically," Molly hissed, pulling out a laminated diagram she'd scribbled on a napkin. "The main cache is in the living room. Observe. Pile of brightly wrapped boxes. The intelligence suggests high-value loot: a remote-control car, multiple Lego sets, a plastic dinosaur that roars." "A roaring dinosaur. My heart's racing." "The point isn't the individual items, it's the volume. We need to blend in. Phase one: social infiltration. We pose as the entertainment." Bill choked on a puff. "We're not clowns. You hate clowns." "I have a better idea." Molly's eyes glinted. "We're the 'fun cousins' from out of town. Here to help with the games. You, Bill, will be enthusiastic but clumsy. Trip over the cake. Distract the adults. I will execute the primary extraction during the musical chairs." "Musical chairs is chaotic. It's perfect." "It's a battlefield of screaming toddlers and furious parents. I'll signal you when the presents are bagged. You create a diversion." "What kind of diversion? I could set the clown's hair on fire." "That's your default answer to everything. No. You will lead them in a rousing, off-key rendition of 'Old MacDonald.' Your voice is a weapon of mass distraction." Bill considered this. "Fair point. And the getaway?" "Phase two: exfiltration. We load the loot into the big cardboard box the bouncy castle came in. We tell the birthday boy it's a special, grown-up present he can't open yet. We walk it to the van." "What if he cries?" "You're the fun cousin. You'll think of something. A threat? A bribe? Offer him your cheese puffs." "This is a flawless plan," Bill said, his mouth full. "There's one complication." Molly pointed. A small, determined girl in a princess dress was methodically popping balloons with a plastic sword. "That's the mark. Emily. The birthday girl. She's already acquired three presents from other children through complex social bargaining and outright theft. She's sharp." "Terrifying," Bill mumbled. "Stay focused. We enter on the count of three. Remember: smile, be useless, and when I give the signal, start singing about a cow." "What kind of cow?" "A cow with severe commitment issues. Now, look enthusiastic." Molly plastered on a wide, terrifying grin. Bill mirrored her, looking like a maniac. Together, they stepped out of the van, carrying a suspiciously empty gym bag, and walked toward the maelstrom of joyous chaos. The heist was on.
created_at
"2026-04-24T16:06:51.132Z"
4/24/2026, 4:06:51 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
605
moderation_latency
(null)
generation_time
13436
tokens_prompt
77
tokens_completion
1154
native_tokens_prompt
311
native_tokens_completion
988
native_tokens_completion_images
(null)
native_tokens_reasoning
236
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00203896
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"22e1264341d347dea41811a6a755f359"
is_byok
false
latency
554
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777046811-BCnpLcMomKUDT3hZpC6G"
session_id
(null)
api_type
"completions"
id
"gen-1777046811-CFcQ4ZPnvAN0j46nyxnw"
upstream_id
"22e1264341d347dea41811a6a755f359"
total_cost
0.00203896
cache_discount
0.00006144
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
511
0.00%
Dialogue to Total Word Ratio
Ratio: 69.35%, Deviation: 19.35%
neededClean
false
wordsTotal
522
wordsDialogue
362
43.1902%