NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ023ZBS9G1WS11KQH64823E
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
24.8s
Cost:
$0.003131
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the oversized bow tie she’d stolen from a thrift store mannequin and peered through the binoculars at the suburban house. From the safety of the minivan parked across the street, the target materialized in all its plastic-glory: the driveway was a chaos of shimmering gift bags, and the backyard, visible through a gate, hosted a trampoline, a bouncy castle, and at least twenty screaming children. “Bill, status report,” she whispered, not taking her eyes off the prize. Bill, hunched in the driver’s seat with a roadmap spread over the steering wheel, didn’t look up. “Status is, this is insane. We’re robbing a birthday party. For a seven-year-old named Tiffany.” “Exactly. No sophisticated security, no lasers. Just cake and bad decisions. We slip in, blend with the chaos, and extract the goods. Simple.” Molly snapped the binoculars shut. “Blend with the chaos?” Bill’s voice cracked. “I don’t know how to talk to children. I’m a grown man with a crowbar in his boot.” “That’s why you’re on distraction duty,” Molly said, opening a duffel bag. She pulled out a clown nose, a pair of mismatched socks, and a large, flat box. “You’ll be ‘Uncle Bumbling Bill.’ You’ll ‘accidentally’ trip over the extension cord to the disco lights, creating a diversion. While they’re gasping over the darkness, I’ll be under the gift table.” Bill picked up the clown nose with the tip of his thumb and forefinger, as if it were a biohazard. “And what are you supposed to be?” “Party Planner #3. See?” She gestured to her clipboard and walkie-talkie, both utterly fake. “I walk in with authority. Parents don’t question a walkie-talkie. It’s science.” She began loading gift bags into her own, identical-looking but subtly larger duffel. “We hit the big ones first. The long, flat ones. Could be a bike. The soft, lumpy ones—stuffed animals, maybe. Fill our bags and melt away into the sugar-hyped sunset.” “You’re forgetting the variable that cannot be planned for: children. They’re tiny, emotional landmines. What if one tries to talk to me?” “Use the ‘Aw, shucks’ routine,” Molly instructed, stuffing a garish, foil-wrapped box into her bag. “Just smile, nod, and point vaguely at a parent. They’ll go away. Or better yet, give them one of these.” She tossed him a handful of colorful, pre-filled bubble tubes. “It’s a bribe. They’re cheap, they’re mesmerizing, and it buys you fifteen seconds.” Bill examined a bubble tube, a flicker of professional curiosity breaking through his fear. “The schematics on this are surprisingly efficient. Low viscosity solution, dual-wand design for maximum shimmer…” “See? You’re a natural!” Molly clapped him on the shoulder. “Now, the most critical phase: the exit. Once we have the goods, we don’t run. We walk. Calmly. To the minivan, which I’ll have idling. If anyone asks, we’re just… picking up leftover catering supplies. The sheer, unmitigated audacity is our camouflage.” Bill finally looked up from the roadmap, his expression a mixture of profound doubt and resigned amusement. “You know, for a ‘simple’ plan, there’s an awful lot of clown noses and bubble warfare involved.” “That’s what makes it art, Bill. Not just a heist.” Molly grinned, her eyes glinting with a chaotic gleam that had nothing to do with the afternoon sun. “Now, put on the nose. It’s showtime.”
created_at
"2026-04-24T15:35:46.306Z"
4/24/2026, 3:35:46 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2059
moderation_latency
(null)
generation_time
24735
tokens_prompt
77
tokens_completion
1160
native_tokens_prompt
315
native_tokens_completion
1007
native_tokens_completion_images
(null)
native_tokens_reasoning
207
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0031312
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"83b957f2bd64411cbe83663dd606582c"
is_byok
false
latency
1969
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777044946-N7OMANPd7mawgpvb2wT2"
session_id
(null)
api_type
"completions"
id
"gen-1777044946-KMCwloCbDsuBN8OEJRnT"
upstream_id
"83b957f2bd64411cbe83663dd606582c"
total_cost
0.0031312
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
550
0.00%
Dialogue to Total Word Ratio
Ratio: 56.17%, Deviation: 26.17%
neededClean
false
wordsTotal
559
wordsDialogue
314
0.0000%