NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ00ZCF5WP1FM5W9Y5SCAHCY
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
13.9s
Cost:
$0.002035
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Alright," Molly whispered, spreading a crumpled napkin on the park bench. "The target is 342 Maple Lane. Birthday party for a six-year-old. Codename: 'Operation Sprinkles.'" Bill squinted at the napkin, which was covered in what appeared to be ketchup diagrams. "A kid's party? Seriously? What's the score? A few plastic trucks? A juice box?" "Don't be naive, Bill. Intelligence suggests a significant haul. A Play Palace Mega-Dungeon, three LEGO sets, a remote-controlled dinosaur, and—this is the prize—a genuine, glitter-filled, battery-operated singing unicorn." "A unicorn," Bill repeated flatly. "It sings. It glitters. It has a market value. Focus!" Molly tapped the ketchup map. "Entry is at 2:00 PM, peak chaos during musical chairs. That's when we move." "Musical chairs? So we just... sit down?" "Exactly. We blend. You'll wear the spare clown wig from the trunk. I'll pose as a caterer delivering the 'special' cake." "The one we're eating?" "The one we're *carrying*. It's a decoy. Listen closely: Phase one, distraction. You'll trip near the balloon animal station. While they're laughing at the 'clumsy new clown,' I slip into the present pile." "Trip? How?" "Stumble dramatically. Perhaps fall into the punch bowl. Create a scene." "I don't want to get sticky." "Bill, the fate of the glitter-unicorn rests on your willingness to be sticky. This is non-negotiable." "Fine. What's in the decoy cake?" "Vanilla, mostly. With a layer of instant mashed potatoes in the middle." "Why?" "Because when the mother cuts into it expecting funfetti, the texture will confuse and dismay her. That buys us approximately forty-five seconds of stunned silence." "That's... evil." "It's professional." Molly slid a small, garbled diagram forward. "Phase two: The extraction. You'll create a diversion with the dinosaur." "By driving it into the clown car?" "By driving it into the *bouncy castle*. While everyone is rescuing deflated reptile and children, you load the haul into the wagon disguised as a giant stroller." "A giant stroller. For the giant babies." "For the presents, Bill. Then we saunter out with the other parents, looking tired and sensible. Simple." "It sounds insane. What's our exit strategy if the six-year-old spots us?" "Offer him a slice of mashed-potato cake. His brain will short-circuit. Now, your cue is when the mom brings out the unicorn. It will start singing 'You Are My Sunshine.' That's our signal. The louder it sings, the more cover we have." "And if it's a sad, quiet version?" "Then we improvise. Maybe cause a minor frosting-related incident." "You're enjoying this, aren't you?" "It's a challenge, Bill. The security is emotional, the defenses are chaotic, and the loot is whimsical. It's the perfect job." She folded the ketchup napkin. "We're in. We take the unicorn, the Mega-Dungeon, and one LEGO set for good measure. We leave the dinosaur; it's too conspicuous." "Right. No dinosaur. So we're really doing this." "We're really doing this. Now, practice your trip. And for heaven's sake, look more pathetic. You're a clown who just learned his dog ran away." Bill practiced a wobbly, sad walk. "Like this?" "Less like you're in pain, more like you're confused by gravity. And Bill?" "Yeah?" "Don't eat the cake. It's not cake." "Mashed potatoes and glitter-unicorn money. What a world."
created_at
"2026-04-24T15:15:47.312Z"
4/24/2026, 3:15:47 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
586
moderation_latency
(null)
generation_time
13674
tokens_prompt
77
tokens_completion
1120
native_tokens_prompt
311
native_tokens_completion
986
native_tokens_completion_images
(null)
native_tokens_reasoning
179
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00203496
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"ec247fc9e1f445538a23f0a78eae6d9a"
is_byok
false
latency
537
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777043747-IDi0URCQyzxE253e2vlx"
session_id
(null)
api_type
"completions"
id
"gen-1777043747-sUafuvmICQNRyV29TTNi"
upstream_id
"ec247fc9e1f445538a23f0a78eae6d9a"
total_cost
0.00203496
cache_discount
0.00006144
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
535
0.00%
Dialogue to Total Word Ratio
Ratio: 90.74%, Deviation: 20.74%
neededClean
false
wordsTotal
551
wordsDialogue
500
0.0000%