NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ00SBEB18SG2QZBWN072GYF
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
19.0s
Cost:
$0.002525
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.21%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.1070%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The backyard of the Pemberton residence was a pastel-colored crime scene in the making. Pink streamers dangled from oak trees like party supply tinsel. A bouncy castle inflated itself with the kind of optimism that Molly found personally offensive. She pressed the binoculars against her eyes and studied the terrain with the seriousness of a general before D-Day. Beside her, Bill chewed his sandwich and waited. "Look at that," Molly said, lowering the binoculars. "Three separate gift tables. We're going to need the van." "I told you we needed the van." "You're right. You told me. I didn't listen. This is me acknowledging that you told me." Bill took another bite. "So what's the plan?" Molly unfolded a hand-drawn map she'd sketched on a placemat from Denny's. She'd used colored pencils. The bouncy castle was rendered in exquisite magenta. The entrance points were marked with tiny skulls. "Okay. The party starts at two. By two-fifteen, every parent will be parked in lawn chairs, staring at their phones like zombies. That's our window. I go in dressed as a clown." "A clown." "Professional clowns are trusted at children's parties. It's science." "Do you know how to be a clown?" "Bill, I have been performing a clown act for most of my adult life. It's just never been literal until today." She traced a path on the map with her finger. "I mingle. I make balloon animals. I build trust. Meanwhile, you're in the van with the back doors open." "What if someone asks you to do a trick?" "I'll honk my nose." "That's not a trick." "It is if you commit to it." Bill looked at the map. There was an arrow pointing toward the side fence with the word "EGRESS" written in red. He admired her penmanship. She'd even drawn a tiny wheelbarrow near the gift tables. "What's the wheelbarrow for?" "That's the getaway vehicle." "I thought we had a van." "The wheelbarrow gets us from the tables to the fence. The van gets us from the fence to the warehouse. It's a two-stage extraction." "A wheelbarrow, Molly." "It was the only one I could find that had a flat-free tire." She folded the map back along its creases and tucked it into her jacket pocket. In the distance, a clown car pulled up to the Pemberton house and actual professional clowns began emerging, one after another, like a human accordion. Molly lowered her binoculars slowly. "Bill." "I see it." "There's already clowns." "So what do we do?" Molly watched as the last clown emerged and began unloading a trunk full of juggling pins. Her jaw tightened. She picked up the binoculars again and scanned the backyard one more time. "Okay. New plan. You're the clown. I'll be the wheelbarrow." "How would that even—" "Don't ask questions, Bill. Just trust the wheelbarrow."
created_at
"2026-04-24T15:12:29.976Z"
4/24/2026, 3:12:29 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2447
moderation_latency
(null)
generation_time
18492
tokens_prompt
77
tokens_completion
887
native_tokens_prompt
315
native_tokens_completion
788
native_tokens_completion_images
(null)
native_tokens_reasoning
143
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0025254
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"4078adad7e9649f1968150c2d668ce13"
is_byok
false
latency
2357
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777043550-WuPUkdDVNWpbsvNtRlXq"
session_id
(null)
api_type
"completions"
id
"gen-1777043550-zv0XgSOMV04IeobSNpkR"
upstream_id
"4078adad7e9649f1968150c2d668ce13"
total_cost
0.0025254
cache_discount
0.0001536
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.21%
Matches word count
n/a
neededClean
false
words
472
0.00%
Dialogue to Total Word Ratio
Ratio: 51.78%, Deviation: 21.78%
neededClean
false
wordsTotal
477
wordsDialogue
247
0.1070%