NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ05FDTVGQCJA93NT8DE668B
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
13.6s
Cost:
$0.002183
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the table, though they were just crayon drawings she'd lifted from a recycling bin outside the community center. The construction paper smelled like glue sticks and shattered dreams. Bill leaned in, squinting at a crude stick figure labeled "MOM WITH CAKE" standing near what appeared to be either a front door or a very tall window. "So what are we dealing with here?" Bill asked, brushing powdered donut sugar off the intelligence materials. "Saturday. Two to five. Jennifer's sixth birthday." Molly tapped a circle surrounded by smaller circles. "Thirty kids. Forty-two presents, minimum. I counted the RSVPs." "Forty-two presents for a six-year-old?" "Her mom's an influencer. The theme is unicorns. There will be a cotton candy machine, Bill. A cotton candy machine." Bill whistled low. "We're going to need a bigger bag." "We're going to need a distraction." Molly unrolled another piece of paper, this one a hand-drawn timeline with coffee ring stains. "At exactly three-fifteen, the clown arrives. That gives us a fifteen-minute window of maximum chaos while thirty children lose their minds over balloon animals." "What kind of clown?" "What does it matter?" "It matters if it's Gerald. Gerald knows me. I owe Gerald twenty bucks." Molly stared at him. "You owe a clown money." "It was a poker game. He cheated. He was doing that thing with the squirting flower." Bill waved his hand dismissively. "Forget Gerald. What's the entry plan?" "Back door. Kitchen. The caterer arrives at one to set up. We go in dressed as assistant caterers." Molly produced two aprons from her bag. They were neon pink and read "PARTY PERFECT CATERING" in glittery letters. "These are adorable," Bill said, putting one on immediately. "Focus." "I am focused. I look great." The plan was elegant in its simplicity. They would blend in during setup, hide in the coat closet when guests arrived, emerge during the clown distraction, fill their bags with wrapped gifts, and exit through the garage where a sedan would be idling with the engine running. "Who's driving the sedan?" Bill asked. "My niece. She's nineteen. Very responsible." "She got a license?" "She has a permit." "Molly." "She's very responsible." Bill picked up the crayon blueprint again, studying the layout. The coat closet was eight feet from the present table. The present table was four feet from the back exit through the kitchen. Simple math. Beautiful math. "What about the dog?" Bill said suddenly. "What dog?" "There's always a dog, Molly. Golden retriever. Named Biscuit or something. Loves everyone. Will absolutely not let us leave quietly." Molly flipped through her notes, panic flickering across her face for the first time. "There's no dog listed on the intelligence." "That's what they want you to think. The dog is never listed." Bill stood and paced. "We need dog treats. High-value dog treats. The fancy ones from the pet store that cost eleven dollars a bag." "I'm not spending eleven dollars on a dog we might not even encounter." "Would you spend ten?" "There's a coupon." "Then buy two bags. Insurance." Bill extended his hand. Molly shook it firmly. "Jennifer won't know what hit her." "Jennifer's going to get exactly the same Christmas she would have gotten anyway. We're not monsters." "Right. We're professionals." They packed the supplies into a duffel bag: aprons, zip ties for the coat closet door handle from the inside, dog treats, and a small Bluetooth speaker loaded with a playlist called "HEIST JAMS" that was mostly jazz. Bill paused at the door. "One more thing." "What?" "If Gerald's the clown, I'm going to need you to handle the distraction while I avoid eye contact." "Fine." "And I still think he cheated." "Go home, Bill. Practice walking like a caterer." He left, walking surprisingly gracefully in the neon pink apron. Molly watched him go, already dreading Saturday but also, if she was being honest, a little excited. Forty-two presents. She wondered what kind of toys six-year-olds even liked these days. Probably tablets or something equally uninspiring. She hoped at least one box contained something interesting. She started packing the dog treats.
created_at
"2026-04-24T16:34:27.304Z"
4/24/2026, 4:34:27 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
787
moderation_latency
(null)
generation_time
13501
tokens_prompt
77
tokens_completion
1280
native_tokens_prompt
311
native_tokens_completion
1060
native_tokens_completion_images
(null)
native_tokens_reasoning
135
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00218296
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"3687a228dc2445e09bfe1b8dbfa995a2"
is_byok
false
latency
736
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777048467-zeI6udVBCPSYqZgFXFd6"
session_id
(null)
api_type
"completions"
id
"gen-1777048467-w9AglDlUrZbS3DFnoZNt"
upstream_id
"3687a228dc2445e09bfe1b8dbfa995a2"
total_cost
0.00218296
cache_discount
0.00006144
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
685
0.00%
Dialogue to Total Word Ratio
Ratio: 50.29%, Deviation: 20.29%
neededClean
false
wordsTotal
696
wordsDialogue
350
0.0000%