NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ07HBQT21AP5CGY99B3KTX5
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
11.3s
Cost:
$0.001658
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
27.17%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
13.5829%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the table in their cramped apartment, though calling them blueprints was generous. They were crayon drawings she'd snatched from a recycling bin outside a kindergarten, hastily annotated with red marker. "Alright, Bill. Listen up. Operation Birthday Blitz is a go." She tapped the paper with a pen. "The target is 442 Maple Drive. Saturday afternoon. Two o'clock sharp." Bill leaned forward, squinting at the drawings. "These look like a five-year-old made them." "The intel is authentic. I bribed the kid on the tricycle with a fruit snack." "What kind of fruit snack?" "The good kind. Mango." Bill nodded, suddenly respectful. "Go on then." Molly cleared her throat and pointed to a crude circle on the paper. "The front door. Too risky. Mrs. Henderson runs the party, and she's got eyes like a hawk. Last year she caught Tommy Weaver sneaking cake before lunch and that kid didn't sit down for a week." "So how do we get in?" "The doggy door." "The doggy door? Molly, I'm six foot two." "Then lose some weight." "I'm not losing weight for a heist." She rolled her eyes. "Fine. The doggy door is for the backup plan. Primary entry goes through the kitchen window. The party schedule says pin the tail on the donkey at two fifteen. That's our window. Everyone will be in the living room, blindfolded and screaming." Bill pulled out a notepad and started scribbling. "What are we working with? Gift-wise?" "I did recon last Tuesday. I posed as a clown." "You don't know how to be a clown." "I made balloon animals. Two of them popped. A child cried. I still got intel." She pulled out a list. "We're looking at one gaming console, three tablets, two bicycles, a karaoke machine, and something wrapped in purple paper that squeaks." "Squeaks?" "Squeaks." Bill wrote it all down. "That's a solid haul. What about security?" "Mrs. Henderson's husband, Gary. He's the wild card. He usually falls asleep in the recliner by two thirty. Snores loud enough to rattle the windows." "And if he doesn't fall asleep?" "Then we hit him with the pie." "The pie?" "There's always a pie at these things, Bill. It's a birthday party. You improvise." He sat back and rubbed his chin. "What about the kids? Forty of them hopped up on sugar is a nightmare scenario." "Already handled. I called ahead and requested a magician. The magician will be doing close-up magic in the backyard for twenty minutes. It's the longest twenty minutes in children's entertainment. No kid leaves a magic show." "You called a magician?" "I called three. We've got coverage." Bill stared at her for a long moment. "You're beautiful, you know that?" "Flattery doesn't split the loot, Bill." She rolled up the blueprints. "Now suit up. You're wearing the gorilla costume." "Why the gorilla costume?" "Because if Gary wakes up, you're not a burglar. You're part of the entertainment." He groaned and reached for the costume bag. Somewhere across town, a child blew out practice candles and wished for the very things Molly and Bill were already counting.
created_at
"2026-04-24T17:10:27.851Z"
4/24/2026, 5:10:27 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
621
moderation_latency
(null)
generation_time
11223
tokens_prompt
77
tokens_completion
930
native_tokens_prompt
311
native_tokens_completion
808
native_tokens_completion_images
(null)
native_tokens_reasoning
94
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00165848
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"13d7b7aae4ec4036823c6e5981f3d06c"
is_byok
false
latency
554
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777050627-AUacJKgubd7gYWN3ZFSc"
session_id
(null)
api_type
"completions"
id
"gen-1777050627-MTgOw3rDpTNhzi4GhGQy"
upstream_id
"13d7b7aae4ec4036823c6e5981f3d06c"
total_cost
0.00165848
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
27.17%
Matches word count
n/a
neededClean
false
words
519
0.00%
Dialogue to Total Word Ratio
Ratio: 73.80%, Deviation: 43.80%
neededClean
false
wordsTotal
523
wordsDialogue
386
13.5829%