NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ01WVJD9FMJHCWDTS59DV9W
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
8.9s
Cost:
$0.001486
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
0.03%
Dialogue to Total Word Ratio
(details)
1.0227%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill spread the blueprints across the kitchen table, though they were clearly just a crayon drawing of a house he'd found in a recycling bin. "Alright, Molly, here's the target. The Henderson residence. Saturday. Three PM." Molly leaned over and squinted at the paper. "Bill, this is drawn in purple crayon." "That's tactical discretion. Nobody suspects crayon." "What's the job, Bill?" "Children's birthday party. Five-year-old's bash. The haul is massive, Molly. I'm talking gift bags stuffed with toys, wrapped boxes with those little curly ribbons, gift cards to toy stores. We're talking a fortune." Molly crossed her arms. "You want to rob a five-year-old's birthday party." "I want to rob a five-year-old's birthday party." "There will be adults there, Bill." "How many are we talking?" "I'd estimate at least fifteen parents standing around making awkward small talk and eating sheet cake." Bill waved his hand dismissively. "They'll be distracted. Have you ever watched parents at a kids' party? They're zombies. They've been awake since five AM. They've already survived a bouncy castle situation. They won't even see us." "See us how? We can't exactly walk in the front door." "We walk in the front door." "Bill." "We walk in the front door, Molly. I've already RSVP'd." Molly stared at him. "You RSVP'd to a child's birthday party." "On the Evite. I said I was coming. I'm Gary from Accounting." "You don't work in accounting." "I work in accounting now. I brought seven bottles of wine." "You can't bring wine to a five-year-old's party." "That's what makes it a perfect cover. Everyone's confused. When people are confused, they don't ask questions. I walk in confused. They're confused. We're all confused together. That's called blending in." Molly pinched the bridge of her nose. "And the exit strategy?" "I rented a minivan." "You rented a minivan." "I rented a minivan, Molly. It's got a sliding door on both sides. Maximum efficiency. We load the haul in the back while the kids are distracted by the clown." "There's a clown?" "There's always a clown. Nobody likes clowns, but everyone has to pretend. That's our window. When that clown starts making balloon animals, every parent in that room is going to be staring at their shoes praying for death. That's when we make our move." "And what do we do if someone notices two grown adults carrying armfuls of presents toward the exit?" "We cry." Molly paused. "We cry?" "Emotional overwhelm. I've been practicing." Bill's face crumpled into an exaggerated sob. "These kids, Molly. They have so much. It's just so beautiful watching them share." "That's disturbing, Bill." "It's Oscar-worthy is what it is. Now, are you in or not?" Molly looked at the crayon blueprints. She looked at Bill. She looked at the minivan keys dangling from his finger. "What flavor is the cake?" "Confetti." Molly sighed. "I'm in."
created_at
"2026-04-24T15:31:53.049Z"
4/24/2026, 3:31:53 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
626
moderation_latency
(null)
generation_time
8777
tokens_prompt
77
tokens_completion
834
native_tokens_prompt
311
native_tokens_completion
722
native_tokens_completion_images
(null)
native_tokens_reasoning
46
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00148648
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"a6ec8fc9088349dba8c7ebaf6d38b17e"
is_byok
false
latency
580
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777044713-EjwEirWexi31xHq1r7fP"
session_id
(null)
api_type
"completions"
id
"gen-1777044713-jfOXiJdrWcHmMlpOWXTB"
upstream_id
"a6ec8fc9088349dba8c7ebaf6d38b17e"
total_cost
0.00148648
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
475
0.03%
Dialogue to Total Word Ratio
Ratio: 82.64%, Deviation: 12.64%
neededClean
false
wordsTotal
484
wordsDialogue
400
1.0227%