NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ099F6B24RNH7W7SEJ45JS3
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
15.4s
Cost:
$0.001958
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
48.8138%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the blueprint across the sticky table of the 24-hour diner, stabbing a finger at a colorful blob labeled "PRESZENTZ." "Alright, Bill, listen up. This is a high-risk, high-reward operation. The target is the Henderson residence. Birthday party. Saturday. 3 PM sharp." Bill leaned forward, squinting. "A children's party, Molly? Is the loot that good?" "The loot is unparalleled," Molly whispered, her eyes gleaming. "We're talking premium LEGO sets, unopened video games, artisanal plush toys. The parents go all out. Mrs. Henderson posted the gift list online. It's a gold mine of pristine, boxed joy." "So, what's the entry plan?" Bill asked, slurping his coffee. "Front door with a 'Happy Birthday' card full of C4?" "Don't be an idiot, Bill. We're professionals. We blend in. We become part of the ecosystem." She pulled out two garish Hawaiian shirts from a bag. "You're Uncle Bill, a long-lost friend of the family from 'awaii. You're here for the aloha spirit. You get to work the bubble machine." "Excellent. I have a natural gift for bubble-based atmospheric enhancement," Bill nodded, approvingly. Molly continued, "I'll be 'Auntie Molly,' fresh from a yoga retreat. I'm in charge of 'mindful snack distribution.' That gives me access to the kitchen and the party favor staging area." "What about the big guy? The clown?" Bill gestured to a section of the blueprint marked with a frowny face. "That's Bubbles. The primary security detail. His jurisdiction is the backyard. His weapons are balloon animals and a squirt flower. You need to distract him. Challenge him to a bubble-blowing contest. Defeat him." "Consider him defeated. I was the bubbles champion of my cell block." "Good. Now, exfiltration. The presents will be in the living room, piled on the 'throne of greed,' as I call it. Once they start singing 'Happy Birthday,' we make our move. You load the large goods into the custom stroller—" she pointed to a sketch, "—while I secure the handheld loot in the diaper bag. We exit through the side gate, into the getaway minivan we'll have parked." "We're using a minivan?" "Of course we are. We're not monsters. What kind of monsters steal from a kids' party and use a two-door sports car? We blend in. We're just parents leaving early because 'little Timmy has a tummy ache.'" "The plan is beautiful, Molly," Bill sighed, a single tear forming. "It's got poise. It's got strategy. It's got a stroller." "One last thing," Molly said, her voice dropping. "The cake. It's a three-tiered monstrosity. Our intel suggests the middle layer is filled with rainbow sprinkle ice cream. If we can, we secure the cake. It's not part of the primary objective, but it's... bonus." "Understood. Primary objective: presents. Secondary objective: ice cream cake. Tertiary objective: total bubble supremacy over a man named Bubbles." "Exactly. Now, eat up. We've got a long night of practicing 'Aloha' and perfecting our mindful-snack faces ahead of us." Bill grinned, cracking his knuckles. "This is gonna be the best-planned, least violent, most festive heist in history."
created_at
"2026-04-24T17:41:06.391Z"
4/24/2026, 5:41:06 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
741
moderation_latency
(null)
generation_time
15190
tokens_prompt
77
tokens_completion
1080
native_tokens_prompt
311
native_tokens_completion
958
native_tokens_completion_images
(null)
native_tokens_reasoning
216
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00195848
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"44b06fd5f0a3403092ed6596752ed664"
is_byok
false
latency
685
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777052466-HFwBTzsh71qot0TCYIBr"
session_id
(null)
api_type
"completions"
id
"gen-1777052466-m68IgyzPTeJFVQDbfFAZ"
upstream_id
"44b06fd5f0a3403092ed6596752ed664"
total_cost
0.00195848
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
507
0.00%
Dialogue to Total Word Ratio
Ratio: 79.30%, Deviation: 29.30%
neededClean
false
wordsTotal
517
wordsDialogue
410
48.8138%