NC Bench
Run: 2026-04-24-01KQ04FGCPBZ740MMW997CKCSS
Run Details
Model: Xiaomi MIMO v2.5
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 14.3s
Cost: $0.002034
Provider: Xiaomi
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0002% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crinkled blueprint across the dusty table, which was actually just a large pizza box lid. “Right, listen up. Saturday, 14:00 hours. Target: the Henderson residence. Backyard soirée. Objective: recover all presents. Primary objective: the mega-bouncer with integrated slide and water misting function.” Bill squinted. “That’s Timmy Henderson’s fifth birthday party, Molly. That’s not a ‘soirée.’ That’s a parental hostage situation.” “It’s a soft target, Bill. All those loot bags, the piñata filled with candy coins, the custom-decorated cupcakes. Think of the plunder.” “I am thinking. I’m thinking about the security. What security? It’s a garden. There’ll be a hundred screaming witnesses, all under the age of eight. They’re unpredictable. They’re sticky. They have no respect for personal space.” “That’s why we blend in,” Molly said, tapping the blueprint. “I have uniforms. We go in as ‘Uncle Gary’s Party Helpers.’ We carry the presents to the ‘safe vehicle.’ That’s my minivan with the non-functioning air conditioning.” “Uncle Gary doesn’t exist.” “He does now. He’s a very distant, very tired relative who sends his apologies and two replacements.” Molly handed Bill a garish, rainbow-striped vest. “You put this on, you’re invisible. To adults, you’re just another hired help. To kids, you’re furniture.” Bill held up the vest like it was a dead fish. “And what’s my role in this catastrophe?” “You’re on distraction duty. When I start loading the gifts, you engage the clown.” “Engage the clown? What does that even mean?” “Do whatever you have to. Challenge him to a balloon animal duel. Ask him profound questions about his greasepaint. Just keep his big, painted eyes off the prize wagon. His name is reportedly ‘Bubbles.’ He’ll be the center of their chaotic attention.” “This is insane. The risk-reward ratio is catastrophic. 
We could get a collective time-out.” “Or we could get a year’s supply of toy dinosaurs and a remote-control car that screams when it crashes. I want the car, Bill. I want it to scream for me.” Bill sighed, the sound of a man weighing felony against rubber chickens. “The clown knows the terrain. He has the kids’ trust. He’s probably ex-special forces.” “He’s a man in a wig making dogs out of long balloons. You have a knife and a healthy fear of small, sticky hands. You’ll manage.” “And if we’re caught?” Molly straightened up, a glint in her eye. “Then we deploy the fail-safe. We claim we’re part of an elaborate birthday surprise orchestrated by the children’s absent father. It works every time.” “It’s never been tried.” “Exactly. It’s statistically foolproof.” She shoved the rainbow vest into his hands. “Now put it on. We need to practice our ‘Uncle Gary’ gait. A little slouch. A bit of world-weariness. We look like we’ve just assembled a three-story swing set and are deeply questioning our life choices.” Bill reluctantly pulled the vest over his head. “I can’t believe I’m going to prison for stealing a ‘Paw Patrol’ playset.” “We’re not going to prison. We’re going to be legends. The Great Children’s Party Heist. Now, let’s go over the entry point. The gate has a simple latch, but watch out for the golden retriever. He’s a known barker. We bribe him with a single, delicious-looking hot dog.” “A hot dog? That’s the plan?” “Everything has a price, Bill. Even justice.”
created_at
"2026-04-24T16:17:01.346Z"
4/24/2026, 4:17:01 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1037
moderation_latency
(null)
generation_time
14108
tokens_prompt
77
tokens_completion
1208
native_tokens_prompt
311
native_tokens_completion
996
native_tokens_completion_images
(null)
native_tokens_reasoning
207
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00203448
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"f64c1f536ab14b06801831650ef3cd3b"
is_byok
false
latency
986
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777047421-tc5HERN19IbBzdTcPrOO"
session_id
(null)
api_type
"completions"
id
"gen-1777047421-rQLvUjzz3GiKNxfrk8jc"
upstream_id
"f64c1f536ab14b06801831650ef3cd3b"
total_cost
0.00203448
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 552
0.00% | Dialogue to Total Word Ratio | Ratio: 85.79%, Deviation: 15.79% | neededClean: false; wordsTotal: 563; wordsDialogue: 483
0.0002% | Overall
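The ratio and deviation reported for the "Dialogue to Total Word Ratio" evaluator can be reproduced from the wordsDialogue and wordsTotal values and the 70% target stated in the scenario. The sketch below is only an illustration of that arithmetic: the function names are hypothetical, and treating the deviation as the absolute difference from the target is an assumption, since the page does not show how the evaluator computes it or how the deviation maps to the 0.00% score.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Return the share of dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float) -> float:
    """Assumed definition: absolute distance from the target, in percentage points."""
    return abs(ratio_pct - target_pct)


# Values taken from the evaluation details of this run.
words_dialogue = 483
words_total = 563
target_pct = 70.0  # from the scenario: "Write 500 words with 70% dialogue"

ratio = dialogue_ratio_pct(words_dialogue, words_total)
deviation = deviation_pct(ratio, target_pct)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 85.79%, Deviation: 15.79%"
```

Under these assumptions the computed figures match the reported "Ratio: 85.79%, Deviation: 15.79%".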