NC Bench
Run: 2026-04-24-01KQ01AHE1MM2FH3C8XJFCXEVE
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.0s
Cost:
$0.003153
Provider:
Xiaomi
Evaluation summary
Score | Evaluator
99.92% | Matches word count
39.44% | Dialogue to Total Word Ratio
69.6791% | Overall
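The Overall figure is consistent with an unweighted mean of the two evaluator scores. The report does not state how the benchmark aggregates scores, so the sketch below is an assumption, and the function name overall_score is hypothetical. The displayed evaluator scores are rounded to two decimals, which would explain why the mean of the displayed values (69.68%) differs slightly from the reported 69.6791%.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores (assumed aggregation, not confirmed by the report)."""
    if not scores:
        raise ValueError("need at least one evaluator score")
    return sum(scores) / len(scores)


# Scores as displayed in the evaluation summary (already rounded to two decimals).
displayed_scores = [99.92, 39.44]

overall = overall_score(displayed_scores)
print(f"Overall (from displayed scores): {overall:.2f}%")
```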
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly jabbed a finger at the blueprint spread across their van’s hood, her breath fogging in the chill night air. “Operation Teddy Bear Smash begins in fifteen. You got the distraction?” Bill sighed, adjusting his sequined fanny pack. “A hired clown named Bubbles. He’ll perform a ‘special juggling act’ involving rubber chickens near the bounce house. It’s degrading.” “Perfect. Degrading is memorable. I’ll be disguised as a balloon artist.” Molly popped the trunk, revealing two hideous, giant balloon animals—a green poodle and a purple giraffe. “Your cover is a parent. Here’s your ‘child’.” She shoved the purple giraffe at him. “Meet Gerald. He’s shy, so he won’t talk to anyone.” Bill looked at Gerald’s vacant, puffy head. “He has the same lifeless stare as my first partner, Gary. I feel a bond.” He took it, its neck wobbling. “What’s the loot haul?” “Per the invite list: twenty-two kids. Assuming two presents each, average value fifty bucks, that’s twenty-two hundred. Not bad for a Saturday.” Molly strapped a holster holding a glue gun onto her thigh. “We’re not monsters. We leave a note. ‘From the Grinch of Gift-Getting.’ It’s on brand.” “My brand is being in bed by nine with a warm cup of regret,” Bill muttered, checking his watch. “This is insanity, Moll. It’s a five-year-old’s superhero party. There will be cake. And screaming.” “Which is why the security is nonexistent.” She tapped the blueprint. “The gift pile is in the living room, by the fireplace. Classic amateur hour. We enter through the sunroom, blend during the cake chaos, swipe the presents into these.” She held up two large, empty duffel bags decorated with cartoon dinosaurs. “See? Child-friendly. Then we slip out via the back gate. Simple.” “And if some kid catches us?” Bill asked, watching her inflate the green poodle with a furious squeak. “We improvise. Tell them we’re testing a new toy-saving charity.” The poodle’s head inflated to grotesque proportions. “Or we run. We’re faster than toddlers. 
Probably.” Bill peered at her. “Probably?” “Seventy percent chance.” Molly tied off the poodle’s tail. “Look, the hostess, Mrs. Henderson, is notorious. She makes guests remove their shoes AND socks. We’re doing a public service. Let’s move.” They shuffled up the sidewalk, the absurd balloons leading the way, the faint sound of a clown’s honking horn already drifting from the backyard. “Remember,” Molly whispered as they approached the sunroom door, “walk with purpose. Parents are always frazzled. You are a frazzled parent of an inflatable giraffe. Act natural.” Bill tried to look frazzled, mostly just looking constipated. “I can’t believe I let you talk me into this.” “You believed in the Christmas Caroling Capers,” she said, pulling the door open. “And we ended up singing ‘Jingle Bells’ in a jail cell for eight hours.” “Still counts as a success. We raised that bail money ourselves.” Molly slipped inside. “Stay close to Gerald. And for God’s sake, whatever you do, don’t step on the presents. The noise will bring them all down on us.”
created_at
"2026-04-24T15:21:52.844Z"
4/24/2026, 3:21:52 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1898
moderation_latency
(null)
generation_time
23927
tokens_prompt
77
tokens_completion
1155
native_tokens_prompt
315
native_tokens_completion
946
native_tokens_completion_images
(null)
native_tokens_reasoning
225
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003153
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"e105aff9a4794b5799c459d19161b88f"
is_byok
false
latency
1898
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777044112-TRcCLz6L9fLRWgAYOW4i"
session_id
(null)
api_type
"completions"
id
"gen-1777044112-K8519dwtmIuvn33gWdUB"
upstream_id
"e105aff9a4794b5799c459d19161b88f"
total_cost
0.003153
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
99.92% | Matches word count | n/a | neededClean: false, words: 503
39.44% | Dialogue to Total Word Ratio | Ratio: 65.69%, Deviation: 4.31% | neededClean: false, wordsTotal: 510, wordsDialogue: 335
69.6791% | Overall
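The Ratio and Deviation values for the dialogue evaluator can be reproduced from its metadata. The minimal sketch below assumes the ratio is wordsDialogue divided by wordsTotal and that the deviation is the absolute distance from the 70% target named in the scenario. The function name dialogue_ratio_pct is hypothetical. How the benchmark maps a 4.31-point deviation to the 39.44% score is not stated in the report, so that step is not reproduced here.

```python
TARGET_RATIO_PCT = 70.0  # from the scenario: "Write 500 words with 70% dialogue"


def dialogue_ratio_pct(words_dialogue, words_total):
    """Share of dialogue words as a percentage of all counted words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


# Values taken from the evaluator metadata: wordsDialogue and wordsTotal.
ratio = dialogue_ratio_pct(335, 510)
deviation = abs(TARGET_RATIO_PCT - ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

Note that this evaluator counted 510 words in total while the word count evaluator reported 503, so the two evaluators appear to tokenize the response differently.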