NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ04DNJDC8HEMP118F0QXVV5
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
19.3s
Cost:
$0.002543
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped the faded flyer with a chipped nail. "Okay, Bill, this is it. Operation: Pied Piper." Bill squinted. "It's a six-year-old's birthday party, Molly. At a park. In the afternoon." "Exactly. Maximum chaos, minimum suspicion. Who's going to look twice at two adults enjoying a leisurely stroll by the swings?" Molly smiled. "We wait for the cake song. Standard procedure." "You mean 'Happy Birthday'?" "Do not get sentimental on me, Bill. The song is our cover. The presents will be piled by the picnic table, right out in the open. It's an insult to security, really." Bill nodded slowly. "So I just… start grabbing?" "No! You will be 'Uncle Bernie.' Distant relative. You'll carry the gift basket—the one with the pre-dented, refilled juice boxes—and place it at the end of the pile. Then, as a gesture of goodwill, you'll offer to take the gift pile to the car 'to keep the wrapping safe.'" "They'll let me?" "Bill, you'll be holding a basket of juice boxes. To them, you're a hero. While you schmooze, I'll be at the balloon arch." "The arch?" "It's structural! I'll loosen the main knot. When I give the signal—a sneeze, very subtle—the arch collapses. Distraction." "Then I run with the presents?" "You don't run. You're 'Uncle Bernie.' You look concerned. You pat a weeping child. You murmur 'structural failure.' Then, calmly, with the presents, you walk to the van." "The one with 'Happy Tymes Bouncing Castles' painted on it?" "It's called a layer of camouflage, Bill. We're bouncing castle technicians. Post-party inspection." "Right, right." Bill fidgeted. "What's my signal to leave?" "If you hear me yell 'Who left this casserole in the sun?!' that means I've loaded the cake. You must not react." "You're stealing the cake too?" "It's a marble cake, Bill. We're not animals." He sighed. "And if it goes wrong?" "Then you use your gift basket. You 'accidentally' spill the juice boxes. Chaos follows. We blend with the panicking families." "Panicking over juice boxes?" "Parents today, Bill. Anything can trigger them. It's our exit strategy." Molly folded the flyer neatly. "Any questions?" "Just one." Bill's eyes were wide. "What if the birthday boy recognizes 'Uncle Bernie' next time?" Molly stared at him. "Bill, we are stealing toys from a child. There is not going to be a 'next time' with the same family. This is a one-time moral bankruptcy. Now go practice looking avuncular." "Avun-what?" "Like you've held a baby before. Or at least a medium-sized dog. Go."
created_at
"2026-04-24T16:16:01.111Z"
4/24/2026, 4:16:01 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
759
moderation_latency
(null)
generation_time
19193
tokens_prompt
77
tokens_completion
907
native_tokens_prompt
315
native_tokens_completion
811
native_tokens_completion_images
(null)
native_tokens_reasoning
186
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0025432
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"c384b1f0ae1b474f9a7d9343e3baa4db"
is_byok
false
latency
670
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777047361-hVQhc5b5ryvsEnPdmjzs"
session_id
(null)
api_type
"completions"
id
"gen-1777047361-dSKA20i77sFbfXaBZkBG"
upstream_id
"c384b1f0ae1b474f9a7d9343e3baa4db"
total_cost
0.0025432
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
420
0.00%
Dialogue to Total Word Ratio
Ratio: 92.25%, Deviation: 22.25%
neededClean
false
wordsTotal
426
wordsDialogue
393
0.0000%