NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ09VFWPW0B9YJ1JQARBEFQ6
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
12.5s
Cost:
$0.001637
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
49.9998%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unfolded the blueprints on the cafe table. "We've got a twelve-minute window during Pin the Tail on the Donkey. All eyes on the birthday boy." Bill slurped his smoothie. "And the presents are right there? Just… sitting there?" "A pyramid. A glistening, gift-wrapped pyramid. But it's guarded." "Guards?" "Worse. Parents. Mothers with eyes like hawks and a sixth sense for tampering with loot bags. And the gift table is in the main room." "So we walk in. We're just guests. Uncle Bill and Aunt Molly." "Neither of us is anyone's uncle or aunt." "We will be. Today. I've got balloons." Bill produced a giant foil "7" from his coat. "For the kid. My niece. What's her name?" "Emma." "Perfect. I love Emma. We blow in, coo at the baby, hand over the balloon, and when they start the donkey game…" "We create a distraction. A loud, harmless one." "I can do loud." "The plan requires a smoke bomb. Not you singing." "Fine. What about the getaways? Minivan?" "Station wagon. Sticker-bombed. We'll blend right in." "Brilliant. They'll never suspect a couple in a sticker-covered station wagon." "Exactly. Now, do you have the laser pointer for the alarm system?" "The what?" "The gift table. It has a pressure-sensitive mat. I told you." "Right, right. The laser pointer. Yep. Got it." Molly leaned in. "You don't have a laser pointer, do you?" Bill smiled sheepishly. "I've got a pen that lights up when you click it."
created_at
"2026-04-24T17:50:56.942Z"
4/24/2026, 5:50:56 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1657
moderation_latency
(null)
generation_time
12370
tokens_prompt
77
tokens_completion
577
native_tokens_prompt
315
native_tokens_completion
509
native_tokens_completion_images
(null)
native_tokens_reasoning
144
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0016372
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"4cc0b16285d34c1ebc26f9bd7fb2f28b"
is_byok
false
latency
1580
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777053056-Qx3BwSWFa1ltEHrQHfMK"
session_id
(null)
api_type
"completions"
id
"gen-1777053056-7nVAlRbj1cw1K5kDBsgx"
upstream_id
"4cc0b16285d34c1ebc26f9bd7fb2f28b"
total_cost
0.0016372
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
244
100.00%
Dialogue to Total Word Ratio
Ratio: 89.56%, Deviation: 0.44%
neededClean
false
wordsTotal
249
wordsDialogue
223
49.9998%