NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ09A62C1NHAV70XQY639DZ0
Run Details
Model:
Xiaomi MIMO v2.5 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
20.5s
Cost:
$0.002873
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1376%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The balloon arch was a nightmare of pastel aggression. Molly crouched behind a particularly grinning clown statue, adjusting her periscope. "This is what we call a soft target, Bill. Maximum chaos, minimal supervision." Bill peered over her shoulder, his clown wig slightly askew. "You said there'd be a castle bounce-house blocking the view of the gift table." "There is a castle bounce-house. You're looking at the moat of discarded shoes." Molly tapped the blueprint. "We enter during the cake ceremony. All eyes will be on the frosting." "Right, right." Bill fumbled with his utility belt, sending a set of lock picks cascading onto the driveway. "Oops." "Those are for the prize cabinet at the arcade, Bill. The presents are secured with nothing more than optimism and sticky fingers." She sighed. "Just follow my lead. I'll create a diversion with the magician." "What's the diversion?" "I'm going to ask him to make my debt disappear." The party was in full, shrieking swing. Molly navigated the chaos with the grace of a cat in a minefield, while Bill got briefly tangled in a streamer. They converged on the gift table, a mountain of brightly wrapped boxes and tubes. "Go, go," Molly whispered. "I've got the oversized teddy bear as our primary haul. It's surprisingly dense." Bill grabbed a small, cylindrical package. "What about this one? It's light." "It's a roll of wrapping paper, you walnut." Suddenly, the birthday boy—a tiny tyrant named Bradley—appeared before them, a smear of blue icing on his cheek. He pointed a frosting-encrusted finger at Bill. "You," Bradley declared. "You're the bad guy from the pirate show." Bill froze, the roll of wrapping paper clutched to his chest. "I'm not... I'm a clown." "The pirate had a clown wig. He stole the treasure." Bradley's lower lip began to tremble. Molly stepped smoothly in front. "He's not a pirate, sweetie. He's a... interior decorator. We're here to rearrange the presents. For... feng shui." Bradley squinted. "What's feng shui?" "It means everything has its proper place," Molly said, plucking the wrapping paper from Bill's hands and placing it back on the table. "And this belongs here. With its friends." Bill, catching on, nodded vigorously. "Yes. Friends. Now if you'll excuse us, we must... consult with the bounce-house." He grabbed the teddy bear and began to walk stiffly toward the exit. Bradley watched them go, then turned to his mother. "Mommy, the feng shui man is stealing the bear." In the ensuing commotion, a rogue Super Soaker emerged, and a direct hit to Bill's clown wig sent it flying, revealing his bald, panicked head. They sprinted for the van, a platoon of face-painted children in hot pursuit. As they peeled away, Molly looked at the lone, sopping teddy bear in the back. "Well," she said, "we got the primary asset." Bill wrung out his wig. "Next time, can we rob a bank? Less sticky."
created_at
"2026-04-24T17:41:29.815Z"
4/24/2026, 5:41:29 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1263
moderation_latency
(null)
generation_time
20413
tokens_prompt
77
tokens_completion
1064
native_tokens_prompt
315
native_tokens_completion
921
native_tokens_completion_images
(null)
native_tokens_reasoning
235
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0028732
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"10900ecae9254f43a827a0cde86c1ecc"
is_byok
false
latency
1181
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777052489-qfiYnFCy5RmxGmeNy6no"
session_id
(null)
api_type
"completions"
id
"gen-1777052489-I7s6r7zzzagnxm2HT8Nk"
upstream_id
"10900ecae9254f43a827a0cde86c1ecc"
total_cost
0.0028732
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
485
0.00%
Dialogue to Total Word Ratio
Ratio: 46.94%, Deviation: 16.94%
neededClean
false
wordsTotal
490
wordsDialogue
230
30.1376%