NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ061WTH9246GFMAJRJ37JCT
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
4.9s
Cost:
$0.000809
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
51.93%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
25.9628%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill crouched behind the hedge, studying the house like a general planning D-Day. "Alright, here's the layout. Party starts at three. Thirty kids, minimum." Molly peered over his shoulder. "Thirty kids? That's chaos we can work with." "Exactly. Peak pandemonium hits around three-fifteen when Sugar Rush kicks in. That's our window." "You want to rob a party full of hopped-up six-year-olds?" "Rob is such an ugly word. We're redistributing presents." Molly snorted. "What's the take?" Bill unfolded a crumpled printout. "According to the registry, we're looking at six Lego sets, four Barbie Dreamhouses, and something called a Squishmallow that apparently retails for sixty dollars." "Sixty bucks for a stuffed blob?" "Don't judge the merchandise." Bill pointed at the back door. "Kitchen entrance. The mom will be busy performing Frozen for the fourteenth time. We slip in during 'Let It Go.'" "Harsh timing. That song goes on forever." "That's the point, Molly. She'll be fully committed." "What about the clown?" Bill froze. "There's a clown?" "Hired entertainment. Big red shoes, squeaky nose, the works." "Clowns are unpredictable. We wait for balloon animal duty. He'll be occupied." Bill rubbed his hands together. "In and out. Three minutes." Molly nodded slowly. "And if something goes wrong?" "We cry and say we're the cake delivery people. Nobody questions cake people."
created_at
"2026-04-24T16:44:32.474Z"
4/24/2026, 4:44:32 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
574
moderation_latency
(null)
generation_time
4833
tokens_prompt
77
tokens_completion
425
native_tokens_prompt
311
native_tokens_completion
373
native_tokens_completion_images
(null)
native_tokens_reasoning
46
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00080896
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"f74c97c591604bccabf3bc2454452b8d"
is_byok
false
latency
541
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777049072-bNuUe141cj5trBhNIVvx"
session_id
(null)
api_type
"completions"
id
"gen-1777049072-g7UAY7UFK49Vft3wQMcs"
upstream_id
"f74c97c591604bccabf3bc2454452b8d"
total_cost
0.00080896
cache_discount
0.00006144
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
51.93%
Matches word count
n/a
neededClean
false
words
216
0.00%
Dialogue to Total Word Ratio
Ratio: 81.36%, Deviation: 31.36%
neededClean
false
wordsTotal
220
wordsDialogue
179
25.9628%