NC Bench
Run: 2026-04-24-01KQ03FZDHMR411WZ4Q4ZYBBD4
Run Details
Model: Xiaomi MIMO v2.5 Pro
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 16.7s
Cost: $0.002462
Provider: Xiaomi
Evaluation summary
Score | Evaluator
95.99% | Matches word count
0.00% | Dialogue to Total Word Ratio
47.9935% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her face against the chain-link fence, her breath fogging in the November cold. Across the parking lot, the community center blazed with colored lights. A hand-painted banner read "HAPPY BIRTHDAY SOPHIE!" in wobbly letters. "Count the presents again," she whispered. Bill raised the binoculars. His nose was red and running, which he blamed on allergies and Molly blamed on incompetence. "I count thirty-six. No, wait. Thirty-seven. Someone just carried in a big one with a bow." "Thirty-seven presents." Molly leaned back against the van. "That's what, two hundred a piece on average?" "At least. Sophie's parents are loaded. I heard the dad invented some kind of app." "What kind of app?" "I don't know. An app app. Does it matter?" "It matters if it's a parenting app. Those people are organized. They'll have a guest list." Bill lowered the binoculars. "It's a children's party, Molly. There's going to be a bounce house and a clown named Mr. Sprinkles. We're not infiltrating the Pentagon." "Don't underestimate clowns. They notice things." "You're worried about a clown." "I'm worried about underestimating a clown. There's a difference." A minivan pulled into the lot and a woman wrestled out a gift roughly the size of a refrigerator. Bill watched it through the binoculars and let out a low whistle. "That's got to be a play kitchen. Those go for three hundred easy." "See, this is why we're here. Sophie's living better than us and she's six." "She's turning seven." "Even worse." Molly pulled out a crumpled floor plan she'd drawn on the back of a Denny's placemat. Bill squinted at it. "What's this circle here?" "The bounce house." "And this rectangle?" "The cake table." "And this little square with the star on it?" "That's Mr. Sprinkles. I marked him as a threat." Bill rubbed his temples. "Okay. So we go in through the side door during the magic show. The magician is some guy named Dennis who does birthday parties for eighty bucks. 
He'll have everyone's attention for at least twenty minutes." "Dennis could be good. Or Dennis could be bad. What if his tricks are terrible and the kids get restless? Then we've got forty sugared-up seven-year-olds roaming free." "Then we improvise." "I hate improvising." "You love improvising. You improvised that thing at the pet store and we got away with twelve parakeets." "We got away with eleven. You dropped one." "That parakeet wanted to be free, Molly." She stuffed the placemat back into her jacket. "Fine. Side door during the magic show. I'll handle the presents. You're the lookout." "Why am I always the lookout?" "Because last time you carried things, you dropped a parakeet." Bill opened his mouth, closed it, and picked up the binoculars again. Inside the community center, someone had just wheeled out an enormous cake shaped like a unicorn. The lights flickered. Music started. "Showtime in twenty minutes," Bill said. Molly cracked her knuckles. "Let's go steal a birthday."
created_at
"2026-04-24T15:59:48.161Z"
4/24/2026, 3:59:48 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1698
moderation_latency
(null)
generation_time
16624
tokens_prompt
77
tokens_completion
883
native_tokens_prompt
315
native_tokens_completion
767
native_tokens_completion_images
(null)
native_tokens_reasoning
73
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0024624
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"b36de359aacc491d93a2470d0686791c"
is_byok
false
latency
1601
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777046388-V401ZJYZdJDC0DYyvtmB"
session_id
(null)
api_type
"completions"
id
"gen-1777046388-TP7SMgzWsQCdE2Gq95w7"
upstream_id
"b36de359aacc491d93a2470d0686791c"
total_cost
0.0024624
cache_discount
0.0001536
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
95.99% | Matches word count | n/a | neededClean: false; words: 492
0.00% | Dialogue to Total Word Ratio | Ratio: 66.20%, Deviation: 16.20% | neededClean: false; wordsTotal: 500; wordsDialogue: 331
47.9935% | Overall
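The reported ratio and deviation follow directly from the evaluator's word counts, and the arithmetic can be checked with a short sketch. The function names below are hypothetical and are not taken from NC Bench. The sketch assumes the deviation is the absolute distance from the 50% target in the scenario, and that the overall score is an unweighted mean of the two evaluator scores. The second assumption is only inferred from the displayed figures: 2 × 47.9935 = 95.987, which rounds to the 95.99% shown for the word-count evaluator. How the 0.00% score is derived from a 16.20% deviation is not stated on the page, so it is not modeled here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that appear inside dialogue."""
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance from the target ratio (assumed: 50% from the scenario)."""
    return abs(ratio - target)


def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (an assumption inferred from the page)."""
    return sum(scores) / len(scores)


# Counts reported in the evaluation details: wordsDialogue = 331, wordsTotal = 500.
ratio = dialogue_ratio(331, 500)
deviation = deviation_from_target(ratio)

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# 95.987 is the unrounded word-count score implied by the overall figure;
# 0.0 is the dialogue-ratio score shown on the page.
print(f"Overall: {overall_score([95.987, 0.0]):.4f}%")
```

The two evaluators report different totals (492 words versus 500), which suggests they count words differently; the sketch takes each reported count at face value.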