NC Bench
Run: 2026-04-24-01KQ01SAQPTVB3J23KCC81G66E
Run Details
Model: Xiaomi MIMO v2.5 Pro
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 19.3s
Cost: $0.002663
Provider: Xiaomi
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0002%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The basement smelled like mildew and broken dreams, which Molly thought was fitting for a planning session. She unrolled a hand-drawn map across the folding table while Bill leaned over her shoulder, squinting. "Okay, the party starts at two," Molly said, tapping the map. "We have a three-hour window before the first parent comes to pick up their kid." "Three hours to steal presents from children," Bill said, rubbing his chin. "I want you to know that this feels different from the museum job." "It pays less, too." "Then why are we doing this?" Molly sighed. "Because we owe Tanya a debt, and Tanya's kid wasn't invited to this party. Apparently the birthday boy called Tanya's kid a 'booger head' on the playground, and now Tanya wants revenge." "Revenge through gift theft." "Revenge through gift theft," Molly confirmed. Bill studied the map. "What's this circle here?" "That's the bounce house. It's the centerpiece of the operation. Every kid will be in that thing at some point. When they are, the living room where the presents are stacked will be empty." "How do you know that?" "Because I called the party planner pretending to be a health inspector. She told me everything. The bounce house is a professional-grade unit, two hundred square feet, maximum occupancy twelve children." Bill blinked. "You impersonated a government official to case a children's party." "Commitment to the craft, Bill." "Right." He pointed to another section of the map. "What about this entrance?" "That's the side door off the kitchen. Parents will be coming in and out through the front, which means the side door is our way in. I've got a caterer's uniform for each of us." "We're going as caterers?" "We're going as caterers. We walk in carrying trays. Nobody questions a person carrying a tray." "What's on the tray?" "Nothing. That's the beauty. If anyone asks, we're picking up the empty trays from the appetizer course." Bill nodded slowly. "That's actually not bad." 
"I also need you to handle the clown." "The clown?" "There's a clown booked for two-thirty. His name is Bumbles. He's going to be doing balloon animals near the front entrance. If he sees us leaving with armfuls of wrapped presents, he might cause a scene." "What do you want me to do?" "Distract him. You know any balloon animals?" "I can make a sword." "That'll work. Challenge him to a balloon sword fight. Kids will love it. He won't be able to refuse without looking like a bad sport." Bill crossed his arms. "I can't believe I'm going to sword fight a clown named Bumbles." "Think of it as improvisational theater." "And what about security? Any cameras?" "One, but it's pointed at the driveway. The homeowner installed it last month after someone kept stealing his newspaper." "Was that us?" "That was us." "Okay, so side door, caterer disguise, present extraction during bounce house peak, and I duel a clown." Bill took a deep breath. "What's the exit strategy?" "We load the presents into the van out back. You back it into the alley behind the fence at two-fifteen." "And if something goes wrong?" Molly smiled. "Nothing goes wrong at a children's party, Bill. What are they going to do, release the hounds?" "There could be a dog." "There's a golden retriever named Mr. Biscuits. He's eleven and sleeps twenty hours a day." Bill extended his hand. "Alright. I'm in." Molly shook it. "Welcome to Operation Booger Head."
created_at
"2026-04-24T15:29:57.51Z"
4/24/2026, 3:29:57 PM
model
"xiaomi/mimo-v2.5-pro-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1228
moderation_latency
(null)
generation_time
19210
tokens_prompt
77
tokens_completion
996
native_tokens_prompt
315
native_tokens_completion
851
native_tokens_completion_images
(null)
native_tokens_reasoning
67
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0026632
router
(null)
provider_responses
0
endpoint_id
"5c24ecc9-5ba1-4af0-8877-3402889e7d0a"
id
"386b1b593e484a2fa706c8b6d899f593"
is_byok
false
latency
1140
model_permaslug
"xiaomi/mimo-v2.5-pro-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777044597-waXq6zkL5jeVG6nCacHK"
session_id
(null)
api_type
"completions"
id
"gen-1777044597-Epvmrw9DN4MML0zYrtee"
upstream_id
"386b1b593e484a2fa706c8b6d899f593"
total_cost
0.0026632
cache_discount
0.0002048
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
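The metadata above is enough for a rough throughput estimate. The following is a minimal Python sketch, assuming that generation_time is reported in milliseconds (which is consistent with the 19.3s duration in the run details) and that native_tokens_completion is the appropriate token count to use. The function name tokens_per_second is hypothetical and is not part of NC Bench or any provider API.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Completion tokens produced per second of generation time."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000)


# Values copied from the metadata fields above.
native_tokens_completion = 851
generation_time = 19210  # assumed to be milliseconds

rate = tokens_per_second(native_tokens_completion, generation_time)
print(f"{rate:.1f} tokens/s")
```

Under these assumptions the run works out to roughly 44 tokens per second. Using the normalized tokens_completion value (996) instead would give a higher figure, so any comparison across runs should use the same field throughout.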
Evaluation details
Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false; words: 577

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 85.74%, Deviation: 15.74%
Meta Data: neededClean: false; wordsTotal: 582; wordsDialogue: 499
0.0002%
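The reported ratio and deviation follow directly from the word counts in the metadata: 499 dialogue words out of 582 total is 85.74%, which is 15.74 points above the 70% target. The Python sketch below reproduces that arithmetic and also shows how far the 577-word count lands from the 500-word target. It is an illustration under stated assumptions, not the benchmark's actual evaluator: the helper names are hypothetical, the dialogue counter assumes dialogue is any text between straight double quotes, and words are assumed to be whitespace-separated tokens. The two evaluators report different totals (577 and 582), so their counting rules presumably differ from each other and possibly from this sketch. How either evaluator maps a deviation to a 0.00% score is not stated in the report and is not modeled here.

```python
import re

TARGET_WORDS = 500   # from the scenario: "Write 500 words"
TARGET_RATIO = 0.70  # from the scenario: "70% dialogue"


def dialogue_stats(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words) for a passage.

    Assumption: dialogue is the text between straight double quotes,
    and a word is any whitespace-separated token.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    return dialogue_words, total_words


def ratio_and_deviation(dialogue_words: int, total_words: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = dialogue_words / total_words
    return ratio, abs(ratio - target)


def word_count_overshoot(words: int, target: int = TARGET_WORDS) -> float:
    """Return the signed relative distance from the target word count."""
    return (words - target) / target


# Counts copied from the evaluation metadata above.
ratio, deviation = ratio_and_deviation(499, 582)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

overshoot = word_count_overshoot(577)
print(f"Word count overshoot: {overshoot:.1%}")

# The naive counter applied to a short sample line.
sample = 'She sighed. "We go at two," Molly said.'
sample_counts = dialogue_stats(sample)
print(sample_counts)
```

The first print statement reproduces the "Ratio: 85.74%, Deviation: 15.74%" line from the details. The response overshoots both targets by a similar margin: about 15% too many words and about 16 points too much dialogue.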