NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ01256A8J1S3728N2MK9EN0
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
12.9s
Cost:
$0.002046
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0001%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the rhododendron bush, its leafy branches providing perfect cover. The suburban house was a riot of colour—streamers, balloons, a giant inflatable dinosaur guarding the front path. “Alright, Bill,” she whispered, peering through a gap in the foliage. “The target is a six-year-old’s birthday party. Security is minimal. A single, distracted dad is on grill duty, and the mother is inside singing ‘Happy Birthday’ for the third time. Phase one: infiltration.” Bill, a man whose broad shoulders were comically disguised beneath a child’s superhero cape, shifted beside her. “Got it. But why are we hitting a kid’s party? The take can’t be that good. My back hurts from sneaking through this stupid bush.” “The take, you magnificent simpleton, is exceptional,” Molly hissed, her eyes gleaming. “You’re not thinking tactically. We’re after the presents. Unopened. High-value items, untouched by sticky fingers. New video games, premium art sets, high-end electronics gifted by doting aunts. It’s a goldmine in wrapping paper.” “Huh. Never thought of it that way.” Bill scratched his head. “So, what’s the costume policy? I feel ridiculous in this cape.” “You look heroic. It’s called blending in. I’ll be the ‘fun’ magician’s assistant. You’re a… generic superhero. Now, listen. The main event is in the living room. I’ve confirmed the loot is piled under a banner that says ‘YOU’RE SIX, HURRAY!’ We slip in during the cake ceremony. The noise will cover our entry.” “And the extraction? I ain’t trying to run out the front with an armful of Lego boxes.” Molly patted a thick rope coiled over her shoulder. “We go out the back. Through the kitchen, past the chaos, and up the oak tree. I’ve already secured a line to the fence. Clean, quiet, professional.” A loud cheer erupted from the house, followed by the straining notes of ‘Happy Birthday.’ “They’re cutting the cake! That’s our window!” Bill’s stomach rumbled. “Smells like chocolate. We’re not swiping a cupcake too, are we?” “No! Focus, Bill! In, grab the gift bags near the fireplace, out before they notice. The dad will be loading the dishwasher. The mother will be taking photos. The children will be in a sugar frenzy. We will be ghosts.” Molly paused. “Very specific, gift-oriented ghosts.” “Right. Ghosts.” Bill cracked his knuckles. “You got a plan for the bouncy castle obstacle? I’m not as agile as I used to be.” “You roll under it. I’ll create a diversion. Now, follow my lead and try not to trip over any ride-on tractors. Operation Birthday Bash is a go.” They slinked around the side of the house, two shadows against the bright, cheerful siding. The party music thumped, a terrible cover of a popular pop song. Just as they reached the back door, it swung open, and a child in a dinosaur costume burst out, trailing streamers. They froze, pressing against the wall. The little dinosaur paused, looked at them, and then toddled off to chase a butterfly. “See?” Molly whispered, a grin spreading across her face. “They’re too young to be effective sentries. It’s almost too easy.” “Famous last words,” Bill muttered, but he followed her inside, the scent of chocolate cake and pure, unadulterated loot filling the air.
created_at
"2026-04-24T15:17:18.166Z"
4/24/2026, 3:17:18 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
987
moderation_latency
(null)
generation_time
12768
tokens_prompt
77
tokens_completion
1163
native_tokens_prompt
311
native_tokens_completion
1002
native_tokens_completion_images
(null)
native_tokens_reasoning
244
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00204648
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"46d38557cdf64f9083acfd14c533e409"
is_byok
false
latency
648
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777043838-vwq03HRFGqBB3YhlrJQy"
session_id
(null)
api_type
"completions"
id
"gen-1777043838-X4VMAzOMdDQtxUJn2IOg"
upstream_id
"46d38557cdf64f9083acfd14c533e409"
total_cost
0.00204648
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
534
0.00%
Dialogue to Total Word Ratio
Ratio: 64.07%, Deviation: 34.07%
neededClean
false
wordsTotal
540
wordsDialogue
346
0.0001%