NC Bench
Run: 2026-04-24-01KQ08F0824J45QJAQYEPG69WH
Run Details
Model: Xiaomi MIMO v2.5
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 12.6s
Cost: $0.001923
Provider: Xiaomi
Evaluation summary
Score      Evaluator                      Details Link
99.38%     Matches word count             (details)
0.00%      Dialogue to Total Word Ratio   (details)
49.6885%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain drummed a chaotic rhythm on the dumpster lid. Molly checked her watch for the third time in a minute. "He's late. For a heist, that's a felony in the first degree." A soggy cardboard box shifted two dumpsters down, and Bill emerged, shaking a confetti-studded sneaker. "Had to take the scenic route. Mrs. Henderson's Chihuahua was on patrol." "Tell me you have the intel," Molly said, her voice low. "Better," Bill produced a crumpled sheet of paper, glistening with what might have been juice. "The floor plan. From the inside." "You went in?" "Of course I went in. The piñata was a donkey. A classic. I had to pay respects." He smoothed the paper on a wet lid. It was a child's drawing of a house. "Here's the target zone. The presents are stacked under a banner that says 'Happy 5th Birthday, Leo.'" "Beautiful," Molly murmured. "Five-year-olds have the best loot. Unregulated. No receipts." "What's the security situation?" Molly asked. "Dire. A single adult, designated 'Uncle Steve,' was tasked with watching the children. When I saw him, he was asleep on a chair, a half-eaten hot dog in his hand." "Amateur," Molly scoffed. "And the noise?" "The screams of delight are constant. Perfect cover. But the high-pitched shrieking does make comms difficult." He tapped his ear. "My hearing is still fuzzy from a game of musical chairs." "The approach?" Molly was already visualizing it. "We come in through the garden during the 'Duck, Duck, Goose' segment. The kids are mesmerized. We’re just two other adults. Maybe holding balloons." "Brilliant. Blending in is key. What’s the extraction?" "The 'Cooler.' You bring it,假装 to be full of ice cream. We load the presents in, carry it out. No one questions the ice cream man." "Bill, this is why you're the best." Molly clapped him on the shoulder. "Now, the objective. Are we talking everything? The plastic swords, the coloring books?" Bill gave her a look. "We're professionals. We go for the big-ticket items. The wrapped boxes. 
The mystery is where the value is. Could be a console, could be a lifetime supply of glitter." "Glitter is not valuable. Glitter is a curse." "In the right market, everything has value. Now, timing. We strike at the climax: the cake ceremony. When they sing 'Happy Birthday,' we sing too, then move. The emotional peak is our window." "Emotional peak. Got it." Molly cracked her knuckles. "Let’s go make some five-year-old’s birthday unforgettable." "For the wrong reasons," Bill added, pulling a black beanie over his head. "For the *right* reasons," Molly corrected with a grin. "We’re making it memorable. Now, about your disguise. You still look like a man who lost a fight with a piñata." "That was the point! I was establishing credibility." He plucked a tiny star-shaped sticker from his jacket. "See? Infiltration complete." They slipped into the night, two shadows on a quest for plastic and paper-wrapped treasure, leaving only the smell of wet cardboard and the faint echo of a children's song between them.
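The scenario asks for 50% of the words to be dialogue, so the response has to be split into quoted and unquoted words before it can be scored. The sketch below is one plausible way to do that split, not the benchmark's actual evaluator: it assumes dialogue is any text between straight double quotes and that words are whitespace-separated. The function name `count_dialogue_words` and the sample string are hypothetical.

```python
import re


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words) for a prose scene.

    Assumption: dialogue is whatever sits between straight double quotes,
    and a "word" is any whitespace-separated token.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    words_total = len(text.split())
    return words_dialogue, words_total


# Hypothetical sample adapted from the opening of the response.
sample = 'Molly checked her watch. "He\'s late," she said. "Tell me you have the intel."'
words_dialogue, words_total = count_dialogue_words(sample)
print(words_dialogue, words_total)  # prints "8 14"
```

A counter like this will not necessarily reproduce the word totals reported for this run, since the real evaluators may tokenize or clean the text differently.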
created_at: "2026-04-24T17:26:39.123Z" (4/24/2026, 5:26:39 PM)
model: "xiaomi/mimo-v2.5-20260422"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 648
moderation_latency: (null)
generation_time: 12456
tokens_prompt: 77
tokens_completion: 1064
native_tokens_prompt: 311
native_tokens_completion: 930
native_tokens_completion_images: (null)
native_tokens_reasoning: 206
native_tokens_cached: 192
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00192296
router: (null)
provider_responses:
  0:
    endpoint_id: "2af4bf94-26fa-42b9-bce0-8b11071e105a"
    id: "de538bc94ab74f029782dbfda5fd4741"
    is_byok: false
    latency: 593
    model_permaslug: "xiaomi/mimo-v2.5-20260422"
    provider_name: "Xiaomi"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777051599-pDjtzqYTW9kCn9DBoLYY"
session_id: (null)
api_type: "completions"
id: "gen-1777051599-H0zRq0wpI8EPBpydBHKP"
upstream_id: "de538bc94ab74f029782dbfda5fd4741"
total_cost: 0.00192296
cache_discount: 0.00006144
upstream_inference_cost: 0
provider_name: "Xiaomi"
response_cache_source_id: (null)
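The metadata above is enough to estimate generation throughput for this run. The sketch below does that arithmetic; it assumes `generation_time` is in milliseconds (consistent with the reported 12.6s duration, but not stated on the page) and that `native_tokens_completion` is the right token count to divide by. The helper name `tokens_per_second` is hypothetical.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Completion tokens generated per second of generation time.

    Assumption: generation_time_ms is expressed in milliseconds.
    """
    return completion_tokens / (generation_time_ms / 1000)


# Values copied from the run metadata.
native_tokens_completion = 930
generation_time = 12456  # assumed milliseconds

throughput = tokens_per_second(native_tokens_completion, generation_time)
print(f"{throughput:.2f} tokens/s")  # prints "74.66 tokens/s"
```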
Evaluation details
Result     Evaluator                      Details                            Meta Data
99.38%     Matches word count             n/a                                neededClean: false; words: 505
0.00%      Dialogue to Total Word Ratio   Ratio: 69.13%, Deviation: 19.13%   neededClean: false; wordsTotal: 515; wordsDialogue: 356
49.6885%   Overall
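The ratio and deviation reported for the dialogue evaluator can be checked directly from its metadata. The sketch below recomputes them from `wordsDialogue` and `wordsTotal` against the 50% target in the scenario, then averages the two rounded evaluator scores. Treating the overall score as a plain mean is an assumption that happens to match the reported figure; how the 19.13% deviation maps to a 0.00% score is not shown on the page and is not modeled here. The function name `dialogue_ratio` is hypothetical.

```python
def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, deviation) of dialogue words against a target share."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


# Values copied from the evaluator metadata.
ratio, deviation = dialogue_ratio(words_dialogue=356, words_total=515)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 69.13%, Deviation: 19.13%"

# Assumption: the overall score is the unweighted mean of the evaluator scores.
overall = (0.9938 + 0.0) / 2
print(f"Overall from rounded scores: {overall:.2%}")
# prints "Overall from rounded scores: 49.69%"
```

The reported overall of 49.6885% rounds to the same 49.69%, which suggests the displayed 99.38% is itself a rounded value.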