NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ069626YPHH7WS98MEQGPDS
Run Details
Model:
Xiaomi MIMO v2.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
11.8s
Cost:
$0.001816
Provider:
Xiaomi
Evaluation summary
Score
Evaluator
Details Link
0.01%
Matches word count
(details)
9.95%
Dialogue to Total Word Ratio
(details)
4.9808%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the greasy garage workbench. It was just a crayon drawing of a house with a big sun in the corner. “Alright, Bill, look at this. The target is a classic suburban fortress. Bouncy castle perimeter defense, a possible clown asset—likely non-lethal—and a concentrated payload of prime, unopened loot in the designated ‘Presents’ zone.” Bill, polishing a tiny, rainbow-colored pry bar, looked dubious. “Molly, it’s a five-year-old’s birthday party. It’s not a fortress. It’s a house with streamers.” “Never underestimate streamers, William. They’re a tripping hazard and a visual deterrent.” Molly tapped the drawing. “Now, our window of opportunity is during ‘Pin the Tail on the Donkey.’ The auditory and visual distraction level is off the charts. Adults are blindfolded, kids are screaming. It’s our green light.” “Right, but what’s the getaway plan?” Bill asked. “We can’t just saunter out with armfuls of gifts. What about my van? It’s conspicuous. It says ‘Free Candy’ on the side.” “Exactly! It’s the perfect camouflage. Who suspects the creepy van when there’s a legitimate candy distribution event occurring? We drive up, I honk the clown horn, you throw open the doors. The kids will swarm us for the promised sweets. In the chaos, we transfer the goods. Textbook.” Bill frowned, rubbing his chin. “The parents will be watching. What about my cover story? I had ‘lost uncle’ last time, but Aunt Carol showed up and it got weird.” “New story,” Molly said, her eyes gleaming. “You are ‘Bouncy Steve,’ the entertainment inspector. You have a clipboard and a very serious face. Your job is to assess the structural integrity of the inflatable fun zone. You have full access, and no one questions a man with a clipboard.” “That’s… actually not bad,” Bill conceded. “But the presents. How do we move them?” “Phase two: The Loot Chute,” Molly whispered, producing a length of blue plastic tubing. “The children’s ball pit. We redirect the ball flow, creating a direct conveyor from the gift table, under the fence, and into the van. It’s genius.” “The balls will go everywhere!” “A necessary sacrifice! Think of it as a diversionary tactic. A cascade of colorful plastic, a symphony of popping sounds. No one will hear the gifts sliding down the chute into our waiting arms.” Bill stared at the tube, then at the crayon blueprint, then back at Molly’s determined face. A slow grin spread across his own. “Okay. I’m in. But I get first pick of the toys.” “Naturally,” Molly said, rolling up the blueprints. “Now, review the napkin protocol. You do not, under any circumstances, take the last juice box. It raises questions. Ready on my mark.” She pressed a button on a garage door opener. The door slowly creaked open, revealing the sunlit suburban street. “Let’s go to work.”
created_at
"2026-04-24T16:48:31.311Z"
4/24/2026, 4:48:31 PM
model
"xiaomi/mimo-v2.5-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
757
moderation_latency
(null)
generation_time
11750
tokens_prompt
77
tokens_completion
1098
native_tokens_prompt
311
native_tokens_completion
887
native_tokens_completion_images
(null)
native_tokens_reasoning
233
native_tokens_cached
256
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00181648
router
(null)
provider_responses
0
endpoint_id
"2af4bf94-26fa-42b9-bce0-8b11071e105a"
id
"6e1858ee6246403084d7f61d3926434c"
is_byok
false
latency
531
model_permaslug
"xiaomi/mimo-v2.5-20260422"
provider_name
"Xiaomi"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777049311-R9p0otdgn16cvxOO1xNW"
session_id
(null)
api_type
"completions"
id
"gen-1777049311-kJGKdhQZWzSS3iLGoCZD"
upstream_id
"6e1858ee6246403084d7f61d3926434c"
total_cost
0.00181648
cache_discount
0.00008192
upstream_inference_cost
0
provider_name
"Xiaomi"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.01%
Matches word count
n/a
neededClean
false
words
469
9.95%
Dialogue to Total Word Ratio
Ratio: 76.79%, Deviation: 6.79%
neededClean
false
wordsTotal
474
wordsDialogue
364
4.9808%