NC Bench
Overview
About
Tests
Models
Model
MoonshotAI: Kimi K2.5
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYYYAB8VHBTWHAYTFT2NZGX
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted at the glitter-caked invitation, flicking a speck of pink dust from her leather jacket. The van smelled like stale fries and desperation. "You do realize this says 'Princess Petunia's Enchanted Garden Extravaganza,' right? We're robbing a five-year-old." Bill adjusted his fake mustache in the rearview mirror of the stolen van, looking pleased with himself despite the sweat gathering on his forehead. "Technically, we're robbing thirty-seven five-year-olds. And one hedge fund manager who bought his daughter a pony wrapped in bitcoin. She's got three PlayStation 5s, Molly. Three. Plus a life-sized animatronic dinosaur." "It's a bouncy castle venue, Bill. With a face-painting station." "Exactly! Distraction infrastructure. We enter as entertainment, blend with the chaos, snag the gift table, vanish before Happy Birthday." Molly lowered her sunglasses, staring at him over the rims. "What do you mean 'we'? I'm not wearing a clown nose. Last time I wore rubber shoes, I got a fungal infection that required medical intervention." "Better. You're Princess Petunia's evil twin, Duchess Dread. I made you a sash. Velvet slippers, not rubber. Very classy." "You want me to impersonate a fictional royal at a party where parents memorized the guest list?" Bill waved his hand dismissively, nearly knocking over his coffee. "No, I hired us through a gig app. Very official. I told them you're from the agency. The evil twin agency." "That's not a thing." "It is now. I printed business cards." She rubbed her temples, feeling a headache bloom behind her eyes as she contemplated the moral implications. "What about the presents?" "Custom wagon. Looks like a caterer's cart, actually a high-speed extraction vehicle. Modified with cup holders." "For the juice boxes?" "For our nerves, Molly. We're professionals." "Professionals don't face security threats like 'nap time' and 'pin the tail on the donkey.' Or trigger-happy dads with camera phones and pepper spray." Bill unfolded a blueprint scrawled on the back of a coloring book page, pointing at a crayon drawing of a garden. "Gift table sits beside the face-painting station. Classic blind spot. Parents photograph kids as tigers while we load tablets and LEGO sets into the wagon." "You scoped this out?" "Attended the rehearsal. Brought my niece. Very disappointing cheese stick-to-carrot ratio in the goodie bags. Cheap stuff." Molly sighed, lifting a sequined villain cape from the pile of costumes on his dashboard. The fabric smelled like cotton candy and regret. "If we get caught, I'm blaming you entirely. I'll claim you kidnapped me." "Just yell 'surprise audit' and start checking gift receipts. No one questions bureaucracy at a children's party. They'll assume we're from the IRS or the clown union." "Exit strategy?" "Balloon drop at three PM. Visual cover. We roll out while they scream about sugar rushes and the magician starts his dove routine." She checked her watch, glancing at the cracked dashboard clock, then looked back at him with raised eyebrows. "You're disturbingly prepared for this." "I used to plan bank vaults in Luxembourg, Molly. This is easier. Less lasers, more glitter. Same principle: identify the mark, exploit the weakness, don't step on any small humans." "You brought a flare gun?" "For emergencies." She shook out the cape, watching sequins scatter across the torn upholstery of his passenger seat and stick to an old gum wrapper. "Fine. But I'm keeping the tiara. And if I see a single balloon animal near my face, I'm shooting it with your flare gun." "Only if you earn it, Duchess."
created_at
"2026-02-08T15:50:51.154241+00:00"
Sunday, February 8th, 2026 at 3:50:51 PM GMT+00:00
model
"moonshotai/kimi-k2.5-0127"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
821
moderation_latency
(null)
generation_time
240278
tokens_prompt
61
tokens_completion
12971
native_tokens_prompt
68
native_tokens_completion
12873
native_tokens_completion_images
(null)
native_tokens_reasoning
13256
native_tokens_cached
68
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0386258
router
(null)
provider_responses
0
id
"chatcmpl-6988afea4afc418ab6d96591"
status
200
is_byok
false
latency
821
endpoint_id
"49762299-66fd-4514-b0ee-c9e11e231e3b"
provider_name
"Moonshot AI"
model_permaslug
"moonshotai/kimi-k2.5-0127"
api_type
"completions"
id
"gen-1770565609-SMQFgdZzwZCc4RkNLlIG"
upstream_id
"chatcmpl-6988afea4afc418ab6d96591"
total_cost
0.0386258
cache_discount
0.000034
upstream_inference_cost
0
provider_name
"Moonshot AI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
589
95.4253%
Dialogue to Total Word Ratio
Ratio: 70.97%, Deviation: 0.97%
neededClean
false
wordsTotal
589
wordsDialogue
418
47.7127%