Model

MoonshotAI: Kimi K2.5

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2026-02-08-01KGYZYW2D43WDVPDFF487S4A0

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"You’re telling me we’re robbing a five-year-old?" Bill asked, adjusting his ski mask in the rearview mirror. "Six-year-old, actually. And it’s not just any six-year-old—it’s Brayden Worthington III. His parents bought him a solid gold rocking horse," Molly said, checking her lock picks. "Still feels like punching down." "It’s punching laterally. The kid’s net worth exceeds ours combined. Plus, he bites. I’ve seen the surveillance footage. He’s a biter." Bill sighed. "What’s the security situation?" "Clowns, mostly. One does magic, poorly. And a bounce house serving as the vault. It’s inflatable fortress technology. Very sophisticated." "Standard birthday protocol?" "Piñata at three, cake at four, present grab at five. We go in during the magic show distraction. That’s when the parents zone out completely, checking their phones for stock prices." "What about the chaperones?" "One’s doing balloon animals, the other’s face-down in the punch bowl. We’ll blend in as caterers. I’ve got the aprons with the ruffles." "With ski masks?" "We’re avant-garde caterers. Very exclusive. Michelin stars require mystery and face coverings. It’s art." Bill paused. "Do we at least leave the goodie bags?" "Bill, we’re thieves, not monsters. We take the gold horse, the gaming consoles, and maybe that robot dog, but the stickers and juice boxes stay. Those are sacrosanct. The juice boxes have straws, Bill. Straws." "Noble." "Professional standards, Bill. We have professional standards. I’m not spending Christmas in jail because I stole a juice box from a toddler. That’s bad karma."
created_at"2026-02-08T16:05:43.212793+00:00" Sunday, February 8th, 2026 at 4:05:43 PM GMT+00:00
model"moonshotai/kimi-k2.5-0127" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency804
moderation_latency(null)
generation_time66134
tokens_prompt61
tokens_completion3205
native_tokens_prompt68
native_tokens_completion3263
native_tokens_completion_images(null)
native_tokens_reasoning3211
native_tokens_cached68
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"stop" 
usage0.0097958
router(null)
provider_responses
0
id"chatcmpl-6988b414409d90d3b0774dd0" 
status200
is_byokfalse
latency804
endpoint_id"49762299-66fd-4514-b0ee-c9e11e231e3b" 
provider_name"Moonshot AI" 
model_permaslug"moonshotai/kimi-k2.5-0127" 
api_type"completions" 
id"gen-1770566676-9N6N3UwrIiuZgCSJV9mg" 
upstream_id"chatcmpl-6988b414409d90d3b0774dd0" 
total_cost0.0097958
cache_discount0.000034
upstream_inference_cost0
provider_name"Moonshot AI" 

Evaluation details

Result Evaluator Details Meta Data
0.0000% Matches word count n/a
neededCleanfalse
words253
99.8076% Dialogue to Total Word Ratio
Ratio: 92.09%, Deviation: 2.09%
neededCleanfalse
wordsTotal253
wordsDialogue233
49.9038%