NC Bench
Overview
About
Tests
Models
Model
MoonshotAI: Kimi K2.5
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYYK0SATJB13XEYAJXCHHZQ
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her ski mask in the rearview mirror of the dented minivan, checking her reflection with the scrutiny of a jewelry appraiser. The vehicle smelled like stale french fries and regret, parked three blocks from the suburban house where helium balloons bobbed against a gray sky. Through the windows, they could hear the shriek of sugar-fueled children playing musical chairs. She pulled a crumpled flyer from her coat pocket featuring a Spider-Man clipart and a warning about peanut allergies. Bill sat in the passenger seat attempting to fit a crowbar into a gift bag decorated with cartoon dinosaurs, his clown nose honking softly every time he shifted. He wore a trench coat over striped pajamas, looking less like a master thief and more like a sleepwalking accountant. His fingers, stained with ink from previous heists, fumbled with the glittery handles until the bag ripped, spilling tissue paper everywhere. The red nose honked again as he sighed, a sound of pure clown depression. "You sure about this?" Bill asked, adjusting the red nose that had seemed hilarious during the planning phase but now felt like a war crime against dignity. "Feels beneath us. We're art thieves, Molly. We stole that forgery from the Met." "Beneath us?" Molly snatched the invitation back. "Do you know how much a PS5 costs? Little Tommy's getting three of them. We hit this party, we're clearing five grand easy. That's three months' rent, Bill. Three months." "But it's a bounce house, Mol. We're stealing from a bounce house. There are children wearing party hats. With elastic chin straps and everything." "We're stealing from the gift table while the bounce house distracts the witnesses. It's perfect. Low security, high emotional chaos, parents too busy preventing sugar comas to notice us. Besides, he'll get more presents by Tuesday." Bill honked his nose against the window. "What about the clown? I saw him juggling in the driveway." "What about him?" "Real clown. Professional. He's got the big shoes. Could be competition." "We go in as the cake people. I've got the clipboard. We grab the loot during the piñata—everyone's blindfolded, swinging sticks around. If he gets wise, we bribe him." Molly dug through a duffel bag, retrieving two white catering jackets that smelled like mothballs. She tossed one onto Bill's lap. "And if the birthday boy sees us?" Bill asked, trying to button the jacket over his ridiculous nose. "We tell him we're Santa's cleanup crew." She pulled out two badges she'd laminated that morning, edges sealed with glitter glue. "See? Official." Bill studied his badge. "This says 'North Pole Security.'" "It's thematic. Now stuff the gift bags under your shirt—we're going in hot. Remember, cry if anyone asks questions. Adults ignore crying caterers." "I feel bad about the Legos," Bill muttered, adjusting his padded stomach. "Kids love Legos. I loved Legos." "Then buy yourself the Death Star afterward. Focus, Bill. This is the big leagues." "Of children's entertainment theft." "Exactly." Molly pulled the ski mask down. "Now grab the present sack. And lose the nose before someone thinks you're part of the entertainment."
created_at
"2026-02-08T15:44:50.031186+00:00"
Sunday, February 8th, 2026 at 3:44:50 PM GMT+00:00
model
"moonshotai/kimi-k2.5-0127"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
922
moderation_latency
(null)
generation_time
249187
tokens_prompt
61
tokens_completion
12964
native_tokens_prompt
68
native_tokens_completion
12853
native_tokens_completion_images
(null)
native_tokens_reasoning
13610
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0385998
router
(null)
provider_responses
0
id
"chatcmpl-6988ae78d2d18fb693321e5c"
status
200
is_byok
false
latency
922
endpoint_id
"49762299-66fd-4514-b0ee-c9e11e231e3b"
provider_name
"Moonshot AI"
model_permaslug
"moonshotai/kimi-k2.5-0127"
api_type
"completions"
id
"gen-1770565239-SQYAMM1LVYAOKqRGw875"
upstream_id
"chatcmpl-6988ae78d2d18fb693321e5c"
total_cost
0.0385998
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Moonshot AI"
Evaluation details
Result
Evaluator
Details
Meta Data
20.1897%
Matches word count
n/a
neededClean
false
words
520
99.2631%
Dialogue to Total Word Ratio
Ratio: 49.62%, Deviation: 0.38%
neededClean
false
wordsTotal
520
wordsDialogue
258
59.7264%