NC Bench
Overview
About
Tests
Models
Model
MoonshotAI: Kimi K2.5
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYZV5KR7YB2BC9MQTE27RF2
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, squinting through the frosting-smeared windshield. "You're telling me the target is a bouncy castle?" "Not the castle itself," Bill said, unfolding a map drawn in crayon. "The presents. They're stacked by the picnic table next to the clown. I've got intel that the birthday boy's grandparents went overboard. We're talking three PlayStations, Molly." "There's only one of you, Bill." "Exactly. That's why we need the diversion." She tapped the diagram where Bill had drawn a stick figure with exaggerated tears. "Your crying routine." Bill shifted, tugging at his too-small overalls. "I still don't see why I have to be the lost child. I'm thirty-four." "You're emotionally stunted, it works. You go in, throw a tantrum about dropped ice cream, and while the parents negotiate with your 'grief,' I rappel down from the oak tree." Bill stared at the tree. It was eight feet tall. "Rappel." "I've got three feet of rope and determination." "That's called falling." Molly checked her utility belt. "Details. The important thing is the goodie bags. I want the goodie bags." "Those contain stickers and juice boxes." "Organic juice boxes, Bill. Six dollars retail." He sighed, examining his prop ice cream cone, carved from soap. "What if they call the cops?" "Deploy the nuclear option." She handed him a whoopee cushion. "Blame the clown. Classic misdirection." "You're a monster." "I'm a professional. Now, code words. If I say 'the piñata is broken,' abort." "What if the piñata actually breaks?" "Grab the candy and run. That's just good business." Bill peered at the five-year-olds. "I see the target. The presents use dinosaur paper." "Raptors?" "T-Rex. Mint condition." He paused. "Molly, are we bad people? We're stealing from children." "We're redistributing wealth from those who can't appreciate it. Does Timmy need four gaming consoles? No. Do I need rent money? Yes." "He's turning six." "I'm turning thirty with an eviction notice. Look, you want out? Fine. But I'm taking the cake." Bill's eyes widened. "There's cake?" "Three tiers. Fondant." "I'm in." "Good. Wipe your nose. You're supposed to be sad, not suspicious." Bill smeared soap across his cheek. "How do I look?" "Like a vict—why are you wearing brass knuckles?" "For intimidation." "You're supposed to be six!" "Six and tough." Molly buried her face in her hands. "Just look vulnerable. If the magician asks you to volunteer, say no. I don't need you disappearing mid-heist." "Yes ma'am." "Let's go steal some Legos."
created_at
"2026-02-08T16:03:26.088704+00:00"
Sunday, February 8th, 2026 at 4:03:26 PM GMT+00:00
model
"moonshotai/kimi-k2.5-0127"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
758
moderation_latency
(null)
generation_time
50476
tokens_prompt
61
tokens_completion
2636
native_tokens_prompt
68
native_tokens_completion
2653
native_tokens_completion_images
(null)
native_tokens_reasoning
2238
native_tokens_cached
68
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0079658
router
(null)
provider_responses
0
id
"chatcmpl-6988b39b56fd1c8a49a41337"
status
200
is_byok
false
latency
758
endpoint_id
"49762299-66fd-4514-b0ee-c9e11e231e3b"
provider_name
"Moonshot AI"
model_permaslug
"moonshotai/kimi-k2.5-0127"
api_type
"completions"
id
"gen-1770566555-tw9kv6qpjE3w0FfXFAYc"
upstream_id
"chatcmpl-6988b39b56fd1c8a49a41337"
total_cost
0.0079658
cache_discount
0.000034
upstream_inference_cost
0
provider_name
"Moonshot AI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
415
11.1300%
Dialogue to Total Word Ratio
Ratio: 76.63%, Deviation: 6.63%
neededClean
false
wordsTotal
415
wordsDialogue
318
5.5650%