NC Bench
Model
MoonshotAI: Kimi K2.5
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYY13D0YZZJBCHT0MEX323Y
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, watching the bounce house sway in the suburban breeze like a giant, garish lung. The vinyl seats stuck to her thighs despite the air conditioning. The sounds of screaming children carried across three lawns, sharp and piercing. "Target is loaded," she said. "I count twelve gift bags, four bicycles, and a Power Wheels Jeep in bubble wrap." Bill tightened his clip-on tie in the passenger seat, sweating slightly in the unmarked white van. The unmarked white van smelled of french fries and regret. He squinted at the pastel chaos unfolding two houses down. "The parents look sticky," he said. "Are we sure about the window?" "That's why we wait for the piñata," Molly whispered, her voice dropping to a rasp though they were three houses down in a van with "Catering" stenciled crookedly on the side. "When they bring out the bat, chaos reigns. Sugar-addled minors scatter like buckshot. We enter as clown caterers." "Caterers?" "Specifically, catering clowns. Look professional but terrifying." Bill shifted in his seat, knee bumping the dash. His polyester pants squeaked against the vinyl. "No face paint after Brussels," he said. "I still have the rash." "That was greasepaint," Molly said, waving dismissively. She held up a glitter-covered clipboard. "This is water-based. Hypoallergenic. Floor plan: kitchen, through the living room, past the screaming circle—" "Is that a dance move?" "It's a literal circle of screaming children. We move fast, keep low, and avoid eye contact with the birthday girl. Her aunt has cop instincts and an iPhone." Bill checked his duffel bag, frowning. His bag held duct tape and a strobe light. "What about the dog?" he asked. "Muzzled by the magician at two o'clock," Molly said. "He's disappearing the Labradoodle. Classic misdirection." "You've thought of everything." "Except the glitter," Molly said, brushing her sleeve. "There's always glitter." She snapped on white gloves with theatrical flair. 
"Remember the code names. I'm 'Unicorn Princess,' you're 'Frosty the Snowman.'" "We're in July." "Exactly. No one expects Frosty in July." She handed him a baking tray covered in foil. "Cookies with melatonin. Not poison, just nap-inducing. We create a perimeter of sleeping toddlers, retrieve the loot, exfiltrate through the side gate." Bill stared at the cookies with horror. Sprinkles winked like sugary threats. "You want me to roofie children?" he asked. "Encourage natural circadian rhythms. It's homeopathic." "I thought we had standards." "We do," Molly said, pulling on a latex glove with a snap. "That's why we wear gloves. These presents won't steal themselves. That Jeep retails for four hundred." "Fine," Bill said. "But if I step in cake, you're buying me new shoes." "Deal," Molly said. She spoke into her wristwatch, her eyes steely. Static crackled. "Team Unicorn to Base." "Base is a bounce house," Bill muttered under his breath. "Base is wherever I say it is," Molly replied. She opened the van door, letting in humidity and party horn noise. "Initiating Operation Goodnight Moon. If someone asks why you're taking presents, say you're with 'Gift Maintenance.'" "That doesn't sound real," Bill said. "Nothing about this is real, Bill. That's why it's perfect." They stepped into the sunlight, clip-on ties gleaming, ready to liberate a seven-year-old. The grass was suspiciously green, the kind of green that only came from chemical warfare.
created_at
"2026-02-08T15:33:40.813746+00:00"
Sunday, February 8th, 2026 at 3:33:40 PM GMT+00:00
model
"moonshotai/kimi-k2.5-0127"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
460
moderation_latency
(null)
generation_time
157797
tokens_prompt
61
tokens_completion
8833
native_tokens_prompt
87
native_tokens_completion
8774
native_tokens_completion_images
(null)
native_tokens_reasoning
8290
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0246107
router
(null)
provider_responses
[0]
  status
  (null)
  is_byok
  false
  latency
  10077
  endpoint_id
  "c884a526-d529-4263-bd90-a58ea25e99d1"
  provider_name
  "Venice"
  model_permaslug
  "moonshotai/kimi-k2.5-0127"
[1]
  id
  "oWZjhj1-3QRW2y-9cac2bf4f1bd138a"
  status
  200
  is_byok
  false
  latency
  460
  endpoint_id
  "3438c28e-ebcb-4f46-8bab-f133912d5297"
  provider_name
  "Together"
  model_permaslug
  "moonshotai/kimi-k2.5-0127"
api_type
"completions"
id
"gen-1770564652-PPSbb7ArujHJEFaB2Oe2"
upstream_id
"oWZjhj1-3QRW2y-9cac2bf4f1bd138a"
total_cost
0.0246107
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 555
0.0000% | Dialogue to Total Word Ratio | Ratio: 52.25%, Deviation: 17.75% | neededClean: false; wordsTotal: 555; wordsDialogue: 290
0.0000%
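The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator follow directly from the two word counts (290 dialogue words out of 555 total, against the 70% target in the scenario). The sketch below reproduces that arithmetic and shows one plausible way such counts could be obtained from a scene. It is an illustration only: the page does not show the evaluator's code, so the function names, the whitespace-based word splitting, and the rule that "dialogue" means text inside double quotes are all assumptions.

```python
import re

# Assumption: dialogue is any text between straight or curly double quotes.
# The benchmark's actual quote handling is not shown on the page.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text: str) -> int:
    """Count whitespace-separated words (assumed tokenization)."""
    return len(text.split())


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.70) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from target) as percentages."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return round(ratio * 100, 2), round(abs(target - ratio) * 100, 2)


def dialogue_stats(text: str, target: float = 0.70) -> dict:
    """Hypothetical helper: derive the word counts from a scene, then score it."""
    total = count_words(text)
    dialogue = sum(
        count_words(m.group(1) or m.group(2) or "")
        for m in QUOTED.finditer(text)
    )
    ratio, deviation = ratio_and_deviation(dialogue, total, target)
    return {
        "wordsTotal": total,
        "wordsDialogue": dialogue,
        "ratio": ratio,
        "deviation": deviation,
    }


# Counts taken from the Meta Data column of this run.
print(ratio_and_deviation(290, 555))  # (52.25, 17.75)
```

Counting this way on the response text may not give exactly 555 and 290, since the evaluator's own tokenization could differ. The final line only confirms that the reported percentages are consistent with the reported counts.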