NC Bench
Overview
About
Tests
Models
Model
MoonshotAI: Kimi K2.5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2026-02-08-01KGYYX5YYFJWWNQSHZC7Y64YN
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"You're telling me we scale the fence during musical chairs?" Bill asked, sweating through his ski mask. "Exactly when the parents are distracted by the impending sugar crash," Molly said. "Timmy's turning six. That means piñata protocol initiates at 3:15 sharp." "Piñata protocol?" "Blindfolded chaos, Bill. Complete sensory overload. We slip through the back while twenty toddlers are beating a cardboard unicorn with a bat." "What about the clown?" "Already handled. I swapped his balloon pump with a mild veterinary tranquilizer." "That's dark." "It's ketamine, Bill, not cyanide. He'll wake up in three hours thinking he's a very flexible giraffe." Molly unrolled a crayon map. "The presents are stacked by the cake. We grab, we go, nobody cries." "Kids always cry." "Not if we leave convincing decoys." She held up two wrapped boxes. "These contain steamed broccoli and remedial math worksheets." "You're a monster." "I'm a professional. Now stuff these juice boxes in your pockets—we're going in as caterers." "Caterers?" "Nobody questions the person holding the Capri Suns, Bill. Nobody." "What about the bouncy castle?" "Deflated. I poked it with a salad fork during the tour." "You brought a fork?" "I'm always prepared. That's why I'm the boss." "You're not the boss." "I have the crayon floor plan, Bill. I'm definitely the boss."
created_at
"2026-02-08T15:47:26.475636+00:00"
Sunday, February 8th, 2026 at 3:47:26 PM GMT+00:00
model
"moonshotai/kimi-k2.5-0127"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1289
moderation_latency
(null)
generation_time
73126
tokens_prompt
61
tokens_completion
3344
native_tokens_prompt
68
native_tokens_completion
3332
native_tokens_completion_images
(null)
native_tokens_reasoning
3027
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0100368
router
(null)
provider_responses
0
id
"chatcmpl-6988afc5bb12c86e1a93b28d"
status
200
is_byok
false
latency
1289
endpoint_id
"49762299-66fd-4514-b0ee-c9e11e231e3b"
provider_name
"Moonshot AI"
model_permaslug
"moonshotai/kimi-k2.5-0127"
api_type
"completions"
id
"gen-1770565572-7mVN3P2wVKHL83qnvsSL"
upstream_id
"chatcmpl-6988afc5bb12c86e1a93b28d"
total_cost
0.0100368
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Moonshot AI"
Evaluation details
Result
Evaluator
Details
Meta Data
68.1022%
Matches word count
n/a
neededClean
false
words
214
99.9982%
Dialogue to Total Word Ratio
Ratio: 90.65%, Deviation: 0.65%
neededClean
false
wordsTotal
214
wordsDialogue
194
84.0502%