NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0474PHB7RTDD8AD761G30M
Run Details
Model:
MoonshotAI: Kimi K2.6
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7m 16s
Cost:
$0.054226
Provider:
Venice
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
89.23%
Dialogue to Total Word Ratio
(details)
46.4282%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her binoculars to her nose, watching a five-year-old rip into a box containing a stuffed T-Rex. The backyard was a riot of balloons and a folding table groaning under wrapped boxes. She turned to her partner, who was studying a laminated map like they were storming Fort Knox. "Look at that pile, Bill. We're talking sixty, maybe seventy gifts. It's a goldmine wrapped in primary colors." "It's a five-year-old's birthday party, Molly. I feel genuinely weird about this." "Weird? We stole the Stanley Cup last month." "Exactly. We had standards. We had a reputation. Now we're hitting a bouncy castle?" "First of all, that castle is a tactical nightmare—no cover, constant motion. Second, the parents rented a pony. Do you know what pony rental costs? These people are loaded." Bill scratched his chin, eyes on the inflatable castle where children bounced like a shaken soda can. He sighed. "What if the kids see us? We can't traumatize small children." "We wear masks. Nice ones. Friendly animal masks. We'll look like entertainment that got lost." "And what about the parents?" "Distracted. Dad's doing magic tricks. Mom's guarding a piñata. We go in through the side gate during the cha-cha slide." "What about security?" "A balloon arch. Maybe a grandmother with a phone." Molly spread the invitation across the dashboard, pointing at the pirate map that marked the gift table with an X of glitter glue. She circled the refreshments twice. "We definitely need a code word." "Cake time." "Too obvious. How about 'the clown is sober'?" "Fine. When I say 'the clown is sober,' you grab the gift table." "Do we take the gift bags too, or just the boxes?" "Bags have clothes and books. We want the boxes. The rectangular ones that rattle. Those are the electronics." "What if a kid is actually holding a present?" "We don't mug children, Bill. We have a code." "So we only steal unattended loot. Like pirates, but polite." "Exactly. Polite pirates. We stack them on the wagon. I've greased the wagon wheels with cake frosting so they're silent." Bill paused over a bakery box smelling of buttercream. He pulled out a raccoon mask and sniffed it. "Wait, you stole cake frosting?" "I bought it. I'm a monster, not a savage. Now put on your mask." "Tell me, is this a fox or a bear?" "It's supposed to be a raccoon. We're raccoons. Because of the trash and thievery." "This mask smells strongly like vanilla." "I washed it. Again, standards. Now remember: in, stack, out. No face-planting in the ball pit, not this time." "That was one time. And I found a real watch." "It was a plastic Frozen watch. Move out." Bill hesitated, hand on the van door, eyes earnest behind his raccoon disguise. "Wait. Should we sing happy birthday to him first?" "Bill." "Strictly for cover! Not because I like the song." "If you sing, I will absolutely leave you in the bouncy castle. Understood?" "Understood." Molly shoved on her raccoon mask and grabbed the handle of a red wagon. She jerked her thumb toward the gate. "Good. Let's go steal some LEGOs."
created_at
"2026-04-24T16:12:27.226Z"
4/24/2026, 4:12:27 PM
model
"moonshotai/kimi-k2.6-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1260
moderation_latency
(null)
generation_time
435626
tokens_prompt
77
tokens_completion
11247
native_tokens_prompt
69
native_tokens_completion
11638
native_tokens_completion_images
(null)
native_tokens_reasoning
10413
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.054226281
router
(null)
provider_responses
0
endpoint_id
"aa515fc0-facb-44e0-934b-aa0fd1e5e60f"
id
"chatcmpl-bb69b5c06a1e651a"
is_byok
false
latency
1260
model_permaslug
"moonshotai/kimi-k2.6-20260420"
provider_name
"Venice"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777047147-xxt6v9EtvZKCDWyvy1cK"
session_id
(null)
api_type
"completions"
id
"gen-1777047147-deR4Py2AaYZUsRV3qbpI"
upstream_id
"chatcmpl-bb69b5c06a1e651a"
total_cost
0.054226281
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Venice"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
524
89.23%
Dialogue to Total Word Ratio
Ratio: 71.51%, Deviation: 1.51%
neededClean
false
wordsTotal
530
wordsDialogue
379
46.4282%