NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ019K5ZSY1N1AJMHEC6S4K6
Run Details
Model:
MoonshotAI: Kimi K2.6
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 45s
Cost:
$0.049765
Provider:
Cloudflare
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
10.10%
Dialogue to Total Word Ratio
(details)
55.0413%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly lowered her coffee and stared through the windshield at the ranch house. Streamers hung from the mailbox like festive crime scene tape. "Let me get this straight. You want us to crash a children's birthday party." "Not crash. Infiltrate. There's a difference." "Enlighten me." "Crashing implies we weren't invited. Infiltrating implies we weren't invited professionally. I thought you went to college." Bill spread a glitter-covered blueprint across the dashboard and tapped a crayon drawing of the backyard. "Look, the target is a five-year-old named Tyler. His mom hired a bounce house, a face painter, and a guy dressed as Captain Galaxy. Security's tight." "Bill. It's not a casino. It's a patio with a sandbox." "Exactly why they won't see us coming. Classic misdirection." Bill pulled two crumpled party hats from his trench coat. "Disguises." "We're forty." "Forty-year-old guests. Very sad, very believable. We walk in with a gift—" "We're stealing gifts, not bringing them." "Right, but we bring one to get past the perimeter. It's a Trojan horse situation. I filled a box with rocks so it rattles like a LEGO set." Molly stared at him until her eye twitched. "You want me to walk into a stranger's backyard wearing a pointy hat and a box of rocks." "And a smile. Kids can sense fear. Also, don't make eye contact with the dog." Bill checked his watch as an ice cream truck rolled past. "The cake drops at three. That's when the parents get distracted by sugar-fueled chaos. That's our window." "What's the exit strategy?" "Balloon animals." "I'm sorry?" "I've been practicing for six weeks. I pose as the backup entertainer, create a diversion near the pinata, and you sweep the gift table into garbage bags. I've got it timed to the second." Bill produced a mangled latex poodle and waved it weakly. "Garbage bags? Really?" "Black, unscented industrial grade. I'm not an animal, Molly." "What if someone asks why we're taking thirty wrapped presents to a Honda Civic in broad daylight?" "We say it's for charity. Operation: Birthday Hope. Very official. I made a logo and everything." She sighed. "And if Tyler cries? Actual tears." Bill looked wounded and clutched his heart. "We leave him the card and a juice box. I'm a thief, not a monster. Besides, he's getting a bounce house. Bounce houses are the real gift." He adjusted his party hat so the elastic dug into his chin. "One hour. In and out. Clean. Professional. No fingerprints, no face paint, no witnesses." "You're paying for therapy if I see a clown. Full coverage." "Deal. But if the clown sees you first, use the rocks. Don't hesitate." "Which rocks? The big ones?" "The ones in the box. Aim for the oversized shoes. Then run and don't look back." Molly picked up the garbage bags. "I miss robbing banks. The dignity. The alarms. The respect." Bill tied his sneaker with grim intensity. "Those days are gone, Molly. Welcome to the big leagues. Now smile." Molly opened the car door.
created_at
"2026-04-24T15:21:21.866Z"
4/24/2026, 3:21:21 PM
model
"moonshotai/kimi-k2.6-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
350
moderation_latency
(null)
generation_time
164246
tokens_prompt
77
tokens_completion
11305
native_tokens_prompt
69
native_tokens_completion
12426
native_tokens_completion_images
(null)
native_tokens_reasoning
10488
native_tokens_cached
6
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.04976481
router
(null)
provider_responses
0
endpoint_id
"85e7870f-601c-4891-98ab-f7ac2ed4d718"
id
"id-1777044082218"
is_byok
false
latency
350
model_permaslug
"moonshotai/kimi-k2.6-20260420"
provider_name
"Cloudflare"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777044081-zZ8EyMLMEY1TxMerGINj"
session_id
(null)
api_type
"completions"
id
"gen-1777044081-ODPAn4QPglqdS6cgRK4C"
upstream_id
"id-1777044082218"
total_cost
0.04976481
cache_discount
0.00000474
upstream_inference_cost
0
provider_name
"Cloudflare"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
502
10.10%
Dialogue to Total Word Ratio
Ratio: 76.77%, Deviation: 6.77%
neededClean
false
wordsTotal
508
wordsDialogue
390
55.0413%