NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ068NWGA9VFH2T02PJC0QM9
Run Details
Model:
MoonshotAI: Kimi K2.6
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4m 14s
Cost:
$0.088123
Provider:
Cloudflare
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
99.81%
Dialogue to Total Word Ratio
(details)
99.8960%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night-vision goggles and stared through the windshield at the suburban ranch house glowing with fairy lights and desperation. A cardboard sign on the lawn read TYLER'S FIFTH BIRTHDAY, and beneath it, someone had drawn a dinosaur holding a balloon that looked suspiciously like a surveillance drone. It was, objectively, the most dangerous target they had ever surveilled, and that included the casino in Macau with the shark tank and infrared lasers. "The target is a five-year-old?" Bill asked from the passenger seat, clicking his retractable lockpick set open and shut like a nervous metronome in the dark. "Tyler got the Mega-Console 3000, four LEGO Death Stars, and a bike with gears, Molly," she said, not looking away from the bouncy castle wobbling in the front yard. "Why don't we just rob a bank like professionals?" "Banks have cameras, Bill. Tyler's mom has a Pinterest addiction and three glasses of rosé in her by two. She's practically inviting us." Bill squinted at a piñata dangling from the oak tree like a paper-mache grenade. "What about the stupid clown?" "The clown is our inside man." "You hired a clown?" "I AM the clown. I went to clown college for six weeks in '09. This is my moment." "You failed out of clown college." "I was expelled for being too scary. Perfect for a heist." Molly pinched the bridge of her nose. She had planned the Dubai diamond job in eighteen hours. She had once rappelled through the ceiling of a Saudi prince's panic room while he slept three feet away. Now she was inventorying juice boxes and calculating the tensile strength of a bounce house. The indignity of her career trajectory stung worse than the cheap tequila shot she'd taken for courage at noon, and possibly worse than the time she'd been tased in Brussels. "Fine. But balloon-animal the getaway route one more time—" "It's a proven mnemonic device!" "It's a poodle. We're driving a poodle." "A poodle with a 4.0 liter V8." Molly sighed and checked her watch. The cake was scheduled for three o'clock, which meant every child and parent would be corralled in the kitchen for the singing of the anthem. That gave them a four-minute window to breach the living room, load the presents into the laundry chute, and exfiltrate through the garage. Bill would create a diversion by pretending his clown shoe was on fire—classic misdirection, he insisted—and Molly would rappel from the second-story balcony because Tyler's father, a man who collected power tools he didn't understand, had left the ladder deployed. She looked at her partner. He was already applying greasepaint with the focus of a Renaissance master painting a particularly doomed saint. A red nose sat on the dash like a detonator waiting for the right thumb. "Remember," she said. "No face. No prints. And if anyone asks, you're Boppo." Bill grinned, snapping his oversized suspenders. "Boppo doesn't talk. Boppo only screams." "That's the spirit," Molly lied, and opened the van door.
created_at
"2026-04-24T16:48:14.758Z"
4/24/2026, 4:48:14 PM
model
"moonshotai/kimi-k2.6-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
592
moderation_latency
(null)
generation_time
253872
tokens_prompt
77
tokens_completion
17540
native_tokens_prompt
69
native_tokens_completion
22015
native_tokens_completion_images
(null)
native_tokens_reasoning
16755
native_tokens_cached
3
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.08812318
router
(null)
provider_responses
0
endpoint_id
"85e7870f-601c-4891-98ab-f7ac2ed4d718"
id
"id-1777049295333"
is_byok
false
latency
592
model_permaslug
"moonshotai/kimi-k2.6-20260420"
provider_name
"Cloudflare"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777049294-8whtUnoJN7Qw10EwJ5hs"
session_id
(null)
api_type
"completions"
id
"gen-1777049294-NKVePgGimP9YHcopMsI5"
upstream_id
"id-1777049295333"
total_cost
0.08812318
cache_discount
0.00000237
upstream_inference_cost
0
provider_name
"Cloudflare"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
502
99.81%
Dialogue to Total Word Ratio
Ratio: 30.20%, Deviation: 0.20%
neededClean
false
wordsTotal
510
wordsDialogue
154
99.8960%