NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ05VP9D47DK3K6VVXDD3QCG
Run Details
Model:
MoonshotAI: Kimi K2.6
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
10m 56s
Cost:
$0.047975
Provider:
Phala
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
99.97%
Dialogue to Total Word Ratio
(details)
99.9761%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill tightened his balaclava in the rearview mirror three blocks too early, his breath fogging the glass. He did this before every job, as if the suburban cul-de-sac required the same tactical mindset as a bank vault in Zurich. Beside him, Molly studied the blueprint—a crayon drawing of a house shaped like a potato, taped to a juice box. They had lifted it from the target's refrigerator during reconnaissance the previous Tuesday, right after Bill had eaten half a string cheese and left a trail of cracker crumbs that Molly insisted would compromise the operation. "We go in fast," Bill said. "I want that scooter and the Nerf guns." Molly didn't look up. "It's a child's party. Parents. Cake. A clown." "I've prepared." Bill patted his satchel, which contained three water balloons and a whoopee cushion he had deemed essential tactical gear. "Distraction is crucial. Once the adults herd toddlers toward the entertainment, we hit the gifts." They sat in a van decaled with Franklin's Pet Grooming, a front Molly found absurd given Bill's crippling allergies to anything with fur. Through the windshield, a bounce castle wobbled in the breeze like a drunk marshmallow. A woman in yoga pants carried a piñata shaped like a dinosaur toward the garage. Molly traced the crayon lines with her finger. "The invite said presents go straight to the designated table. Apparently, Tyler's mom runs a tight ship." Bill's eyes gleamed with the same intensity he'd once used while stealing the Mona Lisa's security schedule. "Systems are made to be breached. Two minutes through the back gate. I disabled the safety latch with a juice box straw. Then we blend." "Blend how?" He produced two hats from beneath his seat. One read Party Captain. The other read Balloon Artisan. They shimmered with assaultive levels of glitter. Molly felt her soul leave her body. "We're thirty-four." "And about to abscond with forty grand in plastic treasure," Bill shot back. "The Lego alone will fence for—" "Lego doesn't fence," Molly interrupted. "Market crashed after the Denmark job." Bill looked genuinely wounded. "Fine. Gift cards, cash envelopes, electronics in my pack. Perishables in yours." "Perishables?" "The cake, Molly. Four tiers of fondant. Client in Reno is very interested." Molly watched a child in a superhero cape crash into a rosebush and immediately get up screaming with joy. She wondered when her life had taken this turn. Last year she had been in Macau lifting diamonds. Now she was casing a bouncy house because Bill had already spent his advance on tactical earmuffs shaped like puppy ears. "Three minutes," Bill said, checking a watch that chirped the opening notes of Baby Shark. "If anyone asks, we're the advanced party team." "Advanced party team." "Trust the plan." He adjusted his Balloon Artisan hat. Molly grabbed the gift bag they had pre-packed with coal, which felt on-brand. She checked her pockets for the juice box straw. It was, she had to admit, a pretty good latch pick.
created_at
"2026-04-24T16:48:52.559Z"
4/24/2026, 4:48:52 PM
model
"moonshotai/kimi-k2.6-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1506
moderation_latency
(null)
generation_time
192548
tokens_prompt
77
tokens_completion
9147
native_tokens_prompt
69
native_tokens_completion
10413
native_tokens_completion_images
(null)
native_tokens_reasoning
8362
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.04797501
router
(null)
provider_responses
0
endpoint_id
"bb5de010-8234-4abb-b511-025d1817134a"
id
"chatcmpl-9bcd85cdb3b3daaa"
is_byok
false
latency
1506
model_permaslug
"moonshotai/kimi-k2.6-20260420"
provider_name
"Phala"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777049332-ZLmJqw0eiN6b4Gs98QBP"
session_id
(null)
api_type
"completions"
id
"gen-1777049332-EGROzHznk2cuFHWEy5TZ"
upstream_id
"chatcmpl-9bcd85cdb3b3daaa"
total_cost
0.04797501
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Phala"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
498
99.97%
Dialogue to Total Word Ratio
Ratio: 30.08%, Deviation: 0.08%
neededClean
false
wordsTotal
502
wordsDialogue
151
99.9761%