NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ011GD0Q7S7RWXPCYX8XVT7
Run Details
Model:
MoonshotAI: Kimi K2.6
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
10m 53s
Cost:
$0.075133
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
13.56%
Dialogue to Total Word Ratio
(details)
54.7756%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slapped a glitter-stained invitation onto the steering wheel and grinned. "Tell me again why we're heisting a child." Bill slumped lower in the passenger seat. "Not a child," Molly said, adjusting her fake beard in the rearview mirror. "A target. Little Timmy Van der Merwe. Daddy invented the app that puts hats on cats. These presents are retailing at ten grand easy." "Yeah, but there's gonna be parents." Bill pointed at the crayon drawing she was using as a blueprint. "And a clown. Last week we cased a casino. Now I'm wearing a hairnet near a ball pit." "Two clowns." Molly held up two hairnets and a tray of gray deviled eggs. "Which is why we're catering. Nobody interrogates the egg lady. We are invisible." "We look like meth dealers at a baptism." "Exactly. No one makes eye contact." She unrolled the napkin to reveal a bouncy castle. "We enter through the kitchen, scope the gift table, and wait for the piñata." "Then what? What about the aunt? Every party's got a purse-watching aunt." "I've paid the pony handler to sneeze on the cake." Molly strapped a fanny pack around her waist. "Pony sneezes, mom screams, forty children become a riot. The aunt flees. We bag the Legos and slip out through the hedge maze." "There's a hedge maze." It wasn't a question. "Rich people maze, Bill. Stay focused." She tossed him an apron that read Kiss the Cook. "You're the ice sculptor." "I don't sculpt ice." "You chip at a block and look tortured. It's performance art." Molly pulled out garbage bags. "While you're finding your muse, I'll swap the real gifts with sand-filled decoys. Same weight, no witnesses. The sand is from their own sandbox. It's poetic." "What if someone checks the boxes?" "That's why we use gift wrap. No one unwraps at the party. By the time Timmy opens a box of wet sand, we'll be emotionally distant." Bill sighed. "I did six months for wire fraud, Molly. Actual federal time. My cellmate was a forger. He'd weep if he could see me now shoplifting Optimus Prime from a bounce house." "Liberating," she corrected. "And he's vintage. Now grab the sausages." "Why sausages? I thought the dog was the lookout." "The dog is a variable. The sausages are a bribe." Molly started the engine. "There's always a dog. Also, take these." She handed him a bottle of bubbles. "If the balloon guy gets suspicious, blow in his face. It's disorienting." "What if he blows back?" "Then you run. Don't overthink it." "We need a safeword." "Safeword is piñata. If you yell it, I set off the smoke machine." "You brought a smoke machine?" "It's in the glovebox. Don't turn it on early again." Bill eyed the glovebox. "You're insane. This is why your therapist ghosted you." "I hate this job." "You love it." "I'm a professional." She checked her watch. "Pony's on a strict sneezing schedule. Move. We miss the window, we're doing a christening." He didn't argue. Bill fastened his seatbelt and groaned.
created_at
"2026-04-24T15:16:56.876Z"
4/24/2026, 3:16:56 PM
model
"moonshotai/kimi-k2.6-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1233
moderation_latency
(null)
generation_time
652696
tokens_prompt
77
tokens_completion
16294
native_tokens_prompt
68
native_tokens_completion
18767
native_tokens_completion_images
(null)
native_tokens_reasoning
15513
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0751326
router
(null)
provider_responses
0
endpoint_id
"af0285b7-6e2d-44e3-b867-c23c6c18e8cb"
id
"f504cbb2249b9feba31917be44ff6084"
is_byok
false
latency
1233
model_permaslug
"moonshotai/kimi-k2.6-20260420"
provider_name
"Novita"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777043816-5VjZYyl8yExJYkTQ2jUQ"
session_id
(null)
api_type
"completions"
id
"gen-1777043816-6Y6SBwJU1sDfMOkXWBua"
upstream_id
"f504cbb2249b9feba31917be44ff6084"
total_cost
0.0751326
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
508
13.56%
Dialogue to Total Word Ratio
Ratio: 76.32%, Deviation: 6.32%
neededClean
false
wordsTotal
511
wordsDialogue
390
54.7756%