Run: 2026-02-20-01KHY9S48BJAVSMW16H8Y15GD2

Run Details

Duration: 1m 43s
Cost: $0.009101
Provider: Seed

Evaluation summary

Score      Evaluator                      Details Link
1.04%      Matches word count             (details)
100.00%    Dialogue to Total Word Ratio   (details)
50.5180%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly adjusts a neon clown wig in the dented cargo van, her fingers brushing a crumpled party invitation tucked in her overalls’ pocket. The community center across the street glows with rainbow streamers, a bouncy castle bobbing behind its chain-link fence. Bill leans over a smudged floor plan, his pencil tapping the supply closet where presents are stashed—he’d scouted it yesterday, pretending to fix a leaky faucet. He tucks a tiny crowbar into his sock, then glances at Molly’s nervous tic of twisting her empty clown nose holder. “Do you think cake chaos distracts the harried parents?” Molly says. “Only if we time our balloon pop perfectly at dusk now.” Bill responds. Molly nods, testing a noisemaker that will mimic a fire alarm if their first plan flops. She checks her watch—30 minutes until the gift-opening ceremony. Bill grabs a fake face paint kit, muttering about avoiding a toddler’s sticky grasp that could foil their stealth. The van’s radio sputters a tinny children’s jingle, making Molly wince. She tucks the clown nose on, ready to blend into the party’s rotating staff, while Bill folds the floor plan into a crumpled paper airplane to discard nearby before they move. He double-checks his crowbar’s hiding spot, then gestures toward the center’s back door where a harried teen volunteer slumps mindlessly scrolling a phone.
created_at: "2026-02-20T19:52:41.493Z" (2/20/2026, 7:52:41 PM)
model: "bytedance-seed/seed-1.6-20250625"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 901
moderation_latency: (null)
generation_time: 103160
tokens_prompt: 77
tokens_completion: 3346
native_tokens_prompt: 147
native_tokens_completion: 4532
native_tokens_completion_images: (null)
native_tokens_reasoning: 4231
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00910075
router: (null)
provider_responses:
  0:
    endpoint_id: "d85c3862-4e0d-4f8d-955b-babc62f98124"
    id: "021771617161633b965c2d2b2bb89ed3e2dfeb918ea3f764504f8"
    is_byok: false
    latency: 901
    model_permaslug: "bytedance-seed/seed-1.6-20250625"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1771617161-R0SKC64ZlXfs8aniIa8Q"
upstream_id: "021771617161633b965c2d2b2bb89ed3e2dfeb918ea3f764504f8"
total_cost: 0.00910075
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result     Evaluator                      Details                          Meta Data
1.04%      Matches word count             n/a                              neededClean: false, words: 226
100.00%    Dialogue to Total Word Ratio   Ratio: 9.73%, Deviation: 0.27%   neededClean: false, wordsTotal: 226, wordsDialogue: 22
50.5180%
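
The reported figures can be cross-checked with a little arithmetic. The sketch below is not the evaluator's actual code: the function names are hypothetical, the 10% target is taken from the prompt, and it assumes the overall score is a plain mean of the two evaluator scores, which is consistent with the reported 50.5180% but is not confirmed by the report. The formula behind the 1.04% word count score is not shown in the report and is not reconstructed here.

```python
# Hypothetical helpers for re-deriving the numbers shown in this report.
# These are assumptions for illustration, not the benchmark's implementation.

def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Dialogue words as a percentage of all words."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float) -> float:
    """Absolute distance, in percentage points, from the requested ratio."""
    return abs(target_pct - ratio_pct)


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    return sum(scores) / len(scores)


# Values copied from the Meta Data column: wordsDialogue=22, wordsTotal=226.
ratio = dialogue_ratio(22, 226)
# The prompt asked for 10% dialogue.
deviation = deviation_from_target(ratio, 10.0)
# Rounded evaluator scores as displayed: 1.04% and 100.00%.
overall = mean_score([1.04, 100.0])

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Overall (from rounded scores): {overall:.2f}%")
```

With the rounded inputs the mean comes out at 50.52%, which matches the reported 50.5180% to two decimal places. The small gap suggests the report averages unrounded scores.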