Run: 2026-02-20-01KHYB13SB7RGHA8EGTZY36681

Run Details

Duration: 2m 1s
Cost: $0.010685
Provider: Seed

Evaluation summary

Score Evaluator
81.27% Matches word count
0.00% Dialogue to Total Word Ratio
40.6364% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly leans against a park bench, crumpling a birthday flier in her fist. You sure this kids’ birthday bash is the score? Dead sure—mom’s a tech tycoon, presents are designer toys, luxury gadgets hidden in gift wrap. How we get past the front gate? I got a clown costume—you’ll be my balloon-animal sidekick, skip the gate via backyard fence. Balloon animals? I don’t do lopsided poodles for brats. Fake it—twist one dog, no one’s auditing your skills. What about the bouncer by the gift table? He’s got a sweet tooth—slip him a lollipop spiked with sleepy syrup, he’ll nap an hour. Sleepy syrup? You stole that from your grandma’s vet visit? Don’t ask questions—focus on the gift table by the pinata. Pinata’s a hazard—kids’ll swarm when it breaks. Timing’s key—we hit the table before the pinata drop, stuff presents into my clown bag. What if a kid catches us? Distract ’em with a balloon sword—they’ll forget everything. You owe me a better score after this. Deal—once we flip these, we’re off to the beach with piña coladas. Fine, but if I twist one more balloon, I’m hiding your clown nose in a diaper pail. Fair—don’t let a toddler yank your hair.
created_at: "2026-02-20T20:14:31.734Z"
model: "bytedance-seed/seed-1.6-20250625"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 643
moderation_latency: (null)
generation_time: 120921
tokens_prompt: 77
tokens_completion: 4018
native_tokens_prompt: 147
native_tokens_completion: 5324
native_tokens_completion_images: (null)
native_tokens_reasoning: 5023
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.01068475
router: (null)
provider_responses
0
endpoint_id: "d85c3862-4e0d-4f8d-955b-babc62f98124"
id: "02177161847187665da4203214fbe81b3e8151fb2922fb1918a4c"
is_byok: false
latency: 643
model_permaslug: "bytedance-seed/seed-1.6-20250625"
provider_name: "Seed"
status: 200
api_type: "completions"
id: "gen-1771618471-rrDjVEOwKk3hSn1K8yIo"
upstream_id: "02177161847187665da4203214fbe81b3e8151fb2922fb1918a4c"
total_cost: 0.01068475
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result Evaluator Details Meta Data
81.27% Matches word count n/a
neededClean: false
words: 212
0.00% Dialogue to Total Word Ratio
Ratio: 2.83%, Deviation: 87.17%
neededClean: false
wordsTotal: 212
wordsDialogue: 6
40.6364%
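
The dialogue figures above can be checked by hand. The following is a minimal sketch, not the evaluator's actual code: the function names are hypothetical, the 90% target is taken from the prompt, and treating the overall score as the plain mean of the two evaluator scores is an assumption that happens to agree with the reported numbers. How the 81.27% word-count score is derived from 212 words is not stated in the report, so it is used here only as an input.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float) -> float:
    """Absolute gap between the measured ratio and the requested one."""
    return abs(target_pct - ratio_pct)


def overall_score(scores_pct: list[float]) -> float:
    """Assumption: the overall score is the unweighted mean of evaluator scores."""
    return sum(scores_pct) / len(scores_pct)


# Values copied from the report: wordsDialogue = 6, wordsTotal = 212.
ratio = dialogue_ratio(6, 212)
# The prompt asked for 90% dialogue.
deviation = deviation_from_target(ratio, 90.0)
# Displayed (rounded) evaluator scores from the summary.
overall = overall_score([81.27, 0.0])

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Overall (from rounded scores): {overall:.4f}%")
```

The ratio and deviation reproduce the reported 2.83% and 87.17%. The mean computed from the rounded 81.27% comes out slightly below the reported 40.6364%, which would be consistent with the report averaging unrounded scores.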