Run: 2026-02-20-01KHYA4287T24YPXP6D1EJZPBQ

Run Details

Duration:
52.6s
Cost:
$0.004489
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
68.10% Matches word count (details)
99.79% Dialogue to Total Word Ratio (details)
83.9445% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly picks at a lollipop pilfered from a corner store, squinting at the community center’s neon “Teddy Bear Tea Party” sign strung above its entrance. The hum of balloon pumps drifts over, mingling with the clink of plastic teacups. Bill fidgets with a crumpled floor plan he’d sketched after sneaking a peek through a back window—marked Xs for present stacks, a wobbly arrow for the fire escape exit. He tucks a rubber chicken (his go-to distraction) into his jacket pocket, its squawk muted by a tissue. Bill leans in: “Wait—will the bubble machine block our back exit dash?” Molly tosses her lollipop stick into a nearby bush, tapping the floor plan’s escape arrow with a chipped nail. “Only if we forget to unplug it before grabbing gifts.” She pulls a tiny pair of safety scissors from her pocket, intended to snip through the ribbon tying the present bin shut. A group of costumed kids bursts out the door, chasing a stray balloon, and both duck behind a park bench leg. Bill’s rubber chicken slips, emitting a faint squawk, and they freeze until the kids’ laughter fades. Molly counts the present stacks again in her head, mentally mapping the best path to avoid the party’s harried volunteer chaperones she’d spotted earlier while scouting.
created_at"2026-02-20T19:58:39.89Z" 2/20/2026, 7:58:39 PM
model"bytedance-seed/seed-1.6-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1330
moderation_latency(null)
generation_time52555
tokens_prompt77
tokens_completion1617
native_tokens_prompt147
native_tokens_completion2226
native_tokens_completion_images(null)
native_tokens_reasoning1936
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00448875
router(null)
provider_responses
0
endpoint_id"d85c3862-4e0d-4f8d-955b-babc62f98124"
id"021771617520301ef9a07adf34cf0fac1320e81624057e0ee40a1"
is_byokfalse
latency1330
model_permaslug"bytedance-seed/seed-1.6-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771617519-FkQA1jfFdr2o301It2pb"
upstream_id"021771617520301ef9a07adf34cf0fac1320e81624057e0ee40a1"
total_cost0.00448875
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
68.10% Matches word count n/a
neededCleanfalse
words214
99.79% Dialogue to Total Word Ratio
Ratio: 12.15%, Deviation: 2.15%
neededCleanfalse
wordsTotal214
wordsDialogue26
83.9445%