Run: 2026-03-03-01KJTZASR94WXA3XVYXVTEPMS9

Run Details

Duration:
2m 14s
Cost:
$0.002098
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
75.16% Matches word count (details)
99.02% Dialogue to Total Word Ratio (details)
87.0860% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly and Bill huddle behind an overgrown azalea bush at the edge of a tree-lined suburban cul-de-sac, the distant clink of plastic party cups and the high-pitched squawk of a piñata whacking drift over the picket fence. Molly adjusts a frayed baseball cap pulled low over her eyes, jabbing a crumpled sticky note scrawled with “SIDE GATE” at Bill’s chest. “Plan’s simple: slip through the side gate by the doghouse. I bribed the homeowner’s 7-year-old with a sour lollipop to prop the latch open, their golden retriever only barks at mailmen.” Bill fidgets with black latex gloves, eyes darting to the kitchen window where a toddler in a sparkly tulle tutu waves a cake-smeared spoon. “Bribe? You called that ‘community outreach.’ Last time you traded a granola bar for a stray cat’s shot records.” Molly snorts, shoving needle-nose pliers into his back pocket. “Quit complaining—these presents are net profit: the Lego set, art easel, fancy stuffed sloth. We’ll dump ’em in the trunk, hit tacos afterward, no one’s the wiser.” Bill glances furtively over the bush, cheeks paling at the piñata’s sharp tear. “What if the mom catches us?” Molly pats his bicep, grinning under her cap. “She’ll think we’re last-minute caterers. Now move before the clown runs out of balloon dogs.”
created_at"2026-03-03T23:08:04.757Z" 3/3/2026, 11:08:04 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency3901
moderation_latency(null)
generation_time133620
tokens_prompt77
tokens_completion4536
native_tokens_prompt101
native_tokens_completion5219
native_tokens_completion_images(null)
native_tokens_reasoning4894
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0020977
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"021772579285169f2dfc9d17a425ad03ee233374d2636618bdc5d"
is_byokfalse
latency3901
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772579284-CZiMk0fQiB5LYgFZvKtf"
upstream_id"021772579285169f2dfc9d17a425ad03ee233374d2636618bdc5d"
total_cost0.0020977
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
75.16% Matches word count n/a
neededCleanfalse
words213
99.02% Dialogue to Total Word Ratio
Ratio: 46.85%, Deviation: 3.15%
neededCleanfalse
wordsTotal222
wordsDialogue104
87.0860%