Run: 2026-03-03-01KJTYF6JNQV11BDM0YSF3850Q

Run Details

Duration: 2m 46s
Cost: $0.002497
Provider: Seed

Evaluation summary

Score     Evaluator
3.62%     Matches word count
0.00%     Dialogue to Total Word Ratio
1.8117%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Crouched behind a suburban hedge, hoods pulled low against the twilight, Molly and Bill ducked as a minivan rumbled past. Molly nudged Bill’s elbow with her half-empty coffee cup. “You positive the lawn guy left the side gate unlocked like he promised?” Bill flipped a crumpled grocery list scrawled with stick-figure blueprints of the backyard. “Said he bailed at 3:45, party wraps at 7. Nine minutes between piñata smash and cake line.” Molly snatched the list, crumpling it into a tight ball in her fist. “What about Grandma Mabel? She’s glued to the present table all night, tucking kids’ handwritten cards back into piles like a hawk.” Bill snickered, darting a quick glance toward the backyard’s glowing string lights. “Told her I’m a local nursery rep, asked about her prize roses. She’s rambling about aphids over by the flower bed.” Molly stifled a snort, clapping a hand over her mouth. “What if the birthday mom catches us? She’s got eyes like a hawk.” “I brought a pack of cheap glow sticks,” Bill whispered. “Will distract her for two minutes—enough time to stuff the duffel.” Molly grabbed the canvas duffel from the hedge’s shadow, her grin sharp under her hood. “Let’s go scoop all the birthday presents. And don’t even think about skipping the giant rainbow lollipops by the snack table—non-negotiable spoils.”
created_at: "2026-03-03T22:53:09.845Z" (3/3/2026, 10:53:09 PM)
model: "bytedance-seed/seed-2.0-mini-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1269
moderation_latency: (null)
generation_time: 156723
tokens_prompt: 77
tokens_completion: 6270
native_tokens_prompt: 101
native_tokens_completion: 6216
native_tokens_completion_images: (null)
native_tokens_reasoning: 5884
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0024965
router: (null)
provider_responses:
  0:
    endpoint_id: "9265aa87-9657-4d40-b9d7-2a70318e57d9"
    id: "02177257839012485fbb282d67f6232524942e9d2086bad40ef80"
    is_byok: false
    latency: 1268
    model_permaslug: "bytedance-seed/seed-2.0-mini-20260224"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1772578389-985HG3gYB9AATVHhr9HA"
upstream_id: "02177257839012485fbb282d67f6232524942e9d2086bad40ef80"
total_cost: 0.0024965
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result    Evaluator                      Details                             Meta Data
3.62%     Matches word count             n/a                                 neededClean: false, words: 224
0.00%     Dialogue to Total Word Ratio   Ratio: 57.52%, Deviation: 32.48%    neededClean: false, wordsTotal: 226, wordsDialogue: 130
1.8117%   Overall
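
The ratio and deviation reported above can be reproduced from the metadata counts. The sketch below is only an illustration: the function names are hypothetical, the 90% target is taken from the prompt, and treating the overall figure as an unweighted mean of the two evaluator scores is an assumption that happens to agree with the displayed values after rounding. How the evaluator maps a deviation to a percentage score is not shown in this log, so it is not modeled here.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of words that are dialogue, as a percentage."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float) -> float:
    """Absolute gap between the measured ratio and the requested target."""
    return abs(target_pct - ratio_pct)


def unweighted_mean(scores: list[float]) -> float:
    """Assumed aggregation: plain average of the evaluator scores."""
    return sum(scores) / len(scores)


# Counts from the Meta Data column; the 90% target comes from the prompt.
ratio = dialogue_ratio_pct(words_dialogue=130, words_total=226)
deviation = deviation_pct(ratio, target_pct=90.0)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")

# Averaging the rounded scores gives about 1.81, in line with the 1.8117% shown
# (the log presumably averages unrounded values).
overall = unweighted_mean([3.62, 0.00])
print(f"Overall (assumed mean): {overall:.2f}%")
```

Note that the two evaluators report slightly different totals (224 versus 226 words), so each appears to tokenize the response on its own; the sketch uses the 226 figure because that is the one paired with the dialogue count.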