Run: 2026-03-03-01KJTXSK7384NTBBSBX7CSBZJ6

Run Details

Duration:
2m 39s
Cost:
$0.002436
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
68.10% Matches word count (details)
38.33% Dialogue to Total Word Ratio (details)
53.2153% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The backyard hummed with sugar-fueled chaos: a neon bounce house blaring off-key nursery rhymes, a lopsided sheet cake oozing pink frosting, and yelling toddlers chasing a bubble machine. Molly and Bill hunched behind an overflowing trash can, Molly clutching crumpled party plates, Bill twisting a half-deflated beach ball he’d fished from a curb pile, his toolbelt clinking with cheap scissors and a stolen plastic tiara. “Last week’s ‘pool cleaner’ got tackled by a princess-dressed mom after grabbing a kid’s Lego set,” Bill whispered. “You sure we don’t need a better cover?” Molly grinned, holding up the plates. “We’re volunteer gift organizers. I’ll distract the birthday kid by gushing over his dinosaur piñata, you dart to the presents and stuff every wrapped box into this frayed backpack.” “What if we get caught?” Bill yelped, eyes darting to the giant remote-control truck propped by the fence. “Just pretend you slipped on cake frosting,” Molly nudged him with her elbow. “Don’t touch the lollipop display—those are backup bribes.” “Ready?” Bill asked, slinging the frayed backpack over his shoulder. “Go,” Molly replied, already stepping toward the glitter-draped picnic tables.
created_at"2026-03-03T22:41:12.739Z" 3/3/2026, 10:41:12 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1076
moderation_latency(null)
generation_time158812
tokens_prompt77
tokens_completion6305
native_tokens_prompt101
native_tokens_completion6064
native_tokens_completion_images(null)
native_tokens_reasoning5789
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0024357
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"02177257767364150b1c418616fd9b5f78a16c5c9d3416260f6a8"
is_byokfalse
latency1076
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772577672-b1mrORzfw5vP7cBg28zJ"
upstream_id"02177257767364150b1c418616fd9b5f78a16c5c9d3416260f6a8"
total_cost0.0024357
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
68.10% Matches word count n/a
neededCleanfalse
words186
38.33% Dialogue to Total Word Ratio
Ratio: 40.10%, Deviation: 9.90%
neededCleanfalse
wordsTotal192
wordsDialogue77
53.2153%