Run: 2026-03-11-01KKFDW5DYFH62MW09KFRJ6JTH

Run Details

Duration:
43.6s
Cost:
$0.003601
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
35.00% Matches word count (details)
82.08% Dialogue to Total Word Ratio (details)
58.5388% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Crouched behind a thorny rose bush across from the suburban bungalow, Molly and Bill stared at the neon blue streamers strung across the porch, the tinny wail of *Let It Go* bleeding through open first-floor windows. “Molls, this is deranged. We rob armored trucks, not a 7-year-old’s Elsa birthday party. What if the piñata guy has a taser? Those costume characters are always jacked enough to body slam a squirrel.” “Relax, I scouted. The piñata guy’s a 16-year-old from the local high school, he’s been hiding behind the garage vaping for 40 minutes straight. The only on-site security is Grandma Linda, who passes out 90 seconds into that song every single party.” “Wait, the face paint clown? She waved a spray glitter can that could blind a moose. And what’s the haul even? A bunch of plastic dolls and some dumb Lego set?” “That Lego set’s the discontinued Millennium Falcon, I can flip it for $1200 on eBay. The LOL dolls are for my little cousin’s birthday, I’m multitasking. Now shut up—they just lit the candles. That’s our window to sneak in the sliding back door before they start opening presents. And if you stop to hoard goldfish crackers again like you did at the last kid’s gig, I’m locking you in the bounce house overnight. Move.”
created_at"2026-03-11T21:47:02.474Z" 3/11/2026, 9:47:02 PM
model"bytedance-seed/seed-2.0-lite-20260309"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency888
moderation_latency(null)
generation_time43561
tokens_prompt77
tokens_completion1866
native_tokens_prompt115
native_tokens_completion1786
native_tokens_completion_images(null)
native_tokens_reasoning1479
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00360075
router(null)
provider_responses
0
endpoint_id"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id"0217732656225959aa9d0b0344e85196ac345634277259c5e594d"
is_byokfalse
latency888
model_permaslug"bytedance-seed/seed-2.0-lite-20260309"
provider_name"Seed"
status200
user_agent"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer(null)
api_type"completions"
id"gen-1773265622-JLGAI4sjzUznmf7G5lxu"
upstream_id"0217732656225959aa9d0b0344e85196ac345634277259c5e594d"
total_cost0.00360075
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
35.00% Matches word count n/a
neededCleanfalse
words218
82.08% Dialogue to Total Word Ratio
Ratio: 83.33%, Deviation: 6.67%
neededCleanfalse
wordsTotal222
wordsDialogue185
58.5388%