Run: 2026-03-11-01KKFD4F58AB8VH4WVC46QVCAB

Run Details

Duration:
37.5s
Cost:
$0.003267
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
6.09% Matches word count (details)
12.34% Dialogue to Total Word Ratio (details)
9.2178% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
They hunched behind an overgrown hydrangea bush across from the suburban ranch’s fenced backyard, tinny Baby Shark seeping over the picket line. Bill tugged his slipping black ski mask up his nose. “Molly, this is insane. That’s a seven-year-old’s unicorn birthday party. What if we get arrested? I still have a misdemeanor from that grocery store pickle heist last year.” “Insane is leaving $400 worth of LOL dolls and a brand-new OLED Switch on the patio table, Bill. We scouted this for three whole days.” “What about the clown? The balloon-twisting one? I saw him punt a squirrel that stole a cupcake last weekend, he’s lethal.” “Clown’s on a smoke break, I distracted him with a gas station chili dog. Only thing between us and the loot is the birthday girl’s golden retriever, and I brought a whole bag of cheese puffs to neutralize him.” Bill gaped. “Neutralize? You’re gonna feed a golden cheese puffs? What if he barfs on the Switch? That’s our main haul!” Molly hissed, yanking him down as a mom wandered past the fence with a juice box. “Relax, I brought wet wipes. When the piñata breaks, everyone swarms the candy, that’s our window. We sneak in, grab the pile, and bolt before anyone clocks we’re not the out-of-town teen cousins. C’mon, that unicorn piñata’s gonna split any second—”
created_at"2026-03-11T21:34:06.005Z" 3/11/2026, 9:34:06 PM
model"bytedance-seed/seed-2.0-lite-20260309"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency705
moderation_latency(null)
generation_time37439
tokens_prompt77
tokens_completion1687
native_tokens_prompt115
native_tokens_completion1619
native_tokens_completion_images(null)
native_tokens_reasoning1299
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00326675
router(null)
provider_responses
0
endpoint_id"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id"021773264846524e0fa5d59a7b50e9f748f37815b909e28bdd619"
is_byokfalse
latency705
model_permaslug"bytedance-seed/seed-2.0-lite-20260309"
provider_name"Seed"
status200
user_agent"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer(null)
api_type"completions"
id"gen-1773264846-WpfmmXbwbXfSxIb3T0tR"
upstream_id"021773264846524e0fa5d59a7b50e9f748f37815b909e28bdd619"
total_cost0.00326675
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
6.09% Matches word count n/a
neededCleanfalse
words223
12.34% Dialogue to Total Word Ratio
Ratio: 77.97%, Deviation: 12.03%
neededCleanfalse
wordsTotal227
wordsDialogue177
9.2178%