Run: 2026-03-11-01KKFDZ9RVAZVN4VF05C3DDQFZ

Run Details

Duration: 40.6s
Cost: $0.003433
Provider: Seed

Evaluation summary

Score     Evaluator
90.48%    Matches word count
0.00%     Dialogue to Total Word Ratio
45.2419%  Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Huddled behind an overgrown azalea bush at the edge of a sun-dappled suburban lawn, Molly and Bill peek through a gap in the string of unicorn balloons bobbing above the seven-year-old birthday boy’s driveway, a crumpled list of target presents crumpled in Molly’s rubber-gloved hand. “Molls, this is insane. What if the party clown catches us? Dude’s got a unicycle, he could flank us before we even stuff the first Lego set in my duffel,” Bill hisses, yanking his neon fanny pack tighter. “Relax, that clown’s three vodka-spiked seltzers deep, I watched him face-plant into a bowl of cheese puffs five minutes ago. The only person guarding the present pile is Mrs. Hale, passed out on the couch rewatching Encanto for the ninth time,” Molly snorts, adjusting the cheap T-Rex mask slung over her arm. “Wait, what about little Jax? That gremlin bit my ankle last week when I tried to steal his gummy bears, what if he recognizes my sneakers?” Bill squawks. “Your sneakers are covered in dinosaur stickers too, you idiot. He’ll think you’re a friendly T-Rex come to borrow presents. That Switch in the pile’s mine anyway—his mom borrowed my Amazon account and never paid me back. The piñata’s about to break, that’s our distraction. Move.”
created_at: "2026-03-11T21:48:45.224Z" (3/11/2026, 9:48:45 PM)
model: "bytedance-seed/seed-2.0-lite-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 419
moderation_latency: (null)
generation_time: 40526
tokens_prompt: 77
tokens_completion: 1753
native_tokens_prompt: 115
native_tokens_completion: 1702
native_tokens_completion_images: (null)
native_tokens_reasoning: 1403
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00343275
router: (null)
provider_responses:
  0:
    endpoint_id: "bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
    id: "021773265725351681426abb3fc8758bc9af65311c7ae31281dee"
    is_byok: false
    latency: 419
    model_permaslug: "bytedance-seed/seed-2.0-lite-20260309"
    provider_name: "Seed"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773265725-mA2zLFXF02O8uledI4uf"
upstream_id: "021773265725351681426abb3fc8758bc9af65311c7ae31281dee"
total_cost: 0.00343275
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result: 90.48%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 210

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 67.59%, Deviation: 22.41%
Meta Data: neededClean: false, wordsTotal: 216, wordsDialogue: 146

Overall: 45.2419%
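
The reported figures can be cross-checked with a little arithmetic. The sketch below is not the evaluator's actual code, which this report does not show. It assumes the ratio is wordsDialogue divided by wordsTotal, that the deviation is measured against the 90% dialogue target from the prompt, and that the overall score is the plain mean of the two evaluator scores. The function names are hypothetical.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the scene, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float) -> float:
    """Absolute gap between the measured ratio and the requested target."""
    return abs(target_pct - ratio_pct)


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    return sum(scores) / len(scores)


# Values taken from the evaluation details above.
ratio = dialogue_ratio_pct(words_dialogue=146, words_total=216)
# 90.0 is the dialogue share requested in the prompt (assumed target).
deviation = deviation_pct(ratio, target_pct=90.0)
overall = mean_score([90.48, 0.00])

print(f"Ratio: {ratio:.2f}%")
print(f"Deviation: {deviation:.2f}%")
print(f"Overall: {overall:.2f}%")
```

Under these assumptions the ratio and deviation reproduce the reported 67.59% and 22.41%, and the mean of the two rounded scores agrees with the reported overall of 45.2419% to two decimal places. How the evaluator maps the deviation to a 0.00% score, or the 210-word count to 90.48%, is not stated in the report and is not modeled here.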