Run: 2026-03-11-01KKFD1NE2021WWFHRNQZ5BV0Z

Run Details

Duration: 47.6s
Cost: $0.003787
Provider: Seed

Evaluation summary

Score      Evaluator                       Details Link
2.01%      Matches word count              (details)
97.04%     Dialogue to Total Word Ratio    (details)
49.5262%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
They huddle behind a thorny rose bush at the edge of a sun-dappled suburban driveway, a unicorn piñata and rainbow streamers glowing 50 yards away from the 7th birthday party they’re casing. “Quit squishing my Spider-Man face paint, Bill! I spent 45 minutes smudging it just right so the mom thinks I’m a chaotic 7-year-old, not a 38-year-old who once stole an entire jar of school fundraising cookie dough.” “Your face paint’s fine, I’m squirming ‘cause this rose bush just stabbed through my work boot! What’s the play? We can’t just waltz in and stuff a pillowcase with LOL dolls. There’s a real clown twisting balloon poodles out there. I hate balloon poodles.” “Relax, the clown’s my cousin, he owes me for bailing him out of that pickle jar incident last summer. He’ll herd all the kids into a limbo line in 10 minutes, that’s our window. And if you sneak fruit snacks into your duffel again I’ll leave you here to explain the power drill in your ‘trick-or-treat bag’ to a room of 8-year-olds.” “Fruit snacks are high-value! That drill’s for the locked present chest, anyway! Wait, you said that chest was push-button! I didn’t bring a combo-cracking tool!” “Calm down, I paid a first-grader at the party with temporary tattoos to crack it. We’re in and out before cake, no one’s the wiser.”

created_at: "2026-03-11T21:32:34.128Z" (3/11/2026, 9:32:34 PM)
model: "bytedance-seed/seed-2.0-lite-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1107
moderation_latency: (null)
generation_time: 47494
tokens_prompt: 77
tokens_completion: 1889
native_tokens_prompt: 115
native_tokens_completion: 1879
native_tokens_completion_images: (null)
native_tokens_reasoning: 1541
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00378675
router: (null)
provider_responses:
  0:
    endpoint_id: "bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
    id: "021773264754663d4c69f90bdf814940b1abafd46c62ec664cdd0"
    is_byok: false
    latency: 1107
    model_permaslug: "bytedance-seed/seed-2.0-lite-20260309"
    provider_name: "Seed"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773264754-t0I8e56lF2JFi7xY12Gy"
upstream_id: "021773264754663d4c69f90bdf814940b1abafd46c62ec664cdd0"
total_cost: 0.00378675
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result     Evaluator                       Details                            Meta Data
2.01%      Matches word count              n/a                                neededClean: false, words: 225
97.04%     Dialogue to Total Word Ratio    Ratio: 85.84%, Deviation: 4.16%    neededClean: false, wordsTotal: 233, wordsDialogue: 200
49.5262%   Overall
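
The ratio and deviation figures in the evaluation details can be checked by hand. The sketch below is a minimal reconstruction, not the evaluator's actual code: it assumes the ratio is wordsDialogue divided by wordsTotal, that the deviation is the absolute distance from the 90% target named in the prompt, and that the overall score is a plain mean of the two evaluator scores. The function names are hypothetical. How the 2.01% and 97.04% scores themselves are derived is not stated in this report, so they are taken as given.

```python
# Hypothetical helpers; names and formulas are assumptions, not the evaluator's code.

def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 90.0) -> float:
    """Absolute distance from the target ratio (90% comes from the prompt)."""
    return abs(target_pct - ratio_pct)


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of the evaluator scores (assumed aggregation)."""
    return sum(scores) / len(scores)


# Values copied from the evaluation details of this run.
ratio = dialogue_ratio(words_dialogue=200, words_total=233)
deviation = deviation_from_target(ratio)
overall = mean_score([2.01, 97.04])

print(f"Ratio: {ratio:.2f}%")
print(f"Deviation: {deviation:.2f}%")
print(f"Mean of rounded scores: {overall:.3f}%")
```

The computed ratio and deviation round to the reported 85.84% and 4.16%. The mean of the two rounded scores is 49.525%, close to the reported 49.5262%; the small gap would be consistent with the page averaging unrounded scores, though the report does not confirm this.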