Run: 2026-02-20-01KHYAFX42KDTEZ9SBG3QE8Z9M

Run Details

Duration:
1m 44s
Cost:
$0.009561
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
0.08% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
0.0424% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
They huddle on a splintered park bench beside the party’s neon tent, Molly gripping a crumpled laundry basket, Bill twisting a whoopee cushion in his pocket. You sure this ridiculous kids’ party heist’s solid enough to pull off without getting caught? Dead sure—parents’ll be distracted by the piñata meltdown I planted. Piñata meltdown? Taped a whoopee cushion to the piñata string—yanks it, chaos erupts. What about the gift table guard everyone’s talking about? That’s the birthday kid’s grandma—she nips to the snack table every three minutes for cotton candy. Three minutes? Enough to stuff all wrapped presents in our laundry basket? Barely—you grab big doll houses first; I’ll snag flashy remote cars. What if a curious kid spots us mid-grab? Flash ’em a sparkly space sticker—they’ll gush to friends and forget we existed. Sticker trick? You tested that? Tested on my niece—she traded her favorite glitter crayon for a dinky space sticker last week. What about the crowded exit by the swing set? We duck through the sandbox tunnel—parents’ll think we’re weird uncles. Weird uncles hauling a laundry basket of presents? Tell ’em we’re taking donations to the local toy drive—they’ll thank us. And if grandma comes back early? Duck behind the gift table’s streamers. You’re absolutely crazy. Crazy enough to pull this off and buy our fancy taco dinner.
created_at"2026-02-20T20:05:07.852Z" 2/20/2026, 8:05:07 PM
model"bytedance-seed/seed-1.6-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency983
moderation_latency(null)
generation_time103557
tokens_prompt77
tokens_completion4207
native_tokens_prompt147
native_tokens_completion4762
native_tokens_completion_images(null)
native_tokens_reasoning4445
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00956075
router(null)
provider_responses
0
endpoint_id"d85c3862-4e0d-4f8d-955b-babc62f98124"
id"0217716179081052b8720b6d986e81ce95e9b006de0a2cb29427d"
is_byokfalse
latency983
model_permaslug"bytedance-seed/seed-1.6-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771617907-Jc4p4C1RrLNa4Zepsm2R"
upstream_id"0217716179081052b8720b6d986e81ce95e9b006de0a2cb29427d"
total_cost0.00956075
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
0.08% Matches word count n/a
neededCleanfalse
words229
0.00% Dialogue to Total Word Ratio
Ratio: 4.80%, Deviation: 85.20%
neededCleanfalse
wordsTotal229
wordsDialogue11
0.0424%