Run: 2026-03-03-01KJTYKZECTVZYZF7SB539ZDSP

Run Details

Duration:
1m 40s
Cost:
$0.001550
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
0.00% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
0.0000% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly and Bill huddle behind the rusted municipal dumpster, the tinny trill of a toy xylophone drifting over the chain-link backyard fence. “Okay, final plan,” Molly says, yanking a neon pink unicorn backpack from her frayed tote. “We pose as last-minute party helpers. I’ll tell the host I’m the substitute face painter, you’ll be the balloon artist who forgot his air pump—just carry scrap latex bits in your pocket and look panicked.” Bill wipes sweat from his jaw, staring at the stack of wrapped presents glinting by the back porch. “Last time your ‘blend in’ scheme had you in a chicken suit at a library book sale, where you stole a signed *Dog Man* copy. The librarian still posts about the ‘mysterious feathered thief’.” “That was a win!” Molly hisses, shoving the backpack into his hands. “We’ll use this to haul all the presents—no one’ll suspect a pair of thieves showing up with a kids’ backpack.” “Wait, what about the mom with the walkie-talkie watching the craft table?” Bill whispers, nodding toward the fence line. “Distraction!” Molly pulls a crumpled whoopee cushion from her jeans pocket. “I’ll plop it on the punch bowl stool and sit down right as we ‘set up’. All the moms will crowd over laughing, we’ll grab the pile and bolt.” Bill gapes at the cushion. “You’re gonna sit on a whoopee cushion in front of a room full of helicopter parents?” “Your call: Either that, or you belt out your ‘steal the loot’ jingle again. The grocery store security guard still brings it up at brunch with his friends.”
created_at"2026-03-03T22:55:36.92Z" 3/3/2026, 10:55:36 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency3781
moderation_latency(null)
generation_time99967
tokens_prompt77
tokens_completion4058
native_tokens_prompt101
native_tokens_completion3849
native_tokens_completion_images(null)
native_tokens_reasoning3481
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0015497
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"021772578537034c0f1672f4b1c3107d0b8535f43b945c74ac084"
is_byokfalse
latency3781
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772578536-MdF7aiTdXIuhscjHNaeD"
upstream_id"021772578537034c0f1672f4b1c3107d0b8535f43b945c74ac084"
total_cost0.0015497
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
0.00% Matches word count n/a
neededCleanfalse
words266
0.00% Dialogue to Total Word Ratio
Ratio: 69.14%, Deviation: 20.86%
neededCleanfalse
wordsTotal269
wordsDialogue186
0.0000%