Run: 2026-03-03-01KJTZP848EQEZ08RZDM2SN8JQ

Run Details

Duration:
2m 3s
Cost:
$0.001971
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
0.00% Matches word count (details)
99.99% Dialogue to Total Word Ratio (details)
49.9960% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Golden hour bleached the cul-de-sac in warm amber as Molly and Bill pressed flat behind an overgrown rose bush, their shoulders catching on sharp pink thorns. Bill clutched a crumpled plastic gift bag emblazoned “My First Soccer Star,” its handles digging into his bruised knuckles—scraped that morning prying open a rusted playground lock. Molly tucked a roll of heavy-duty tinfoil into her waistband, using it to shade her face from streamers fluttering out the home’s open sliding glass door. Inside, a clown in a too-tight red suit twisted balloon animals, while toddlers shrieking after a bubblemachine drowned out the soft scuff of their sneakers on dewy lawn. “Stick to the bows that still have their original price tags,” Molly hissed, voice tight enough only Bill could hear. “Don’t forget the ones with the shiny foil wrappers,” Bill grunted, already stepping toward the stack of presents by the patio heater. Bill snagged two thickly wrapped boxes first, shoving them into his overstuffed bag, while Molly darted under the picnic table to grab a pile of smaller shinier packages tucked between a folding lawn chair and a half-eaten cupcake plate. A fluffy golden retriever wandered over, sniffing Bill’s shoe, and he held his breath until it trotted off toward the clown. They’d planned this heist for three weeks: parents would be glued to cake and photos, no one would spot two volunteer helpers slipping away with loot. As Molly stuffed a stuffed unicorn into her jacket, she glanced at the kitchen window, counting seconds till the clown pulled his rabbit trick.
created_at"2026-03-03T23:14:19.924Z" 3/3/2026, 11:14:19 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1309
moderation_latency(null)
generation_time122970
tokens_prompt77
tokens_completion4489
native_tokens_prompt101
native_tokens_completion4902
native_tokens_completion_images(null)
native_tokens_reasoning4558
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0019709
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"02177257966037234cac430b5ac258905380d809540adb4d73e7e"
is_byokfalse
latency1309
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772579659-NR8jdiS0nVC07r4MI9IW"
upstream_id"02177257966037234cac430b5ac258905380d809540adb4d73e7e"
total_cost0.0019709
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
0.00% Matches word count n/a
neededCleanfalse
words260
99.99% Dialogue to Total Word Ratio
Ratio: 9.06%, Deviation: 0.94%
neededCleanfalse
wordsTotal265
wordsDialogue24
49.9960%