Run: 2026-02-20-01KHYB684RKCS1CNK0S5YJ8VY2

Run Details

Duration:
1m 7s
Cost:
$0.006747
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
99.38% Matches word count (details)
1.37% Dialogue to Total Word Ratio (details)
50.3750% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly picks at a lollipop pilfered from a convenience store, squinting at the community center’s neon “Happy 7th Birthday Lila” banner flapping in the gusty wind. Bill adjusts his too-big clown wig, rubber nose slipping down his chin, prods a crumpled map in his lap. “You sure the gift table’s by the back exit?” Molly says, popping her lollipop out and gesturing to the door. “Positive—my cousin’s kid ratted it out for a shiny dinosaur sticker,” Bill replies. Molly tucks dyed blue hair behind her ear, grabs a fake flower lapel from her bag. “That’s low even for us.” “It’s stacked with remote cars, fancy dolls, cash in gift cards,” Bill says, tapping the map. Bill shifts, gaze darting to the party’s bustling window. “What’s the distraction plan?” Molly asks, lollipop hovering. “I’ll do a terrible balloon animal show, send kids screaming,” Bill says. Molly leans forward, smirks, flicking her lollipop’s stick. “Perfect—they’ll laugh at your deflated giraffe instead of noticing our loot,” she says. Bill adjusts his wig again, grins. “Meet you at the dumpsters in ten minutes,” he says. Molly nods, popping her lollipop back in, gaze sharpening. She checks her watch, shifts her weight, scans the area for lingering security guards.
created_at"2026-02-20T20:17:20.035Z" 2/20/2026, 8:17:20 PM
model"bytedance-seed/seed-1.6-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency818
moderation_latency(null)
generation_time66465
tokens_prompt77
tokens_completion2955
native_tokens_prompt147
native_tokens_completion3355
native_tokens_completion_images(null)
native_tokens_reasoning3044
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00674675
router(null)
provider_responses
0
endpoint_id"d85c3862-4e0d-4f8d-955b-babc62f98124"
id"021771618640451a491eea4a025beb52fc51c9506f8c2b2abe697"
is_byokfalse
latency818
model_permaslug"bytedance-seed/seed-1.6-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771618640-KurmRwbqU0yvBQgxNmK9"
upstream_id"021771618640451a491eea4a025beb52fc51c9506f8c2b2abe697"
total_cost0.00674675
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
99.38% Matches word count n/a
neededCleanfalse
words205
1.37% Dialogue to Total Word Ratio
Ratio: 35.61%, Deviation: 14.39%
neededCleanfalse
wordsTotal205
wordsDialogue73
50.3750%