Run: 2026-03-03-01KJTYY70JC88B35B6E32Q7HFX

Run Details

Duration: 5m 24s
Cost: $0.004395
Provider: Seed

Evaluation summary

Score      Evaluator                       Details Link
27.17%     Matches word count              (details)
99.70%     Dialogue to Total Word Ratio    (details)
63.4307%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
They huddle behind Bill’s rusted van, eyeing the loud children’s party across the street. “Okay, plan: I grab the picnic table’s unopened gift bags, you take the stacked wrapped presents by the side porch. The bounce house blocks the yard from the road.” “Wait, what if a kid catches me? Or the mom in the frilly princess dress? Or the guy in the scratchy T-Rex costume?” “Pretend you’re fixing the cotton candy machine’s frayed cord. I’ll fake-cry about my escaped helium balloon to distract anyone who asks.” “That dumb elf hat you made me wear is gonna give us away. Last time you cried over a lost lollipop for ten full minutes.” “Please, that worked on the cop! Kids adore elves—you’ll look like a last-minute party volunteer.” “Three minutes till the piñata breaks. Everyone’s staring at the flying candy.” “What if I trip over the porch step stool?” “Trip like a clumsy elf. No one’ll notice. Grab the blue plastic bin labeled ‘PARTY SUPPLIES’.” “What if I drop the sack?” “Run faster! Just go! We don’t have time for this!”
created_at: "2026-03-03T23:01:12.35Z" (3/3/2026, 11:01:12 PM)
model: "bytedance-seed/seed-2.0-mini-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3821
moderation_latency: (null)
generation_time: 323812
tokens_prompt: 77
tokens_completion: 11153
native_tokens_prompt: 101
native_tokens_completion: 10962
native_tokens_completion_images: (null)
native_tokens_reasoning: 10699
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0043949
router: (null)
provider_responses:
  0:
    endpoint_id: "9265aa87-9657-4d40-b9d7-2a70318e57d9"
    id: "021772578872768c95e3350f28d4e5f7ad155450ef541a32cb2a9"
    is_byok: false
    latency: 3821
    model_permaslug: "bytedance-seed/seed-2.0-mini-20260224"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1772578872-2J8PxJTFHi4hBa6fZrg7"
upstream_id: "021772578872768c95e3350f28d4e5f7ad155450ef541a32cb2a9"
total_cost: 0.0043949
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result: 27.17%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 181

Result: 99.70%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 92.35%, Deviation: 2.35%
Meta Data: neededClean: false, wordsTotal: 183, wordsDialogue: 169

Overall: 63.4307%
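
The reported ratio and deviation can be rechecked from the metadata counts. The following is a minimal sketch, not the evaluator's actual code. It assumes that dialogue is the text between curly quotation marks, that words are whitespace-separated tokens, that the deviation is measured against the 90% target stated in the prompt, and that the overall score is the plain mean of the two evaluator scores. The function name dialogue_ratio and the sample string are hypothetical.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed tokenization)."""
    return len(text.split())


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (total words, dialogue words, dialogue/total ratio).

    Assumption: dialogue is whatever sits between curly quotes.
    """
    total = count_words(text)
    spans = re.findall(r"“([^”]*)”", text)
    dialogue = sum(count_words(span) for span in spans)
    ratio = dialogue / total if total else 0.0
    return total, dialogue, ratio


# Hypothetical sample, not the model response from this run.
sample = "They wait. “Go now!” “Right behind you.”"
sample_total, sample_dialogue, sample_ratio = dialogue_ratio(sample)

# Recheck the reported details from the metadata counts in this run.
words_total = 183
words_dialogue = 169
target_pct = 90.0  # from the prompt: "90% words of dialogue"

ratio_pct = 100.0 * words_dialogue / words_total
deviation_pct = abs(ratio_pct - target_pct)

# Assumption: the overall score is the mean of the two evaluator scores.
overall_pct = (27.17 + 99.70) / 2

print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")
print(f"Mean of rounded scores: {overall_pct:.3f}%")
```

Under these assumptions, 169 / 183 gives 92.35% and a deviation of 2.35 percentage points, matching the reported details. The mean of the two rounded scores is 63.435%, which agrees with the reported 63.4307% to within the rounding of the inputs. The two evaluators report different totals (181 and 183 words), so their tokenization evidently differs, and this sketch is not expected to reproduce either count exactly on the model response.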