Run: 2026-03-03-01KJTZWPWPPNW5PK4NXBKJGQP7

Run Details

Duration: 5m 1s
Cost: $0.004261
Provider: Seed

Evaluation summary

Score     Evaluator                      Details Link
75.16%    Matches word count             (details)
99.01%    Dialogue to Total Word Ratio   (details)
87.0830%  Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly and Bill huddle behind an overgrown hedge, watching the backyard children’s party. “Okay, playbook: slip through the side gate when the clown does his rubber duck magic trick, stack all the wrapped presents in your backpack, and bolt before the piñata even gets whacked.” “Playbook? Last playbook you had we stole a vending machine full of Skittles and got stuck in a mall fountain for three hours.” “That was a tiny mess. The dad’s manning the charcoal BBQ, the aunt’s yelling about kids getting cupcake frosting in the sandbox, and no one’s guarding the dining room piled full of presents. Perfect.” “What if the toddlers spot us? They’ll scream bloody murder, and I can’t handle high-pitched snot-induced panic.” “They’ll forget in two minutes flat. Last week I swiped a gummy bear from a toddler’s high chair, and she was over it by the time her mom wiped sprinkles off her face. Ready?” “Fine, but if we get caught I’m blaming you entirely. Also, can we grab the sugar cookie platter on the way out? I’m starving.” “One cookie. That’s non-negotiable. Now, stay down and stay quiet.”
created_at: "2026-03-03T23:17:51.65Z" (3/3/2026, 11:17:51 PM)
model: "bytedance-seed/seed-2.0-mini-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1319
moderation_latency: (null)
generation_time: 301275
tokens_prompt: 77
tokens_completion: 10009
native_tokens_prompt: 101
native_tokens_completion: 10626
native_tokens_completion_images: (null)
native_tokens_reasoning: 10367
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0042605
router: (null)
provider_responses:
0:
endpoint_id: "9265aa87-9657-4d40-b9d7-2a70318e57d9"
id: "02177257987205977dd13f2098096eba9569e39b59e750c54653e"
is_byok: false
latency: 1319
model_permaslug: "bytedance-seed/seed-2.0-mini-20260224"
provider_name: "Seed"
status: 200
api_type: "completions"
id: "gen-1772579871-sGQvEn04RN0nlXoNt6NE"
upstream_id: "02177257987205977dd13f2098096eba9569e39b59e750c54653e"
total_cost: 0.0042605
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"
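
A few of the metadata fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes generation_time is reported in milliseconds (which is consistent with the 5m 1s duration shown under Run Details) and that native_tokens_reasoning is counted as part of native_tokens_completion. The function name format_duration is made up for this example.

```python
# Values copied from the run metadata above.
GENERATION_TIME_MS = 301275          # assumed to be milliseconds
NATIVE_TOKENS_COMPLETION = 10626
NATIVE_TOKENS_REASONING = 10367      # assumed to be a subset of completion tokens


def format_duration(milliseconds: int) -> str:
    """Render a millisecond count as 'Xm Ys', dropping the fractional second."""
    total_seconds = milliseconds // 1000
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


# Tokens left over for the visible answer, and the share spent on reasoning.
visible_tokens = NATIVE_TOKENS_COMPLETION - NATIVE_TOKENS_REASONING
reasoning_share = NATIVE_TOKENS_REASONING / NATIVE_TOKENS_COMPLETION

print(format_duration(GENERATION_TIME_MS))           # 5m 1s
print(visible_tokens)                                # 259
print(f"{reasoning_share:.2%}")                      # 97.56%
```

Under those assumptions, nearly all of the completion tokens went to reasoning, with only a few hundred left for the roughly 190-word scene.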

Evaluation details

Result    Evaluator                      Details                           Meta Data
75.16%    Matches word count             n/a                               neededClean: false, words: 187
99.01%    Dialogue to Total Word Ratio   Ratio: 93.16%, Deviation: 3.16%   neededClean: false, wordsTotal: 190, wordsDialogue: 177
87.0830%  Overall
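
The reported ratio and deviation can be reproduced from the meta data with simple arithmetic. The sketch below is not the evaluator's code: it takes the 90% target from the prompt, uses the word counts exactly as reported, and makes no claim about how the evaluator counts words or turns a deviation into a score. The comparison with the overall figure assumes it is a plain mean of the two evaluator scores, which the report does not state.

```python
# Values copied from the evaluation details above.
WORDS_DIALOGUE = 177
WORDS_TOTAL = 190
TARGET_RATIO = 0.90                  # "90% words of dialogue" in the prompt


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    return words_dialogue / words_total


ratio = dialogue_ratio(WORDS_DIALOGUE, WORDS_TOTAL)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Assumption: the overall score is the mean of the two evaluator scores.
mean_of_rounded_scores = (75.16 + 99.01) / 2
print(f"{mean_of_rounded_scores:.3f}")   # 87.085
```

The mean of the two rounded scores is 87.085, close to the reported 87.0830%; the small gap would be explained if the overall figure is averaged from unrounded scores, though the report does not confirm this.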