Run: 2026-03-03-01KJV01S9EAZRG3GDP1R8HCKW6

Run Details

Duration: 4m 25s
Cost: $0.003947
Provider: Seed

Evaluation summary

Score      Evaluator
98.71%     Matches word count
20.22%     Dialogue to Total Word Ratio
59.4671%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Huddled behind an oak bush on a sunlit cul-de-sac, Molly and Bill eye the suburban home where a neon bounce house blares Baby Shark to a cluster of giggling toddlers. Bill squeezes a crumpled plastic clown nose, sweat beading on his forehead as he watches a ragtag piñata hang from the deck railing. The stack of wrapped birthday gifts perched by the back porch rail catches Molly’s eye, and she plucks a stick-on mustache from a discarded party goodie bag to press to her lip. “You promised easy work,” Bill grumbles, jerking his chin toward the backyard. “Last month’s dog walker gig? A golden retriever chased me all the way to the grocery store.” “This is simple,” Molly says, nodding at the gift pile. “We dress as clowns, grab all the presents, and leave. No one will spot us—they’re all shoveling cupcake frosting off their fingers.” Bill’s eyes dart to the harried birthday mom swigging from a plastic sangria pitcher. “What if she catches us? She’s already staring right at me.” “We say we’re the freelance face painters she booked,” Molly replies, tapping her fanny pack full of balloon animal wands. “She’ll buy it. I practiced a wonky kitty cat on my forearm this morning.”
created_at: "2026-03-03T23:20:37.947Z" (3/3/2026, 11:20:37 PM)
model: "bytedance-seed/seed-2.0-mini-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 481
moderation_latency: (null)
generation_time: 264733
tokens_prompt: 77
tokens_completion: 9088
native_tokens_prompt: 101
native_tokens_completion: 9841
native_tokens_completion_images: (null)
native_tokens_reasoning: 9562
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0039465
router: (null)
provider_responses:
  0:
    endpoint_id: "9265aa87-9657-4d40-b9d7-2a70318e57d9"
    id: "021772580038078f6ba3ba09f8d739ff3017972d8947b2ea29e9f"
    is_byok: false
    latency: 481
    model_permaslug: "bytedance-seed/seed-2.0-mini-20260224"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1772580037-Ma83LmW5hG2vF8IIxRqw"
upstream_id: "021772580038078f6ba3ba09f8d739ff3017972d8947b2ea29e9f"
total_cost: 0.0039465
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"
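
The timing and token fields above can be cross-checked against the 4m 25s duration reported under Run Details. The sketch below is a rough reading of the metadata, not a documented formula. It assumes that latency and generation_time are in milliseconds, that the reported duration is roughly their sum, and that native_tokens_reasoning is counted inside native_tokens_completion.

```python
# Values copied from the run metadata.
generation_time_ms = 264733        # generation_time (assumed milliseconds)
latency_ms = 481                   # latency (assumed milliseconds)
native_tokens_completion = 9841    # native_tokens_completion
native_tokens_reasoning = 9562     # native_tokens_reasoning

# Assumption: wall-clock duration is latency plus generation time.
total_seconds = (generation_time_ms + latency_ms) / 1000
minutes, seconds = divmod(int(total_seconds), 60)

# Assumption: reasoning tokens are a subset of the completion tokens.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)
reasoning_share = native_tokens_reasoning / native_tokens_completion
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(f"Duration: {minutes}m {seconds}s")
print(f"Throughput: {tokens_per_second:.1f} tokens/s")
print(f"Reasoning share: {reasoning_share:.1%} ({visible_tokens} visible tokens)")
```

Under these assumptions, nearly all of the completion tokens went to reasoning rather than to the scene of roughly 200 words, which would account for the long generation time.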

Evaluation details

Result     Evaluator                      Details                            Meta Data
98.71%     Matches word count             n/a                                neededClean: false; words: 206
20.22%     Dialogue to Total Word Ratio   Ratio: 38.76%, Deviation: 11.24%   neededClean: false; wordsTotal: 209; wordsDialogue: 81
59.4671%   Overall
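
The ratio and deviation reported for the dialogue evaluator follow directly from the word counts in its metadata and the 50% target in the prompt. The sketch below reproduces that arithmetic. The mapping from an 11.24% deviation to a 20.22% score is not stated in this run, so it is not reproduced here. Treating the overall score as an unweighted mean of the two evaluator scores is an assumption: it is consistent with the displayed numbers, and the small gap to 59.4671% could come from rounding in the displayed scores.

```python
# Values copied from the evaluator metadata in this run.
words_total = 209      # wordsTotal
words_dialogue = 81    # wordsDialogue
target_ratio = 0.50    # the prompt asks for 50% dialogue

ratio = words_dialogue / words_total
deviation = abs(target_ratio - ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 38.76%, Deviation: 11.24%"

# Assumption: the overall score is an unweighted mean of the evaluator scores.
displayed_scores = [98.71, 20.22]  # rounded values as displayed
overall_estimate = sum(displayed_scores) / len(displayed_scores)
print(f"Overall (estimated from rounded scores): {overall_estimate:.3f}%")
```

Note that the two evaluators report different totals (206 and 209 words), so the ratio here uses the dialogue evaluator's own wordsTotal.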