Run: 2026-02-20-01KHY8NEQ9029QN4PH0NCPW081

Run Details

Duration: 2m 12s
Cost: $0.012771
Provider: Seed

Evaluation summary

Score      Evaluator
0.00%      Matches word count
94.39%     Dialogue to Total Word Ratio
47.1948%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly leans against a dented minivan, squinting at the pastel party tent strung with balloon animals across the street. A pilfered lollipop stick juts from her jacket pocket; she twists it idly. Bill hunches beside her, adjusting the tiny top hat perched on his bald head—his half-baked “magician assistant” cover. The tent’s door flaps flutter, revealing a stack of wrapped boxes tied with rainbow ribbons; their target glints under string lights. A toddler’s high-pitched giggle drifts over, making Bill flinch. He tucks a frayed mesh bag into his coat. Molly yanks a glittery apron over her jacket, hiding the empty duffel at her hip. She checks her watch—15 minutes until the piñata reveal, their only window to strike. “Your top hat looks ridiculous,” Molly says. “Better than your glittery apron—kids’ll spot a fake helper fast,” Bill retorts. “Glitter’s distraction. They’ll stare at my sleeves while I stuff presents in the bag,” she says. “Then what about the piñata guy? He’s got his eye on the gift table,” Bill mutters. “I’ll trip him with my apron strings when he’s swinging the bat,” she smirks. “Won’t that make the parents scream?” he asks. “I’ll blame it on a balloon animal tangle—they’ll buy it,” she says. “Fine. But if we get stuck in a game of pin the tail, I’m ditching you,” he huffs. “Deal. But you owe me a giant cupcake from the snack table afterward,” she says. “Only if you don’t drop the stuffed bear collection mid-escape,” he replies.
created_at: "2026-02-20T19:33:12.563Z" (2/20/2026, 7:33:12 PM)
model: "bytedance-seed/seed-1.6-20250625"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1249
moderation_latency: (null)
generation_time: 132320
tokens_prompt: 77
tokens_completion: 4971
native_tokens_prompt: 147
native_tokens_completion: 6367
native_tokens_completion_images: (null)
native_tokens_reasoning: 5992
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.01277075
router: (null)
provider_responses:
  0:
    endpoint_id: "d85c3862-4e0d-4f8d-955b-babc62f98124"
    id: "021771615992850e1f2c28e220bf76d8a56e2a46eaa08e9c3e5e8"
    is_byok: false
    latency: 1249
    model_permaslug: "bytedance-seed/seed-1.6-20250625"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1771615992-R0G96FgLeY4jxvpUuXq2"
upstream_id: "021771615992850e1f2c28e220bf76d8a56e2a46eaa08e9c3e5e8"
total_cost: 0.01277075
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 255

Result: 94.39%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 45.10%, Deviation: 4.90%
Meta Data: neededClean: false, wordsTotal: 255, wordsDialogue: 115

Overall: 47.1948%
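
The ratio and deviation reported above can be reproduced from the word counts. The Python sketch below is illustrative only, not the benchmark's code: the function names are hypothetical, and the 50% target is taken from the prompt. The report does not show how the 4.90% deviation becomes the 94.39% evaluator score, so that step is not reconstructed. The overall figure of 47.1948% is consistent, within rounding, with an unweighted mean of the two evaluator scores, but that too is an assumption.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return the fraction of all words that appear inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance from the target ratio (assumed 50%, per the prompt)."""
    return abs(ratio - target)


def unweighted_mean(scores: list[float]) -> float:
    """Assumed aggregation: plain average of the evaluator scores."""
    return sum(scores) / len(scores)


# Values taken from the meta data above: wordsDialogue 115, wordsTotal 255.
ratio = dialogue_ratio(115, 255)
deviation = deviation_from_target(ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Evaluator scores as displayed in the report (rounded to two decimals).
overall = unweighted_mean([0.0, 94.39])
print(f"Overall (assumed mean): {overall:.4f}%")
```

Because the displayed 94.39% is itself rounded, the mean computed here can differ from the reported overall in the last decimal places.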