Run: 2026-02-20-01KHYAD64B2PGH3FFHN3ZY1T4A

Run Details

Duration: 16.5s
Cost: $0.000898
Provider: Seed

Evaluation summary

Score      Evaluator                      Details Link
68.10%     Matches word count             (details)
0.00%      Dialogue to Total Word Ratio   (details)
34.0515%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Backyard, sun dappling balloons that sagged like deflated sighs. A clown’s red nose twitched as he chased a toddler, who wailed over a stolen lollipop; a pomeranian napped on a toy bone nearby, snoring. Molly leaned against a shed, nails picking splinters, eyeing presents: a teddy with one ear, a “Superheroes” box (stickers, not the real stuff), a glittery wrap that oozed. “Piñata’s empty,” she said, nodding at the paper cow by the grill, its stick lying forgotten. Bill, in a too-small suit, fumbled with a lollipop, crumpling the wrapper. “Walkie-talkie.” She nodded at the picnic table, where gifts sagged, tape flaking. “Squeeze the boxes—something heavy, not toys.” He flexed gloved hands, leaving lint on a balloon. “Dog?” “Belt’s loose. Clasp looks like a key,” Molly said, snorting. He pointed to an aluminum ladder by the shed. “Bends easy. Jenkins’—he bends over when he talks, too.” Bill paused, then held up the lollipop. “But candy? The piñata’s full.” She grinned. “Take the piñata, presents, and his hat. No candy—pride. We’re thieves, not monsters.” He saluted. “Pride first.” She smirked. “Lift his belt. Fast.”
created_at: "2026-02-20T20:03:38.774Z" (2/20/2026, 8:03:38 PM)
model: "bytedance-seed/seed-1.6-flash-20250625"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 806
moderation_latency: (null)
generation_time: 16365
tokens_prompt: 77
tokens_completion: 2744
native_tokens_prompt: 149
native_tokens_completion: 2956
native_tokens_completion_images: (null)
native_tokens_reasoning: 2636
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.000897975
router: (null)
provider_responses:
  0:
    endpoint_id: "e01a1227-001b-44d4-a376-90850668fa84"
    id: "02177161781923579d39dabdd16daed9e301f9238148b07a2ef8f"
    is_byok: false
    latency: 611
    model_permaslug: "bytedance-seed/seed-1.6-flash-20250625"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1771617818-hZd7iPxoiscbvPbl2rG7"
upstream_id: "02177161781923579d39dabdd16daed9e301f9238148b07a2ef8f"
total_cost: 0.000897975
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

Result     Evaluator                      Details                            Meta Data
68.10%     Matches word count             n/a                                neededClean: false; words: 186
0.00%      Dialogue to Total Word Ratio   Ratio: 28.49%, Deviation: 18.49%   neededClean: false; wordsTotal: 186; wordsDialogue: 53
34.0515%   Overall
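
The reported figures can be cross-checked with a little arithmetic. The sketch below is not the evaluator's actual code, which this page does not show. It assumes the dialogue ratio is wordsDialogue divided by wordsTotal, the deviation is the absolute distance from the 10% target in the prompt, and the overall score is the plain mean of the two evaluator scores. The unrounded word count score of 68.103 is inferred by doubling the overall 34.0515% and is likewise an assumption. The function names are hypothetical.

```python
from statistics import mean

# Target share of dialogue words, in percent, taken from the prompt ("10% words of dialogue").
TARGET_DIALOGUE_RATIO_PCT = 10.0


def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the scene, in percent (assumed definition)."""
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float = TARGET_DIALOGUE_RATIO_PCT) -> float:
    """Absolute distance between the measured ratio and the target, in percentage points."""
    return abs(ratio_pct - target_pct)


# Counts reported in the evaluation metadata.
ratio = dialogue_ratio_pct(words_dialogue=53, words_total=186)
deviation = deviation_pct(ratio)

# Assumption: 68.103 is the unrounded "Matches word count" score, inferred from
# the overall score; the dialogue evaluator scored 0.00%.
word_count_score = 68.103
dialogue_score = 0.0
overall = mean([word_count_score, dialogue_score])

print(f"ratio={ratio:.2f}% deviation={deviation:.2f}% overall={overall:.4f}%")
# prints: ratio=28.49% deviation=18.49% overall=34.0515%
```

Under these assumptions the output matches the reported ratio, deviation, and overall score. How the 68.10% word count score and the 0.00% dialogue score are derived from the raw counts is not stated on the page, so those mappings are left out.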