Run: 2026-02-20-01KHYA41Q7CJSAYR9BRBPEY4RT

Run Details

Duration:
1m 48s
Cost:
$0.009175
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
99.38% Matches word count (details)
82.86% Dialogue to Total Word Ratio (details)
91.1163% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly picks at a cherry lollipop pilfered from a corner convenience store, squinting at the pastel bounce house bobbing behind the neighbor’s white picket fence. Bill adjusts his ill-fitting neon clown wig, the elastic digging into his forehead, his fake red nose askew. The party’s cackling shrieks prick their ears; a harried parent carries a stack of glossily gift-wrapped boxes to a wooden picnic table. Molly tucks her tiny lockpicks into her scuffed boot sock, her sneakers scuffing the dusty dirt below the park bench as she quietly watches the parent tape a “Presents Here” sign to the table’s edge. “You sure the clown getup’s not too obvious?” Molly says. “Kids love clowns—they’ll beg me to help carry presents inside when I offer to rearrange them,” Bill replies. “What if the parent counts them before we strike?” she asks. “I’ll distract them with a balloon animal disaster—pop three at once, chaos erupts fully,” he grins. “What about the little yappy dog by the gate?” she presses. “I’ll toss it a small stolen peanut butter cookie I stashed earlier,” he says. “Timing’s key—we quickly strike at cake cutting,” she says. “Agreed—cake brain makes everyone oblivious to missing boxes entirely,” he replies.
created_at"2026-02-20T19:58:39.348Z" 2/20/2026, 7:58:39 PM
model"bytedance-seed/seed-1.6-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency618
moderation_latency(null)
generation_time108116
tokens_prompt77
tokens_completion3365
native_tokens_prompt147
native_tokens_completion4569
native_tokens_completion_images(null)
native_tokens_reasoning4280
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00917475
router(null)
provider_responses
0
endpoint_id"d85c3862-4e0d-4f8d-955b-babc62f98124"
id"02177161751949282fba1344a9ed2fba97aa536d307b6cfb3bb99"
is_byokfalse
latency618
model_permaslug"bytedance-seed/seed-1.6-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771617519-rJ6X5s0goliaFzL4r502"
upstream_id"02177161751949282fba1344a9ed2fba97aa536d307b6cfb3bb99"
total_cost0.00917475
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
99.38% Matches word count n/a
neededCleanfalse
words205
82.86% Dialogue to Total Word Ratio
Ratio: 43.41%, Deviation: 6.59%
neededCleanfalse
wordsTotal205
wordsDialogue89
91.1163%