Run: 2026-02-20-01KHYAS15PYXV16WYMN0DVZ40Y

Run Details

Duration: 1m 18s
Cost: $0.007033
Provider: Seed

Evaluation summary

Score     Evaluator
98.71%    Matches word count
39.85%    Dialogue to Total Word Ratio
69.2813%  Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly sucks a cherry lollipop pilfered from a stray bike basket, leaning against a splintered park bench opposite the balloon-draped community center. The children’s shrieks bubble through open windows, and she tracks a parent hauling a stack of wrapped presents inside. Bill fidgets with a crumpled princess party hat he grabbed earlier, his scuffed boots scuffing the dirt. He glances at his cracked pocket watch, jaw tight, as a kid chases a butterfly past their hideout. A gust of wind tugs at Molly’s hood, exposing a streak of neon pink hair she forgot to cover, quickly tucking it back. “You’re absolutely sure the cake’s distraction’s foolproof?” Bill mumbles. “Positive—my cousin’s the baker, and she’s lacing the sprinkles with extra sugar rush powder,” Molly says. “They’ll be too chaotic to notice presents vanishing?” Bill asks. “Exactly—parents’ll be herding tantrums, not counting gift boxes,” Molly replies. “What about the party clown?” Bill presses. “He’s my old partner—he’ll ‘accidentally’ knock over a balloon arch right when we sneak in,” Molly smirks. “We’re taking all of ’em? Even the stuffed bear ones?” Bill says. “Every last wrap—those designer dolls fetch a fortune online,” Molly says.
created_at: "2026-02-20T20:10:06.913Z" (2/20/2026, 8:10:06 PM)
model: "bytedance-seed/seed-1.6-20250625"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1091
moderation_latency: (null)
generation_time: 77873
tokens_prompt: 77
tokens_completion: 3031
native_tokens_prompt: 147
native_tokens_completion: 3498
native_tokens_completion_images: (null)
native_tokens_reasoning: 3214
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00703275
router: (null)
provider_responses:
  0:
    endpoint_id: "d85c3862-4e0d-4f8d-955b-babc62f98124"
    id: "021771618207046eed3fa2be8288e45c6ad198b6ab1561a5183ee"
    is_byok: false
    latency: 1091
    model_permaslug: "bytedance-seed/seed-1.6-20250625"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1771618206-J7ZffgKghMOtxhXL5Ej1"
upstream_id: "021771618207046eed3fa2be8288e45c6ad198b6ab1561a5183ee"
total_cost: 0.00703275
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"

Evaluation details

98.71% Matches word count
Details: n/a
Meta data: neededClean: false, words: 194

39.85% Dialogue to Total Word Ratio
Details: Ratio: 40.21%, Deviation: 9.79%
Meta data: neededClean: false, wordsTotal: 194, wordsDialogue: 78

69.2813% Overall
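
The dialogue figures above can be checked by hand. The sketch below is a minimal illustration, not the evaluator's actual code: the function name `dialogue_ratio` and the rule "dialogue is whatever sits between double quotes" are assumptions, and the report does not say how the 40.21% ratio is mapped to the 39.85% score, so that step is left out. The last lines only observe that the overall figure is close to the simple mean of the two rounded evaluator scores.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Count words inside double quotes versus all words.

    Hypothetical helper: assumes dialogue is delimited by curly or
    straight double quotes and that words are whitespace-separated.
    """
    total = len(text.split())
    spans = re.findall(r'[“"](.*?)[”"]', text, flags=re.DOTALL)
    dialogue = sum(len(span.split()) for span in spans)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


# Figures reported in the evaluation details of this run.
words_dialogue = 78
words_total = 194
target_ratio = 0.50  # the prompt asks for 50% dialogue

ratio = words_dialogue / words_total
deviation = abs(target_ratio - ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Mean of the two rounded evaluator scores; close to the reported overall.
mean_score = (98.71 + 39.85) / 2
print(f"Mean of evaluator scores: {mean_score:.2f}%")
```

Running the helper on the model response would give counts under the quoting assumption only; the evaluator may tokenize differently, so small differences from the reported 78 and 194 would not be surprising.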