Run: 2026-02-20-01KHY83GPXMQDGAFKRJY93MXTX

Run Details

Duration:
29.3s
Cost:
$0.002365
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
2.01% Matches word count (details)
56.88% Dialogue to Total Word Ratio (details)
29.4475% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The dented van idled across from the community center, its windows plastered with crumpled party flyers to blend in. Molly twisted a neon toy crowbar between her fingers—she’d swiped it from a dollar store to avoid scaring crying kids mid-heist—while squinting at the stream of balloon-toting toddlers filing inside. Bill leaned over the steering wheel, his notebook open to a scribbled map of the party’s gift stations: a stack by the cake table, a bin by the craft corner, and a hidden cache behind the piñata stand. He paused, then glanced at Molly. That piñata delay could mess our tight 3:15 gift-snatch window. Molly’s jaw tightened; she’d spent three nights practicing her “harried party parent” accent to bluff past the clown chaperone. Bill nudged her with a crumpled balloon animal he’d made earlier. Relax—we’ll distract the clown with a fake balloon emergency. She huffed, tucking the toy crowbar into her fanny pack and checking her watch—2:47. She scanned the back door, where a propped screen wiggled in the breeze, and adjusted her frilly party hat (swiped from a mailbox). Bill flipped his notebook to a list of fake kid names he’d memorized, muttering about avoiding the cookie break that would draw all adults to the cake table, leaving the gift bins unguarded for their quick snatch-and-run.
created_at"2026-02-20T19:23:24.776Z" 2/20/2026, 7:23:24 PM
model"bytedance-seed/seed-1.6-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1194
moderation_latency(null)
generation_time29266
tokens_prompt77
tokens_completion1111
native_tokens_prompt147
native_tokens_completion1164
native_tokens_completion_images(null)
native_tokens_reasoning856
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00236475
router(null)
provider_responses
0
endpoint_id"d85c3862-4e0d-4f8d-955b-babc62f98124"
id"021771615404922e8ad6f8fac80a9b78040141fa10ae51dce00db"
is_byokfalse
latency1194
model_permaslug"bytedance-seed/seed-1.6-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771615404-hgAET2qQ7xkYtl4A6Idk"
upstream_id"021771615404922e8ad6f8fac80a9b78040141fa10ae51dce00db"
total_cost0.00236475
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
2.01% Matches word count n/a
neededCleanfalse
words225
56.88% Dialogue to Total Word Ratio
Ratio: 1.33%, Deviation: 8.67%
neededCleanfalse
wordsTotal225
wordsDialogue3
29.4475%