Run: 2026-03-03-01KJTYR2AQZ9D2MR32K8BVSRWC

Run Details

Duration:
3m 59s
Cost:
$0.003523
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
51.93% Matches word count (details)
99.97% Dialogue to Total Word Ratio (details)
75.9489% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Dusk bleached pink over the suburban cul-de-sac, where a group of kids laughed inside a backyard bounded by a white picket fence, their voices carrying over a table piled high with wrapped birthday presents and a lopsided unicorn piñata. Molly and Bill huddled behind an overgrown rose bush, their backpacks slung over their shoulders stuffed with empty reusable gift bags they’d stolen from a dollar store the week prior. Bill tapped his crumpled birthday banner—mistaken for a lockpick tension wrench—against his thigh, bouncing on his heels. “Quit rushing, we’ve got five minutes before the parent chaperone heads to the bathroom,” Molly whispered, tapping a folded napkin map of the backyard she’d doodled that morning. “But the giant teddy bear is mine,” Bill mumbled, already creeping a hand toward the fence. Molly grabbed his hoodie collar, yanking him back into the shadows as a bumblebee buzzed past their ears. They’d spent two weeks plotting this heist, mapping parent shift times and noting which presents were the bulgiest, until Bill’s twitchy urge to grab every shiny object kept threatening to blow their entire plan.
created_at"2026-03-03T22:57:50.946Z" 3/3/2026, 10:57:50 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency3658
moderation_latency(null)
generation_time238862
tokens_prompt77
tokens_completion7823
native_tokens_prompt101
native_tokens_completion8783
native_tokens_completion_images(null)
native_tokens_reasoning8536
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0035233
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"02177257867122879d39dabdd16daed9e301f9238148b07a3bcb9"
is_byokfalse
latency3658
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772578670-fX7Rz4tbBNCZeHmlN0FS"
upstream_id"02177257867122879d39dabdd16daed9e301f9238148b07a3bcb9"
total_cost0.0035233
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
51.93% Matches word count n/a
neededCleanfalse
words184
99.97% Dialogue to Total Word Ratio
Ratio: 11.29%, Deviation: 1.29%
neededCleanfalse
wordsTotal186
wordsDialogue21
75.9489%