Run: 2026-03-03-01KJTYD1MNT0W30KR2R5A7GG2X

Run Details

Duration:
5m 8s
Cost:
$0.004194
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
0.00% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
0.0000% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly shifted her weight, her black socks squelching in damp clover-scented mud as she peered warily over the front hedge. Tattered pink streamers dangled from the wooden backyard fence, and a half-eaten tray of dinosaur-shaped sugar cookies sat crusting over on the chipped wooden picnic table. The birthday party had wrapped an hour prior, but the sliding glass kitchen door was propped open a crack with a bright blue plastic watering can, revealing a teetering mountain of wrapped presents tucked beside the dishwasher. Bill slapped glow-in-the-dark star stickers over his scuffed white sneakers, trying to hide his tracks. “Relax,” Bill muttered. “We’ll grab three big unopened gifts—no one will notice a thing.” Molly hissed, jabbing a finger at the softly snoring golden retriever curled on the patio heater’s base. “You promised peanut butter crackers to distract him! I saw him steal a whole pizza slice earlier.” Bill fumbled in his canvas duffel, pulling out a crumpled granola bar bag. “The bodega only had these! The cashier asked if they were for my niece.” “Quit yakking,” Molly whispered, waving a crumpled napkin to shoo a small tabby cat from the gift pile. “Let’s dart in and bolt before anyone catches us.” Bill snatched his heist bag, quickly creeping toward the door. “This beats the scout troop cookie heist—we’ll get rich off stuffed toys.” Bill added, grinning loud enough to scare a sparrow from the fence. “They’ll never even see us coming.”
created_at"2026-03-03T22:51:49.795Z" 3/3/2026, 10:51:49 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1625
moderation_latency(null)
generation_time307444
tokens_prompt77
tokens_completion10796
native_tokens_prompt101
native_tokens_completion10459
native_tokens_completion_images(null)
native_tokens_reasoning10119
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0041937
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"021772578310250c2670fe791a5f29d61bdea8e0ada6c5de37578"
is_byokfalse
latency1625
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772578309-EsqPDvHn29PWopUE4ld4"
upstream_id"021772578310250c2670fe791a5f29d61bdea8e0ada6c5de37578"
total_cost0.0041937
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
0.00% Matches word count n/a
neededCleanfalse
words242
0.00% Dialogue to Total Word Ratio
Ratio: 29.84%, Deviation: 20.16%
neededCleanfalse
wordsTotal248
wordsDialogue74
0.0000%