Run: 2026-03-03-01KJV02SFQ7M89BJPJ7BN8ZNVV

Run Details

Duration:
3m 54s
Cost:
$0.003612
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
95.99% Matches word count (details)
97.50% Dialogue to Total Word Ratio (details)
96.7413% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The dappled afternoon sun seeped through the chain-link fence, carrying the tinny honk of a party bear-clown horn and the high-pitched shriek of toddlers chasing a bubble machine. Bill hunched behind a hydrangea bush, clutching a crumpled laundry basket and sweating through his faded tee. “You sure this isn’t gonna backfire?” he whispered, nodding at the back porch where a stack of wrapped boxes peeked out from under a gingham sheet. Molly adjusted the fuzzy, mismatched kids’ gloves she’d pilfered from a dollar bin, her eyes locked on the sliding glass door. “Last week you tried to steal a lawn flamingo because you thought it ‘had spirit’ and tripped over a sprinkler halfway there. Cut the jitters.” Bill’s jaw tightened, kicking a plastic toy dinosaur across the grass. “This is legitimate! The flamingo had a *personality*. What’s the play again?” “I disabled the motion sensor by spray-painting its eye with chalk—don’t ask. Slip under the fence, grab the sheeted pile, dump it in the basket, and haul it to the minivan I hotwired. I’ll distract the clown.” A bubble popped between their sneakers, making Bill fumble the basket, its crumpled fabric spilling onto the grass. The bear-clown tilted its fuzzy head, and Molly checked her cracked phone.
created_at"2026-03-03T23:21:10.917Z" 3/3/2026, 11:21:10 PM
model"bytedance-seed/seed-2.0-mini-20260224"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1164
moderation_latency(null)
generation_time234117
tokens_prompt77
tokens_completion7364
native_tokens_prompt101
native_tokens_completion9005
native_tokens_completion_images(null)
native_tokens_reasoning8714
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0036121
router(null)
provider_responses
0
endpoint_id"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id"021772580071173e14146857e7ceaf6846d2c1b56e6ac776b13b0"
is_byokfalse
latency1164
model_permaslug"bytedance-seed/seed-2.0-mini-20260224"
provider_name"Seed"
status200
api_type"completions"
id"gen-1772580070-lSE4ZpkXugqRhIDbqT9V"
upstream_id"021772580071173e14146857e7ceaf6846d2c1b56e6ac776b13b0"
total_cost0.0036121
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
95.99% Matches word count n/a
neededCleanfalse
words208
97.50% Dialogue to Total Word Ratio
Ratio: 46.01%, Deviation: 3.99%
neededCleanfalse
wordsTotal213
wordsDialogue98
96.7413%