Run: 2026-03-11-01KKFD9J84BVBYMSXA5H2YM7BD

Run Details

Duration:
25.8s
Cost:
$0.002359
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
0.49% Matches word count (details)
0.05% Dialogue to Total Word Ratio (details)
0.2708% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The hydrangea bush dug into Bill’s ribs as they huddled behind a suburban split-level, tinny piñata music drifting through the open front window, a cluster of squealing kids echoing off the siding. “Quit squirming, you’re gonna blow our cover before the cake’s even cut,” Molly hissed, prodding his side with a glue stick she’d stuffed in her hoodie pocket. “The clown by the door is holding a balloon sword that’s pointed right at us. Also this bush has ticks. Why are we breaking into a first grader’s unicorn birthday party to steal presents, again?” Bill whispered, swatting at a stray leaf that landed on his nose. “Because her mom scored that limited-edition Blippi glitter studio that goes for $450 on Facebook Marketplace, and all those $25 Visa gift cards the great-aunts drag in? That’s our gas money for two whole months. We’re not monsters,” Molly said. “Stealing from a kid’s party is literally the monster origin story! What’s the escape plan, anyway? The golden retriever next door stared me down ten minutes ago, I swear it texted the neighborhood watch.” “Escape plan is we walk in as the last-minute face painters I booked. I printed Etsy badges. Laminated. Laminated badges fool every suburban mom within a 10-mile radius. Wait—did you just stuff a Sour Patch Kid in your jacket?” “Stress eating! This is a high-stakes heist!”
created_at"2026-03-11T21:36:53.009Z" 3/11/2026, 9:36:53 PM
model"bytedance-seed/seed-2.0-lite-20260309"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency771
moderation_latency(null)
generation_time25690
tokens_prompt77
tokens_completion1246
native_tokens_prompt115
native_tokens_completion1165
native_tokens_completion_images(null)
native_tokens_reasoning844
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.00235875
router(null)
provider_responses
0
endpoint_id"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id"021773265013462c5f50319cb4d4388d7836967b82aebe5bada2c"
is_byokfalse
latency771
model_permaslug"bytedance-seed/seed-2.0-lite-20260309"
provider_name"Seed"
status200
user_agent"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer(null)
api_type"completions"
id"gen-1773265013-pGA5n7ddf2zedg3EhYCa"
upstream_id"021773265013462c5f50319cb4d4388d7836967b82aebe5bada2c"
total_cost0.00235875
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
0.49% Matches word count n/a
neededCleanfalse
words227
0.05% Dialogue to Total Word Ratio
Ratio: 73.39%, Deviation: 16.61%
neededCleanfalse
wordsTotal233
wordsDialogue171
0.2708%