Run: 2026-02-20-01KHY84KJ29HHVA8Q1E46FY13R

Run Details

Duration:
9.5s
Cost:
$0.000316
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
100.00% Matches word count (details)
99.61% Dialogue to Total Word Ratio (details)
99.8051% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The bouncy castle deflated into a sad lump behind them, strings of "Happy Birthday" balloons still flapping weakly in the breeze. Molly adjusted the fold of her hoodie, the lockpick in her palm biting into her skin. "Easier than robbing a bank," she said, nodding at the mansion up the drive. Bill popped a lollipop from his pocket, the wrapper crinkling. "Yeah, 'cause kids don't call the cops. They just cry and dump cake on your shoes." "Cops are boring," Molly said, grinning. "Presents? Gold, silver, toys—tonight's haul's gonna pay for that flat tire. And the lollipop's for later." Bill squinted at the driveway, where a table groaned under a pyramid of wrapping paper. "What about the piñata? You sure we don't need to crack that first?" "Piñatas are just a distraction," she said, patting her shoulder holster. "Last party I hit, the kid cried over a broken candy cane for three hours. Let 'em. We take the big boxes." He nodded, popping the lollipop. "Bet the star of the show's got a PS5 in there." "Bet you're right," she said, already moving toward the hedge. Moonlight glinted off her lockpick. "C'mon. Let's steal some joy... and a console."
created_at"2026-02-20T19:24:00.461Z" 2/20/2026, 7:24:00 PM
model"bytedance-seed/seed-1.6-flash-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1264
moderation_latency(null)
generation_time8976
tokens_prompt77
tokens_completion1014
native_tokens_prompt149
native_tokens_completion1017
native_tokens_completion_images(null)
native_tokens_reasoning718
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.000316275
router(null)
provider_responses
0
endpoint_id"e01a1227-001b-44d4-a376-90850668fa84"
id"021771615441376e1f2c28e220bf76d8a56e2a46eaa08e91910e8"
is_byokfalse
latency1060
model_permaslug"bytedance-seed/seed-1.6-flash-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771615440-Pj83MxtNBoujjgM6eZXm"
upstream_id"021771615441376e1f2c28e220bf76d8a56e2a46eaa08e91910e8"
total_cost0.000316275
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
100.00% Matches word count n/a
neededCleanfalse
words200
99.61% Dialogue to Total Word Ratio
Ratio: 52.50%, Deviation: 2.50%
neededCleanfalse
wordsTotal200
wordsDialogue105
99.8051%