Run: 2026-02-20-01KHY93JABS9Y4EDCBB1K3P550

Run Details

Duration:
8.9s
Cost:
$0.000390
Provider:
Seed

Evaluation summary

Score Evaluator
Details Link
81.27% Matches word count (details)
26.24% Dialogue to Total Word Ratio (details)
53.7573% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly sat on the sticky park bench, popping a lollipop. Bill fidgeted, his tailored coat askew. “You sure about this? Balloons? A piñata shaped like a unicorn?” “Unicorns don’t guard anything,” she said. “The presents’ll be under that white tent—see the two goons? Guy with the mustache’s sipping a soda; the other’s napping on a cooler. He’ll miss the loot by 10 minutes if we work fast.” “What if the cake has candles?” “Candles are drama. The presents? Wrapped in sparkles. Silver tape. Worth every grain of that sugar he’s probably sneaking.” Bill groaned. “My suit. Powdered sugar’s a nightmare to remove.” “Then put on the lint-free gloves. And stop sweating—you’ll leave a wet spot. They’ll think we’re weirdos.” He sighed, grinning. “Lead the way, caper queen.” “Caper queen? I’m still the one picking the lock on the gift table.” “Don’t forget the confetti. It’ll get everywhere.” She stood, dusting off her pants. “Good. Confetti’s a distraction. Now come on—if we’re late, they’ll make the kids sing again. I’ve heard that three times this week.” She started walking, pausing to glance back. “You comin’, or you gonna let a room full of toddlers outbid us?” Bill stuffed his hands in his pockets, smiling. “Always follow the caper queen.”
created_at"2026-02-20T19:40:54.997Z" 2/20/2026, 7:40:54 PM
model"bytedance-seed/seed-1.6-flash-20250625"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency732
moderation_latency(null)
generation_time8619
tokens_prompt77
tokens_completion1342
native_tokens_prompt149
native_tokens_completion1264
native_tokens_completion_images(null)
native_tokens_reasoning936
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.000390375
router(null)
provider_responses
0
endpoint_id"e01a1227-001b-44d4-a376-90850668fa84"
id"021771616455467c1826595e581f7a440f247fe98485b57969dad"
is_byokfalse
latency643
model_permaslug"bytedance-seed/seed-1.6-flash-20250625"
provider_name"Seed"
status200
api_type"completions"
id"gen-1771616455-Eg1y02pjV8uX8bu254NC"
upstream_id"021771616455467c1826595e581f7a440f247fe98485b57969dad"
total_cost0.000390375
cache_discount(null)
upstream_inference_cost0
provider_name"Seed"

Evaluation details

Result Evaluator Details Meta Data
81.27% Matches word count n/a
neededCleanfalse
words212
26.24% Dialogue to Total Word Ratio
Ratio: 79.25%, Deviation: 10.75%
neededCleanfalse
wordsTotal212
wordsDialogue168
53.7573%