NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFJP33YBVRXCAFQCE72AXTT
Run Details
Model:
ByteDance Seed 2.0 Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 32s
Cost:
$0.007933
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
49.42%
Dialogue to Total Word Ratio
(details)
24.7099%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
They crouched behind an overgrown rhododendron that reeked of wet dog and lawn fertilizer, 20 yards from a suburban ranch house’s wraparound porch, where a stack of crinkle-wrapped presents teetered under a plastic folding table. Their black ski masks clung to sun-warmed foreheads, streaked with sweat; Bill had strapped a pair of iridescent child-sized butterfly wings to his tactical vest that morning, convinced they’d blend with the party’s garden theme, and Molly had only stopped cackling long enough to snatch a crumpled birthday clown lanyard from a gas station rack, figuring it was the only cover story that wouldn’t get them tackled by a stay-at-home dad with a pellet gun. A shriek echoed from the backyard, followed by the thwack of a piñata spilling Skittles across the grass, and a golden retriever bolted past them, a lollipop stuck to its floppy ear. “Remind me again why we didn’t just hit the CVS down the street? They have a whole rack of unopened AirPods that’d be way less work than chasing a pile of Paw Patrol toy trucks,” Molly muttered, prodding Bill’s side with a flathead screwdriver pulled from her hoodie pocket. Bill fumbled in his neon fanny pack, squeezing a squishy rubber duck that blared a loud quack, earning them a wide-eyed stare from a woman walking a poodle three houses down. “Because this is a real score, Moll! All those parents dropped hundreds on that junk, brand new, no serial numbers, easy to flip. Cake’s coming out in ten minutes, everyone’ll crowd the picnic table, we’ll sneak up, grab the whole stack, and bolt. What could go wrong?” Molly nodded at the yard, where the birthday kid’s 6’4” dad, whose bicep beer-can tattoo peeked out from his “World’s Best Dad” tank, leaned against the grill scanning the perimeter like he’d already fended off four groups of teen cupcake thieves. The hired party clown was passed out on a lawn chair, a root beer can tipping slowly out of his limp hand. “What could go wrong? Bouncer dad over there? The sugar-crazed 7-year-olds that can smell a stranger faster than a shark smells blood? Your butterfly wings are so bright they’re reflecting the sun into the living room. I’m not getting arrested because you wanted to play fairy godmother to a bunch of resold Legos.” Bill deflated, wings drooping against his back, and squeezed the duck again, a tiny squeak that sent a sparrow exploding from the bush above them. Their last heist—boosting a truckload of unopened Girl Scout cookies—had gone sideways when they’d accidentally donated half the boxes to a church bake sale before they could flip them. This job was supposed to be their comeback. Molly sighed, tugging the clown mask over her head, plastic eye holes digging into her temples. “Fine. But if that golden retriever chases us, I’m leaving you to hold his leash. And if you quack that duck one more time, I’m hiding all your Thin Mints. Got it?” Across the yard, the birthday mom yelled that cake was ready, and a stampede of tiny sneakers thundered toward the picnic table, leaving the presents momentarily unguarded. Bill grinned, hefting a canvas tote over his shoulder, wings flapping as he readied to slip out of the bush.
created_at
"2026-03-11T23:11:06.379Z"
3/11/2026, 11:11:06 PM
model
"bytedance-seed/seed-2.0-lite-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
653
moderation_latency
(null)
generation_time
92404
tokens_prompt
77
tokens_completion
4278
native_tokens_prompt
115
native_tokens_completion
3952
native_tokens_completion_images
(null)
native_tokens_reasoning
3210
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00793275
router
(null)
provider_responses
0
endpoint_id
"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id
"021773270666497373615d827d98655cdf28f8c9eebc817dcf55f"
is_byok
false
latency
653
model_permaslug
"bytedance-seed/seed-2.0-lite-20260309"
provider_name
"Seed"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773270666-N4eCWIueCHZCp6RZwIef"
upstream_id
"021773270666497373615d827d98655cdf28f8c9eebc817dcf55f"
total_cost
0.00793275
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
544
49.42%
Dialogue to Total Word Ratio
Ratio: 33.75%, Deviation: 3.75%
neededClean
false
wordsTotal
554
wordsDialogue
187
24.7099%