NC Bench
Run: 2026-03-03-01KJTYJFC7Q20PVZ9G31PZHWBY
Run Details
Model: ByteDance Seed 2.0 Mini
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 24s
Cost: $0.001422
Provider: Seed
Evaluation summary
Matches word count: 100.00%
Dialogue to Total Word Ratio: 1.58%
Overall: 50.7916%
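The overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not state the aggregation rule, so treat this as an assumption inferred from the numbers; `overall_score` is a hypothetical helper name, not part of NC Bench.

```python
def overall_score(scores):
    """Unweighted mean of per-evaluator scores, in percent (assumed rule)."""
    return sum(scores) / len(scores)

# Scores as displayed on this page, rounded to two decimals.
displayed_scores = [100.00, 1.58]
overall = overall_score(displayed_scores)
print(f"{overall:.2f}%")
```

With the rounded inputs this gives about 50.79%. The page shows 50.7916%, which under the same assumption would correspond to an unrounded dialogue-ratio score of roughly 1.5832%.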
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled behind the overgrown boxwood hedge lining quiet Maplewood Drive, the sharp scent of overwatered grass and spilled fruit punch curling through the warm late-August air. Through the chain-link fence of the backyard, they could see the annual Preschool PTA party in full swing: a lopsided piñata dangling from the oak tree, its string of candy trailing halfway to the ground; a toddler face-down in a lime-green Jell-O salad, wailing as floss sugar stuck to their tear-streaked cheeks; and a stack of wrapped presents teetering by the back porch, swaddled in neon unicorn and dinosaur paper. Bill’s hoodie cuff was crusted with a half-crushed goldfish cracker, and he kept adjusting the wrinkly “Birthday Helper” vest they’d scavenged from the dollar store, the iron-on letters peeling into a lopsided mess that read “BIRHDAT HELPR.” “What if we get caught passing out a Princess Castle coloring set to a five-year-old?” Bill whispered, nodding at a group of giggling toddlers clustered around the piñata stick. “They’ll realize we’re not supposed to be here.” Molly nudged his elbow with her knee, her eyes fixed on the narrow gap between the porch railing and the hedge where they’d slip through once the host herded everyone inside for cake. “Relax. We’re wearing the vests. Last week I conned a neighborhood watch volunteer into letting me walk his dog by wearing a ‘Pet Wrangler’ vest with half the letters missing. They don’t look twice.” Bill fished a crumpled sticky note from his pocket, the list of party presents smudged with grape punch. “The host said the big ticket items are a remote-control dirt bike, a giant sloth stuffed animal, and a kit to make your own glitter slime. The slime kit’s gross, but the dirt bike would flip easy on Craigslist. 
What if the sloth won’t fit in our thrifted unicorn duffel?” “We’ll tie it to the roof,” Molly muttered, already creeping toward the fence line when the host’s voice boomed over a portable speaker: “All guests inside for the cake sing-along!” Bill yelped and tripped over a ceramic garden gnome, sending it clattering across the lawn. For a heartbeat, the entire party went quiet, the toddler in the Jell-O salad pausing mid-wail to stare. Molly clamped a hand over his mouth, her eyes wide, until the host waved it off, laughing, “Kids, don’t chase the garden gnights!” before turning to herd the group inside. Bill exhaled so hard his hoodie fluttered, wiping crumbs from his cheek. “I thought for sure we’d be doing time in the county youth detention center for stealing from preschoolers.” Molly snorted, pulling a pair of wire cutters from her frayed jacket pocket to snip through the fence’s bottom link. “Please. Stealing from preschoolers is the easiest heist there is. No security cameras, no alarm systems, just a bunch of sugar-crazed toddlers who’ll hand you a sticker if you ask nicely. Plus, we’re giving them a story to tell their moms. Way better than a boring bank robbery.”
created_at: "2026-03-03T22:54:49.659Z" (3/3/2026, 10:54:49 PM)
model: "bytedance-seed/seed-2.0-mini-20260224"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 330
moderation_latency: (null)
generation_time: 82068
tokens_prompt: 77
tokens_completion: 3756
native_tokens_prompt: 101
native_tokens_completion: 3531
native_tokens_completion_images: (null)
native_tokens_reasoning: 2834
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0014225
router: (null)
provider_responses:
  0:
    endpoint_id: "9265aa87-9657-4d40-b9d7-2a70318e57d9"
    id: "021772578489772b1e926b94f741badb049b6681cf829b88e303d"
    is_byok: false
    latency: 330
    model_permaslug: "bytedance-seed/seed-2.0-mini-20260224"
    provider_name: "Seed"
    status: 200
api_type: "completions"
id: "gen-1772578489-lFTL4dQ5c7mDZQ337g37"
upstream_id: "021772578489772b1e926b94f741badb049b6681cf829b88e303d"
total_cost: 0.0014225
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"
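A few derived figures make the metadata above easier to read. The sketch below rests on two assumptions that the page does not confirm: that `latency` and `generation_time` are in milliseconds (82068 ms is about 1m 22s, close to the listed 1m 24s duration), and that `native_tokens_reasoning` is counted inside `native_tokens_completion`. `derived_metrics` is a hypothetical helper.

```python
# Values copied from the metadata listing on this page.
meta = {
    "latency": 330,
    "generation_time": 82068,
    "native_tokens_prompt": 101,
    "native_tokens_completion": 3531,
    "native_tokens_reasoning": 2834,
}

def derived_metrics(m):
    """Derive throughput and reasoning share from raw generation metadata."""
    seconds = m["generation_time"] / 1000  # assumes milliseconds
    completion = m["native_tokens_completion"]
    reasoning = m["native_tokens_reasoning"]
    return {
        # Tokens left for the visible answer if reasoning is part of completion.
        "visible_completion_tokens": completion - reasoning,
        "reasoning_share": reasoning / completion,
        "tokens_per_second": completion / seconds,
    }

metrics = derived_metrics(meta)
for name, value in metrics.items():
    print(name, round(value, 3))
```

Under those assumptions, roughly four fifths of the completion tokens went to reasoning, leaving 697 tokens for the visible scene.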
Evaluation details
Matches word count
Result: 100.00%
Details: n/a
Meta Data: neededClean: false, words: 499
Dialogue to Total Word Ratio
Result: 1.58%
Details: Ratio: 39.11%, Deviation: 9.11%
Meta Data: neededClean: false, wordsTotal: 514, wordsDialogue: 201
Overall: 50.7916%
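The reported ratio and deviation follow directly from the word counts: 201 / 514 is about 39.11%, which is 9.11 percentage points above the 30% target. A minimal sketch of such a check is shown below. The quote-based counting rule, the whitespace tokenization, and the names `dialogue_ratio` and `deviation` are assumptions for illustration; the page does not describe how the actual evaluator counts words, and the two evaluators report different totals (499 and 514), so this sketch may not reproduce either figure exactly. How the 9.11-point deviation maps to the 1.58% score is also not stated and is not modeled here.

```python
import re

def dialogue_ratio(text):
    """Return (dialogue_words, total_words, ratio) using curly-quoted spans."""
    quoted_spans = re.findall(r"“([^”]*)”", text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    words_total = len(text.split())
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_dialogue, words_total, ratio

def deviation(ratio, target=0.30):
    """Absolute distance from the target dialogue share."""
    return abs(ratio - target)

# Reproduce the reported figures from the counts listed on this page.
reported_ratio = 201 / 514
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation(reported_ratio):.2%}")

# Small usage example on a hypothetical two-sentence sample.
sample = "“Relax,” Molly said. “We’re wearing the vests.”"
print(dialogue_ratio(sample))
```

A quote-based counter like this one also counts quoted text that is not speech, such as the “Birthday Helper” vest lettering in the response, so a real evaluator may need a stricter rule.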