NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTZBSWT782HNC8DCQW4A8QM
Run Details
Model:
ByteDance Seed 2.0 Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 20s
Cost:
$0.002199
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
37.47%
Dialogue to Total Word Ratio
(details)
18.7354%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The beat-up 1998 Honda Civic idled across the street from Timmy Jenkins’ backyard, where a handwritten “BIRTHDAY BOY!” banner fluttered above a half-filled kiddie pool of blue plastic balls. Molly leaned across the front seat, crumpling a party flyer between her calloused fingers, while Bill fumbled with a cheap pink princess tiara perched crookedly on his buzzcut. The smell of burnt hot dogs wafted over the fence, followed by the high-pitched squeal of a toddler chasing a bubble wand. “Marked target window’s 2:15 to 2:20,” Molly said, nodding at the scrap of paper. “That’s when Bubbles the clown’s tied up making balloon dinosaurs, the birthday mom’s manning the piñata, and all the top presents — Lego sets, stuffed bears, that remote-control truck Timmy’s been begging for — are stacked in the wicker basket by the playhouse. No lock, just left out like a free lunch for thieves.” “Wait, the dog?” Bill yelped, adjusting his lopsided flannel shirt. “Mrs. Jenkins said her golden retriever, Max, is stuck in the garage behind a baby gate. I saw him through the fence earlier, he’s got a bite mark on his ear and looks like he’d chew my shoelaces off before I could grab a single present.” Molly snorts, gesturing to the glove compartment. “Peanut butter crackers under the seat. Toss one over the fence when we move in, he’ll be too busy licking crumbs off the grass to bark. I tested the tactic on a stray behind the grocery store last week; he didn’t so much as growl after I gave him a cracker.” “What if Bubbles the clown sees us?” Bill asked, tugging the tiara straight. “Last time I tried to sneak a free bagel, a chicken mascot chased me for three blocks. This clown’s gotta have worse stamina, right?” Molly rolled her eyes, flipping open a tattered spiral notebook. “Bubbles’ shift ends at 2:20, he’s got a bowling league meeting down the road. He’ll be packing his balloon kit the second we haul the basket to the van. Besides, you’re wearing the tiara. He’ll think you’re a last-minute face painter or a den mother.” “This tiara is chafing my temple!” Bill complained, prodding the plastic band. “And I don’t do face paint. Last time I tried to paint a cat for my niece, I turned her into a purple blob that cried for two hours and got my birthday cards boycotted by her entire family.” Molly passed him a pair of frayed plastic work gloves. “Wear the gloves, don’t touch anything sticky. Grab the sparkly stuffed unicorn from the trunk, too. If any tiny tots spot us, hurl it into the bushes. They’ll forget about the heist five seconds later chasing a rolling unicorn.” Bill fumbled the gloves, dropping one under the seat. “What if we get caught? What if the police show up? What if Timmy’s dad beats us with the piñata stick?” Molly checked her watch. “The piñata bursts at 2:17, right when we grab the basket. The noise will cover us hauling it to the van. If anyone asks, we’re here for the free cake samples. Now stop complaining and let’s move.” Bill adjusted the tiara again, his voice a whine. “Can I just grab one tiny cupcake? For the hassle of wearing this stupid crown?” Molly scoffed, reaching for the van door. “No. Rookie move. Now let’s go. And don’t trip over the kiddie pool.”
created_at
"2026-03-03T23:08:37.669Z"
3/3/2026, 11:08:37 PM
model
"bytedance-seed/seed-2.0-mini-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
597
moderation_latency
(null)
generation_time
140194
tokens_prompt
77
tokens_completion
5700
native_tokens_prompt
101
native_tokens_completion
5473
native_tokens_completion_images
(null)
native_tokens_reasoning
4662
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0021993
router
(null)
provider_responses
0
endpoint_id
"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id
"0217725793177774d21f3e3e165e25fcf12f86815b4407fe1bb37"
is_byok
false
latency
597
model_permaslug
"bytedance-seed/seed-2.0-mini-20260224"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1772579317-5amegvaeD4IgjyDWlW1I"
upstream_id
"0217725793177774d21f3e3e165e25fcf12f86815b4407fe1bb37"
total_cost
0.0021993
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
566
37.47%
Dialogue to Total Word Ratio
Ratio: 74.43%, Deviation: 4.43%
neededClean
false
wordsTotal
571
wordsDialogue
425
18.7354%