NC Bench
Run: 2026-03-03-01KJTZW7W73Q07TAHRZHXES0TW
Run Details
Model:
ByteDance Seed 2.0 Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4m 35s
Cost:
$0.003879
Provider:
Seed
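The Duration shown above appears to match the generation_time value recorded in the response metadata further down (275299), assuming that field is in milliseconds and the page truncates to whole seconds. The same metadata also suggests that most completion tokens were spent on reasoning rather than on the visible scene, assuming native_tokens_reasoning is counted as part of native_tokens_completion. A minimal sketch of both calculations follows; the helper name format_duration is hypothetical and not part of NC Bench.

```python
# Values copied from the run metadata on this page.
generation_time_ms = 275299        # assumed to be milliseconds
native_tokens_completion = 9673
native_tokens_reasoning = 9041     # assumed to be a subset of the completion tokens


def format_duration(ms: int) -> str:
    """Format milliseconds as 'Xm Ys', truncating to whole seconds (hypothetical helper)."""
    total_seconds = ms // 1000
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


duration = format_duration(generation_time_ms)

# Share of native completion tokens attributed to reasoning, in percent.
reasoning_share = 100.0 * native_tokens_reasoning / native_tokens_completion

# Tokens left over for the visible response under the subset assumption.
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(duration)
print(f"{reasoning_share:.1f}% reasoning, {visible_tokens} visible tokens")
```

Under these assumptions, roughly 630 native tokens remain for the scene itself, which is plausible for a response of about 460 words.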
Evaluation summary
Score      Evaluator
0.00%      Matches word count
11.05%     Dialogue to Total Word Ratio
5.5230%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly propped her elbow on the dash of her dented 1998 minivan, staring through the windshield at the suburban backyard strung with tattered fairy lights. The birthday party was winding down, the last of the kids herded inside by frazzled parents, and wrapped presents stacked high on a weathered wooden picnic table. “Target’s clear. Mom just loaded the leftover pizza—we’ve 90 seconds before she comes back for her coat.” Bill slumped in the passenger seat, picking at the frayed edge of his ratty pirate costume, the plastic eye patch slipping off his forehead for the third time. “You said two minutes last time, and we almost got nailed by a dad swinging a piñata bat. What if a kindergartener sees us and screams?” Molly reached over, yanking the eye patch back into place with a sharp, irritated tug. “Calm down. The only kid outside is the birthday girl, snot-crying over a popped unicorn balloon. She won’t notice a thing. Grab this.” She handed him a beat-up canvas grocery bag and a crumpled cat-doodled napkin scavenged from the party’s craft table. “Tape this over your patch—no one’ll recognize you. Saw a guy do this at the grocery store last week.” Bill unfolded the napkin, squinting before taping it lopsidedly over his eye patch. “This napkin looks stupid. What if we get caught? Our heist resume’s just tailgate cooler theft, sidewalk chalk borrowing, and hiding from a Chihuahua for 45 minutes. This is the dumbest one yet.” “Our cooler heist made $42, which paid for your gas,” Molly retorted, grabbing her own canvas bag and hopping out of the van. “Stay close. We run to the table, grab as much as we can, bolt back to the van. No pocketing loot—especially that tiny Pokémon card I spotted tucked under the table edge.” Bill hesitated for half a second before jumping out, tripping over a curb and sending his bag flying open, a stuffed rabbit tumbling out before he snatched it back. “Whoops,” he mumbled, yanking the bag shut so fast the napkin slipped a little. 
They crept toward the backyard fence, Bill glancing over his shoulder every other step. A voice called from the back porch: “Lisa? Extra party napkins?” Molly pressed Bill down behind a thorny rose bush as the party mom’s sneakers scuffed the grass. When the footsteps faded, she whispered, “Now!” They darted across the lawn, Bill tripping over a popsicle-sticky sprinkler and fumbling to keep his bag from splitting open, a rubber duck tumbling out mid-run. Molly grabbed four wrapped boxes, her fingers brushing a sparkly unicorn backpack priced high enough to cover their last three heists combined. They skidded back behind the bush, panting, as the van’s passenger door squeaked open behind them.
created_at
"2026-03-03T23:17:36.276Z"
3/3/2026, 11:17:36 PM
model
"bytedance-seed/seed-2.0-mini-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
779
moderation_latency
(null)
generation_time
275299
tokens_prompt
77
tokens_completion
9917
native_tokens_prompt
101
native_tokens_completion
9673
native_tokens_completion_images
(null)
native_tokens_reasoning
9041
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0038793
router
(null)
provider_responses
0
endpoint_id
"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id
"021772579856388c95e3350f28d4e5f7ad155450ef541a342c73a"
is_byok
false
latency
779
model_permaslug
"bytedance-seed/seed-2.0-mini-20260224"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1772579856-5q0gq85q3ALFdAGZfmQN"
upstream_id
"021772579856388c95e3350f28d4e5f7ad155450ef541a342c73a"
total_cost
0.0038793
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
459
11.05%
Dialogue to Total Word Ratio
Ratio: 36.64%, Deviation: 6.64%
neededClean
false
wordsTotal
464
wordsDialogue
170
5.5230%
Overall
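The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from wordsDialogue and wordsTotal, assuming the deviation is the absolute difference from the 30% target in the scenario. The sketch below is illustrative only: the function names are hypothetical, and the page does not show how the deviation is mapped to the 11.05% score, so that step is not reproduced.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of total words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 30.0) -> float:
    """Absolute distance, in percentage points, from the target ratio (assumed definition)."""
    return abs(ratio_pct - target_pct)


# Values taken from the evaluator metadata on this page.
ratio = dialogue_ratio(words_dialogue=170, words_total=464)
deviation = deviation_from_target(ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

The Overall figure is consistent with a simple mean of the two evaluator scores (0.00% and about 11.05%), though the page does not state the aggregation rule.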