NC Bench
Run: 2026-03-11-01KKFDQH416Q4VSY6NY7QA5PR0
Run Details
Model:
ByteDance Seed 2.0 Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 32s
Cost:
$0.007825
Provider:
Seed
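The reported duration can be cross-checked against the `latency` and `generation_time` fields in this run's metadata (541 and 91605). The sketch below assumes both fields are in milliseconds and that the displayed duration is their sum truncated to whole seconds; the page does not confirm either assumption, although the result agrees with the 1m 32s shown above. The `format_duration` helper is hypothetical.

```python
# Values copied from this run's metadata; units are ASSUMED to be milliseconds.
latency_ms = 541
generation_time_ms = 91605
native_tokens_completion = 3898


def format_duration(total_ms: int) -> str:
    """Format milliseconds as 'Xm Ys', truncating to whole seconds (hypothetical helper)."""
    total_seconds = total_ms // 1000
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


# ASSUMPTION: displayed duration = latency + generation time.
total_ms = latency_ms + generation_time_ms
duration = format_duration(total_ms)

# Rough throughput over the generation window only.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(duration)                      # 1m 32s
print(f"{tokens_per_second:.2f}")    # 42.55
```

Most of the 3898 native completion tokens were reasoning tokens (3110), so the throughput figure describes total generation, not just the visible prose.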
Evaluation summary
Score
Evaluator
0.00%
Matches word count
31.18%
Dialogue to Total Word Ratio
15.5918%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The mid-July sun baked the back of Molly’s neck as she and Bill huddled behind an overgrown hydrangea bush two yards from the Wilsons’ backyard fence, the thrum of a kids’ birthday party loud enough to drown out their rustling hoodies. The air smelled like burnt hot dogs and synthetic cotton candy, thick enough to taste as they huddled, a stray bee buzzing around Bill’s ear that he swatted at uselessly. A neon Roblox bounce house trembled under a dozen stampeding kids, a half-broken Creeper piñata swung from the oak, and a stack of wrapped presents glowed under patio lights on the picnic table, just waiting. Bill tugged his fanny pack, bulging with what he’d called “mission critical gear,” and knocked a terracotta flowerpot off the neighboring empty house’s stoop, sending a clatter that made the Wilsons’ golden retriever bark once. Molly slapped a hand over his head, dragging him lower into the leaves, grass stains already blooming on her jeans. “If you blow this heist before we even climb the fence, I’m leaving you to sell your moldy 90s Beanie Baby collection out of a Walmart parking lot alone,” she hissed. “That was an accident! That pot was wobbly! And anyway, my gear’s solid, look.” Bill fumbled in his pack, pulling out a kid-sized T-Rex mask and a string of glow sticks. “We can blend in! No one’s gonna question two extra people at a kid’s party, c’mon.” “Blend in? You’re 42, your beard has more gray than my grandma’s wool blanket. The only way you’re blending in is as the tired dad who brought the wrong store-bought cake. You still think Fortnite is a type of margarita, for Christ’s sake.” A group of kids sprinted past the fence chasing a stray Mylar balloon, their screams making both freeze until they rounded the side of the house. Molly peeked over the hydrangea again, scanning the yard: the only parent near the present table was a mom on FaceTime, halfway across the yard arguing with a party supply store, her back fully turned. 
“Wait, what about the security camera by the back door? I spotted that when we cased the place yesterday,” Bill whispered. Molly snickered, pulling a crumpled empty Sour Patch Kids bag from her pocket. “Covered. I left a full bowl by the front porch an hour ago. The teen monitoring the cameras is shoveling candy so fast he didn’t even notice me walk up. The only thing he’s recording is his own soda burp.” Bill whistled low. “Damn, that’s way smarter than the jewelry store heist. Why didn’t we bait that security guard with candy?” “Because that guard was a 60-year-old retired cop who hates Sour Patch Kids, for one. And two, you tripped over your own feet climbing that jewelry store’s back wall and set off the alarm. This is low stakes: we climb the fence, stuff the biggest presents into the laundry basket I stashed, sneak out before cake. No one gets hurt, we flip that Lego Millennium Falcon and new Switch for enough to pay rent for two months.” Bill fumbled the T-Rex mask over his head, the eye holes lopsided. “Wait, can I still keep the mask on? I’ve always wanted to hit a piñata.” Molly groaned, shoving his shoulder hard enough to send him tumbling into the hydrangea. “If you wear that mask I’m leaving you here to work as the party’s $50 hired clown. Move.”
created_at
"2026-03-11T21:44:30.608Z"
3/11/2026, 9:44:30 PM
model
"bytedance-seed/seed-2.0-lite-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
541
moderation_latency
(null)
generation_time
91605
tokens_prompt
77
tokens_completion
4103
native_tokens_prompt
115
native_tokens_completion
3898
native_tokens_completion_images
(null)
native_tokens_reasoning
3110
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00782475
router
(null)
provider_responses
0
endpoint_id
"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id
"0217732654707450c64a0471125ddcff16fb53685966178d1012f"
is_byok
false
latency
541
model_permaslug
"bytedance-seed/seed-2.0-lite-20260309"
provider_name
"Seed"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773265470-tUedzu4HWZUAS0RYzGzH"
upstream_id
"0217732654707450c64a0471125ddcff16fb53685966178d1012f"
total_cost
0.00782475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
574
31.18%
Dialogue to Total Word Ratio
Ratio: 54.83%, Deviation: 4.83%
neededClean
false
wordsTotal
580
wordsDialogue
318
15.5918%
Overall
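The figures in the evaluation details can be reproduced from the reported counts. The sketch below recomputes the dialogue ratio and its deviation from the 50% target in the scenario, and checks that the final 15.5918% is consistent with a plain average of the two evaluator scores. The averaging rule is an assumption inferred from the numbers, not something the page states, and the function names are hypothetical. How the 31.18% score is derived from the 4.83% deviation is not shown in the source, so it is not reconstructed here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Percentage of all words that fall inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Absolute distance, in percentage points, from the requested ratio."""
    return abs(ratio_pct - target_pct)


def mean_score(scores: list[float]) -> float:
    """ASSUMED aggregation: unweighted mean of the evaluator scores."""
    return sum(scores) / len(scores)


# Counts and scores as reported in the evaluation details.
ratio = dialogue_ratio(words_dialogue=318, words_total=580)
deviation = deviation_from_target(ratio)
overall = mean_score([0.00, 31.18])

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")  # Ratio: 54.83%, Deviation: 4.83%
print(f"Overall (mean of evaluators): {overall:.2f}%")      # 15.59%
```

The mean of the rounded scores is 15.59%, which agrees with the reported 15.5918% up to the rounding of the 31.18% input. The two evaluators also count words differently (574 versus 580), so their totals should not be mixed when recomputing.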