NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFJBSJB52X7XM358MS1XCW5
Run Details
Model:
ByteDance Seed 2.0 Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 51s
Cost:
$0.009323
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
14.30%
Matches word count
(details)
10.34%
Dialogue to Total Word Ratio
(details)
12.3223%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The April sun glinted off the lime-green bouncy castle in the Culpeppers’ front yard, a rainbow unicorn piñata dangling from the oak, and Molly pressed her back into the scratchy azalea bush, the crinkle of a granola wrapper the only sound not drowned out by seven shrieking seven-year-olds chasing a golden retriever in a party hat. Bill, crammed next to her, wore a lopsided thrifted clown mask, eye holes too small so he craned to squint at the stack of presents teetering on the patio, wrapped in Spider-Man and Frozen paper. Piñata glitter blew into their bush, sticking to his frizzy clown wig. “Molls, you’re a genius. This is the easiest heist we’ve ever pulled. No lasers, no grumpy museum security, just a bunch of kids too busy shoving cake in each other’s eyes to notice us.” Bill’s whisper carried across the lawn, and Molly slapped a hand over his mouth, ducking as the birthday boy’s mom glanced up from her juice box station, wiping frosting off her jeans. When the woman turned back to break up a fight over a sparkly party horn, Molly yanked her hand away, hissing, “First of all, stop yelling like we’re robbing Fort Knox, we’re robbing a four-year-old’s birthday. Second of all, this is the dumbest idea I’ve ever agreed to, and I once let you talk me into stealing a lawn gnome that turned out to be a hidden cop camera. What are we even gonna do with 15 sets of Legos and a glittery American Girl doll? You can’t flip kids’ toys for more than $20 a pop, this isn’t worth the misdemeanor.” A toddler wandered within three feet of their bush, dropped his blue raspberry lollipop, and froze. Molly waved frantically, forcing a smile she hoped read “friendly babysitter” not “small-time criminal here to steal your birthday swag.” The toddler’s mom called him back, and Molly exhaled, elbowing Bill hard in the ribs when he reached for the abandoned candy. “Ow! That was a perfectly good lollipop. And for your information, this isn’t a flip. Little Jax at the community center foster program’s birthday is next week, he’s been begging for that Lego Star Wars set, and Mia wants that ballet American Girl. All these kids here got three times more toys than they’ll ever open, half’ll be at a garage sale by July. What’s the harm in redistributing the wealth?” Bill rubbed his side, yanking his mask off to pull two “World’s Best Grandma” tote bags from his backpack. Molly blinked, then fumbled for the extra tote she’d stuffed in her bag that morning. “You didn’t lead with we’re stealing for foster kids? I would’ve worn a cuter mask. I would’ve brought cookies for that poor mom refereeing cake fights for an hour. C’mon, if we run, we can grab a slice of that unicorn cake on the way out too. I saw extra sprinkles.” They crept from the bush, sneakers quiet on the clover, the piñata swinging behind them, and for the first time all day, Molly thought this absurd heist might actually be worth every second of the risk.
created_at
"2026-03-11T23:05:28.918Z"
3/11/2026, 11:05:28 PM
model
"bytedance-seed/seed-2.0-lite-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
380
moderation_latency
(null)
generation_time
110977
tokens_prompt
77
tokens_completion
4933
native_tokens_prompt
115
native_tokens_completion
4647
native_tokens_completion_images
(null)
native_tokens_reasoning
3945
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00932275
router
(null)
provider_responses
0
endpoint_id
"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id
"021773270329043f419ea1e39f9dd10a4a699a297e9c9a1d6beb4"
is_byok
false
latency
380
model_permaslug
"bytedance-seed/seed-2.0-lite-20260309"
provider_name
"Seed"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773270328-OjhSLyOB6tPCp9M4tkad"
upstream_id
"021773270329043f419ea1e39f9dd10a4a699a297e9c9a1d6beb4"
total_cost
0.00932275
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
14.30%
Matches word count
n/a
neededClean
false
words
521
10.34%
Dialogue to Total Word Ratio
Ratio: 56.74%, Deviation: 6.74%
neededClean
false
wordsTotal
527
wordsDialogue
299
12.3223%