NC Bench
Run: 2026-02-20-01KHY8P2MNW2D6AXQC3J4AAGBN
Run Details
Model: ByteDance Seed 1.6
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 1m 37s
Cost: $0.008911
Provider: Seed
Evaluation summary
Score | Evaluator
0.00% | Matches word count
5.13% | Dialogue to Total Word Ratio
2.5673% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The vinyl booth at Pete’s Diner stuck to Molly’s jeans as she spread a crumpled birthday party flyer across sticky Formica, lukewarm coffee sloshing in her chipped mug that read “World’s Okayest Thief.” Bill: “You’re dead serious about robbing a seven-year-old’s unicorn-themed bash? I thought we had our sights on the pawn shop safe that’s been unguarded after 9 p.m. all week long.” Molly tapped the flyer’s “No Expense Spared” banner, her neon-pink lockpick twirling between her fingers—she’d scouted the parents’ social media, saw the designer gifts piled in their sunroom that overlooked a quiet alley. Bill: “Designer gifts? For a kid who’ll chew on a diamond bracelet like a teething ring? And what about the security? Do they have a doorman or just a harried mom with a juice box stash and a phone glued to her hand?” She slid a crudely drawn blueprint of the community center party room across the table, circling a back window screened by overgrown rose bushes that hadn’t been trimmed in a year and were full of dead beetles. Molly: “No doorman—just a part-time clown and a bunch of parents too busy documenting cake smears to notice a window slider creaking open. I’ve got a fake balloon delivery uniform for you with a hidden pocket for your lockpick.” Bill fidgeted with his worn leather glove, his eyes darting to the diner’s door—he hated crowded, loud spaces, and kids’ parties were the worst kind of chaos that made his hands shake. Bill: “Fake balloon uniform? What if the clown calls me out? I can’t do a balloon animal to save my life—last time I tried, I made a lopsided snake that scared a toddler at the grocery store for hours.” Molly snickered, sipping her coffee—she’d already arranged for a fire alarm decoy, timed to go off right when the cake was cut, distracting everyone long enough to grab the gift table haul before the fire department arrived. Molly: “The clown’s on break at cake time—trust me. 
I’ve got an ice cream truck decoy parked around the corner for the escape; we’ll pile gifts into the freezers and drive off before anyone’s done crying about burnt sprinkles or lopsided balloon animals.” She checked her waterproof watch, noting the party’s 2 p.m. start time—they’d need to arrive an hour early to disable the window’s flimsy latch with her rust-proof lockpick set. Bill: “What about the crying kids? If one spots me, I’ll panic—last time I heard a toddler wail, I dropped my lockpick and ran from a convenience store heist mid-job, leaving my favorite glove behind.” They clinked their chipped mugs together, the diner’s jukebox blaring a tinny children’s song that made Bill’s skin crawl—he still had doubts, but Molly’s plan was tighter than her lockpick grip, and he’d learned to trust her wilder ideas after the laundromat heist paid off. Molly: “Stick these in. You won’t hear a single wail. We’ll be in and out in ten minutes, and that seven-year-old’ll be none the wiser—she’ll just think her gifts vanished into unicorn magic like her lost tooth did.”
created_at: "2026-02-20T19:33:32.959Z" (2/20/2026, 7:33:32 PM)
model: "bytedance-seed/seed-1.6-20250625"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 871
moderation_latency: (null)
generation_time: 97093
tokens_prompt: 77
tokens_completion: 4396
native_tokens_prompt: 147
native_tokens_completion: 4437
native_tokens_completion_images: (null)
native_tokens_reasoning: 3706
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00891075
router: (null)
provider_responses
0
endpoint_id: "d85c3862-4e0d-4f8d-955b-babc62f98124"
id: "021771616013413ceb2efb7fea58c55155fc9fb92515dcdb1f724"
is_byok: false
latency: 871
model_permaslug: "bytedance-seed/seed-1.6-20250625"
provider_name: "Seed"
status: 200
api_type: "completions"
id: "gen-1771616012-WBTnszJj7p0FyJj9ddZa"
upstream_id: "021771616013413ceb2efb7fea58c55155fc9fb92515dcdb1f724"
total_cost: 0.00891075
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"
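The headline figures in Run Details appear to follow from the raw metadata fields. The sketch below is an illustration only: it assumes generation_time is in milliseconds and that the displayed duration truncates to whole seconds, neither of which the page states. The helper name format_duration is hypothetical.

```python
# Values copied from the run metadata.
generation_time_ms = 97093        # assumed to be milliseconds
usage = 0.00891075                # cost in USD as reported
native_tokens_completion = 4437
native_tokens_reasoning = 3706


def format_duration(ms: int) -> str:
    """Render milliseconds as 'Xm Ys', dropping the fractional second."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes}m {seconds}s"


duration = format_duration(generation_time_ms)
cost = f"${usage:.6f}"
# Fraction of native completion tokens reported as reasoning tokens.
reasoning_share = native_tokens_reasoning / native_tokens_completion

print(duration)                   # 1m 37s
print(cost)                       # $0.008911
print(f"{reasoning_share:.1%}")   # 83.5%
```

Under these assumptions, roughly five in six native completion tokens were reasoning tokens rather than visible output, which may help explain the run time for a scene of about 500 words.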
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 532
5.13% | Dialogue to Total Word Ratio | Ratio: 57.71%, Deviation: 7.71% | neededClean: false, wordsTotal: 532, wordsDialogue: 307
2.5673% | Overall | |
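The Ratio and Deviation figures can be reproduced from the reported word counts. This is a minimal sketch, assuming the ratio is dialogue words divided by total words and the deviation is the absolute distance from the 50% target in the scenario; the function names are hypothetical, and the page does not say how the deviation is turned into the 5.13% score.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words, as a percentage of all words."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Absolute distance, in percentage points, from the target ratio."""
    return abs(ratio_pct - target_pct)


# Counts taken from the Meta Data column of the evaluation details.
ratio = dialogue_ratio(words_dialogue=307, words_total=532)
deviation = deviation_from_target(ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# Ratio: 57.71%, Deviation: 7.71%
```

The overall 2.5673% looks consistent with a plain average of the two evaluator scores before rounding (0.00% and about 5.13%), though that is an inference from the numbers rather than something the page documents.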