NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFCMWVDCD5SWZ9PXXB3THWY
Run Details
Model:
ByteDance Seed 2.0 Lite
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 12s
Cost:
$0.006157
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0001%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sun dipped low over the cul-de-sac, painting the Simpsons’ driveway bounce house neon pink, the tinny blare of Baby Shark seeping through the open kitchen window to sting Molly and Bill’s eardrums as they hunched behind an overgrown hydrangea bush. Crumbs of the convenience store burritos they’d split for lunch stuck to Molly’s canvas ski mask, Bill’s half-laced work boot kicked a loose rock into a sprinkler that sputtered cold water up his stolen pizza delivery jacket’s cuff. Molly’s fanny pack jingled with lockpicks she’d swiped from a hardware store the week prior, the only successful heist either had pulled off all year. A group of kids sprinted past the bush chasing a golden retriever, a fizzing sparkler narrowly missing Molly’s ponytail, and she tugged her ski mask up to snap at Bill. “Quit squirming, we got a 12 minute window before the mom brings out the unicorn cake and all the kids cluster around the dining room. That leaves the present pile by the front closet unguarded. Un. Guarded.” Bill wiped water from his cheek, tugging his own ski mask further down over his nose, his voice dropping to a panicked hiss. “Wait, unguarded? I thought we had to distract the teenaged TikTok-scrolling babysitter on the porch? What happened to that fake Amazon package plan with the whoopee cushion? You made me wrap that stupid thing for three hours.” “The babysitter bailed. Left 20 minutes ago for a Harry Styles concert movie, I watched her sprint to her Honda Civic with a glow stick around her neck. Only adults left in there is Mrs. Simpson and her sister who’s spent the last hour crying in the hot tub about her ex-husband. She can’t tell a balloon from a lawn gnome, let alone two professional thieves hauling a garbage bag full of Lego sets and LOL dolls.” “Professional? Molly, last month we tried to steal avocados from Trader Joe’s and you tripped over a toddler and set off the door alarm twice. We’re not professional, we’re the only two idiots dumb enough to heist a 7-year-old’s birthday. What if some kid bites me? I heard 7-year-olds bite like feral raccoons when their swag is on the line.” “Quit being dramatic, we’ll walk in like the out-of-town aunt and uncle, compliment the balloon arch, grab the presents, load ’em into our minivan, sell the whole lot on Facebook Marketplace for three months’ rent. No one’s gonna question random adults at a kids’ party, half the guests are people Mrs. Simpson invited ‘cause she felt bad for them. Wait—” Molly froze, squinting at the front door, where a man in a USPS uniform hauled a black garbage bag stuffed to bursting to his truck. Bill followed her gaze, then winced. “Is that the lonely mailman?”
created_at
"2026-03-11T21:25:35.737Z"
3/11/2026, 9:25:35 PM
model
"bytedance-seed/seed-2.0-lite-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
398
moderation_latency
(null)
generation_time
72295
tokens_prompt
77
tokens_completion
3332
native_tokens_prompt
115
native_tokens_completion
3064
native_tokens_completion_images
(null)
native_tokens_reasoning
2434
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00615675
router
(null)
provider_responses
0
endpoint_id
"bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
id
"0217732643358473c1f91f892e9aff2a902d86b047fe242b2b432"
is_byok
false
latency
398
model_permaslug
"bytedance-seed/seed-2.0-lite-20260309"
provider_name
"Seed"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773264335-ZbEkObEFrP5S6cjqTF02"
upstream_id
"0217732643358473c1f91f892e9aff2a902d86b047fe242b2b432"
total_cost
0.00615675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
464
0.00%
Dialogue to Total Word Ratio
Ratio: 66.17%, Deviation: 16.17%
neededClean
false
wordsTotal
470
wordsDialogue
311
0.0001%