NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTZMW9VN3QBYW2Q6XN4XRKP
Run Details
Model:
ByteDance Seed 2.0 Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
2m 38s
Cost:
$0.002677
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.54%
Dialogue to Total Word Ratio
(details)
49.7678%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunkered behind an overgrown azalea bush, cotton candy tang clinging to her scuffed jeans as she squinted past a pastel bounce house and a face painter dabbing glitter on a toddler’s cheek. A mountain of wrapped presents sat piled beside a backyard grill, tied with neon ribbons glinting in the afternoon sun. Bill fumbled with disposable plastic gloves, dropping a roll of duct tape twice between his scuffed work boots, eyes darting at every screaming kid zooming past. “Quit making that racket,” Molly hissed, nudging his elbow. “The birthday mom’s too busy snapping bounce house photos to notice us.” Bill snatched the tape, ears flushing bright pink. “This is insane! Last week you made me steal lollipops from a grocery store checkout for ‘startup capital’—now we’re robbing a 7-year-old’s birthday party?” “These aren’t cheap coloring books and stuffed animals,” Molly whispered, nodding at the tallest stacked boxes. “There’s a Lego Jurassic set, three gift cards, and a tablet in that big blue wrapper. No one’ll miss a few when everyone’s screaming over the piñata.” “What if there’s a security guard?” Bill squeaked, darting his gaze to the backyard fence. “Those ‘security cameras’ are just fake solar lights I grabbed from the dollar store last night,” Molly said, patting her fanny pack full of lock-picks. “When the birthday song starts, that’s our window. Grab the top two stacks and bolt to the alley behind the laundromat. And don’t stop to pet the party golden retriever, okay? Last time that cost us our last heist.” Bill adjusted his faded baseball cap, swallowing hard. “Right. Birthday song. Got it.”
created_at
"2026-03-03T23:13:35.047Z"
3/3/2026, 11:13:35 PM
model
"bytedance-seed/seed-2.0-mini-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1148
moderation_latency
(null)
generation_time
157889
tokens_prompt
77
tokens_completion
6820
native_tokens_prompt
101
native_tokens_completion
6667
native_tokens_completion_images
(null)
native_tokens_reasoning
6292
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0026769
router
(null)
provider_responses
0
endpoint_id
"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id
"02177257961515934865fa4f4ec67e4268045741fa8093fc1cb53"
is_byok
false
latency
1148
model_permaslug
"bytedance-seed/seed-2.0-mini-20260224"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1772579615-bqtvDt8Ygjqu6dUndyPW"
upstream_id
"02177257961515934865fa4f4ec67e4268045741fa8093fc1cb53"
total_cost
0.0026769
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
267
99.54%
Dialogue to Total Word Ratio
Ratio: 52.61%, Deviation: 2.61%
neededClean
false
wordsTotal
268
wordsDialogue
141
49.7678%