NC Bench
Run: 2026-03-11-01KKFD1NE2021WWFHRNQZ5BV0Z
Run Details
Model: ByteDance Seed 2.0 Lite
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 47.6s
Cost: $0.003787
Provider: Seed
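The duration above can be related to the token counts reported in the run metadata further down this page (`native_tokens_completion` 1879, `native_tokens_reasoning` 1541). The following is a minimal sketch of that arithmetic, not part of NC Bench itself. It assumes that `native_tokens_completion` includes the reasoning tokens, which the page does not state, and the function names are illustrative.

```python
# Values copied from this page; the interpretation of the token fields is an assumption.
DURATION_S = 47.6                 # "Duration" in Run Details
NATIVE_COMPLETION_TOKENS = 1879   # native_tokens_completion in the metadata
NATIVE_REASONING_TOKENS = 1541    # native_tokens_reasoning in the metadata


def tokens_per_second(tokens: int, duration_s: float) -> float:
    """Average output rate over the whole run."""
    return tokens / duration_s


def share_pct(part: int, whole: int) -> float:
    """Percentage of `whole` accounted for by `part`."""
    return 100 * part / whole


rate = tokens_per_second(NATIVE_COMPLETION_TOKENS, DURATION_S)
reasoning_pct = share_pct(NATIVE_REASONING_TOKENS, NATIVE_COMPLETION_TOKENS)
# Assumes reasoning tokens are counted inside the completion total.
visible_tokens = NATIVE_COMPLETION_TOKENS - NATIVE_REASONING_TOKENS

print(f"{rate:.1f} tokens/s")
print(f"{reasoning_pct:.1f}% of completion tokens spent on reasoning")
print(f"{visible_tokens} tokens left for the visible response")
```

Under that assumption, most of the completion budget went to reasoning rather than to the roughly 200-word scene itself.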
Evaluation summary
Matches word count: 2.01%
Dialogue to Total Word Ratio: 97.04%
Overall: 49.5262%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
They huddle behind a thorny rose bush at the edge of a sun-dappled suburban driveway, a unicorn piñata and rainbow streamers glowing 50 yards away from the 7th birthday party they’re casing. “Quit squishing my Spider-Man face paint, Bill! I spent 45 minutes smudging it just right so the mom thinks I’m a chaotic 7-year-old, not a 38-year-old who once stole an entire jar of school fundraising cookie dough.” “Your face paint’s fine, I’m squirming ‘cause this rose bush just stabbed through my work boot! What’s the play? We can’t just waltz in and stuff a pillowcase with LOL dolls. There’s a real clown twisting balloon poodles out there. I hate balloon poodles.” “Relax, the clown’s my cousin, he owes me for bailing him out of that pickle jar incident last summer. He’ll herd all the kids into a limbo line in 10 minutes, that’s our window. And if you sneak fruit snacks into your duffel again I’ll leave you here to explain the power drill in your ‘trick-or-treat bag’ to a room of 8-year-olds.” “Fruit snacks are high-value! That drill’s for the locked present chest, anyway! Wait, you said that chest was push-button! I didn’t bring a combo-cracking tool!” “Calm down, I paid a first-grader at the party with temporary tattoos to crack it. We’re in and out before cake, no one’s the wiser.”
created_at: "2026-03-11T21:32:34.128Z" (3/11/2026, 9:32:34 PM)
model: "bytedance-seed/seed-2.0-lite-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1107
moderation_latency: (null)
generation_time: 47494
tokens_prompt: 77
tokens_completion: 1889
native_tokens_prompt: 115
native_tokens_completion: 1879
native_tokens_completion_images: (null)
native_tokens_reasoning: 1541
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00378675
router: (null)
provider_responses:
  0:
    endpoint_id: "bf1a3473-7a0a-46d3-aaa5-adb8f876a697"
    id: "021773264754663d4c69f90bdf814940b1abafd46c62ec664cdd0"
    is_byok: false
    latency: 1107
    model_permaslug: "bytedance-seed/seed-2.0-lite-20260309"
    provider_name: "Seed"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773264754-t0I8e56lF2JFi7xY12Gy"
upstream_id: "021773264754663d4c69f90bdf814940b1abafd46c62ec664cdd0"
total_cost: 0.00378675
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Seed"
Evaluation details
Matches word count
Result: 2.01%
Details: n/a
Meta Data: neededClean: false, words: 225
Dialogue to Total Word Ratio
Result: 97.04%
Details: Ratio: 85.84%, Deviation: 4.16%
Meta Data: neededClean: false, wordsTotal: 233, wordsDialogue: 200
Overall: 49.5262%
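The reported ratio and deviation can be reproduced from the `wordsDialogue` and `wordsTotal` counts. The sketch below is illustrative arithmetic only, not the evaluator's actual code. It assumes the deviation is the absolute gap, in percentage points, between the measured ratio and the 90% target from the scenario. It also checks a guess that the overall score is the unweighted mean of the two evaluator scores. The page does not state how the 97.04% and 2.01% scores themselves are derived, so those are taken as given.

```python
TARGET_RATIO_PCT = 90.0  # from the scenario: "Write 200 words with 90% dialogue"


def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of all words that fall inside dialogue, as a percentage."""
    return 100 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float) -> float:
    """Absolute gap between measured and target ratio (assumed definition)."""
    return abs(target_pct - ratio_pct)


# Counts copied from the evaluation details above.
ratio = dialogue_ratio_pct(words_dialogue=200, words_total=233)
deviation = deviation_pct(ratio, TARGET_RATIO_PCT)

# Hypothesis: the overall score is the plain mean of the two evaluator scores.
overall_guess = (2.01 + 97.04) / 2

print(f"ratio     = {ratio:.2f}%")
print(f"deviation = {deviation:.2f}%")
print(f"overall   = {overall_guess:.3f}% (page reports 49.5262%)")
```

The ratio and deviation round to the 85.84% and 4.16% shown on the page. The mean of the rounded scores comes out at 49.525%, which is consistent with the reported 49.5262% if the page averages unrounded values, but that remains an inference.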