NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7PY6PBE9JETTPBHXTK61A
Run Details
Model:
ByteDance Seed 1.6
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
37.9s
Cost:
$0.003315
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
49.9993%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leans against a scuffed playground slide, crumpling a neon children’s party invite in her fist as Bill jostles a crinkly clown costume bag beside her. “You sure this party’s the one with the diamond-encrusted doll the pop star mom splurged on?” “Dead sure—all presents are loaded, even the hand-carved gold puzzle set.” “How we get past the gruff bouncer by the door?” “Distract him with that giant lollipop I swiped—he’s got a massive sweet tooth, per my latest intel.” “I’m not wearing that frilly clown assistant outfit you brought.” “You’ll be the balloon animal sidekick—just wave a sparkly wand, fake a high lisp; the kids won’t notice your flaws.” “I can’t twist a balloon dog without popping it into a mess.” “Fake it—wave the wand, let the kids imagine the rest.” “When’s our window to grab the presents?” “Cake time—everyone’s glued to the candle-blowing spectacle.” “What if a kid catches us and cries?” “Blow a giant bubble, hand ’em a dud balloon; they’ll forget instantly.” “Got a big enough bag for the doll and puzzle?” “Hidden in my clown pants—expanding mesh, holds three laundry baskets’ worth.” “What if we get caught?” “Blame the imaginary clown rival—kids’ll back our story.” “Fine, but if I have to fake one more balloon twist, I’m stealing your lollipop.” “Deal—meet me at the park entrance at 2.” “And I get first dibs on the doll’s sparkly hair clip.” “Done.”
created_at
"2026-02-20T19:16:32.611Z"
2/20/2026, 7:16:32 PM
model
"bytedance-seed/seed-1.6-20250625"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
948
moderation_latency
(null)
generation_time
37841
tokens_prompt
77
tokens_completion
1711
native_tokens_prompt
147
native_tokens_completion
1639
native_tokens_completion_images
(null)
native_tokens_reasoning
1280
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00331475
router
(null)
provider_responses
0
endpoint_id
"d85c3862-4e0d-4f8d-955b-babc62f98124"
id
"021771614992749e14146857e7ceaf6846d2c1b56e6ac77359d72"
is_byok
false
latency
947
model_permaslug
"bytedance-seed/seed-1.6-20250625"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1771614992-Te8aHEclgIjuard5fvLh"
upstream_id
"021771614992749e14146857e7ceaf6846d2c1b56e6ac77359d72"
total_cost
0.00331475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
245
100.00%
Dialogue to Total Word Ratio
Ratio: 89.39%, Deviation: 0.61%
neededClean
false
wordsTotal
245
wordsDialogue
219
49.9993%