NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTZ9KK59WKW1JGAVFSFN1RE
Run Details
Model:
ByteDance Seed 2.0 Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 45s
Cost:
$0.002517
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.92%
Dialogue to Total Word Ratio
(details)
49.9600%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Crouched behind an overgrown rose bush at the edge of the Miller backyard, Molly plucked a crumpled cupcake wrapper from Bill’s flannel sleeve as a high-pitched shriek of piñata-induced glee washed over the wooden fence. A neon pink party crown, swiped from a favor bin ten minutes prior, slipped down over one of Bill’s eyes, and he clutched a reusable gift bag so tight his knuckles blanched. “Molls, have you seen a toddler hauling a present bigger than their own head? What if we waste the op chasing cheap toys instead of big-ticket stuff?” Bill whispered, voice squeaking like a party horn. “Relax, I mapped the patio gift stack last night when you bailed on stakeout for corn dogs. All velvet-bow wrapped gifts? That’s the good stuff. Moms are herding kids for cake right now, no one’s watching the loot.” “Stakeout! You sat in your car staring through their kitchen window for an hour while I waited at the gas station eating a slushie that gave me brain freeze? Last week I got banned from the community center for swiping bouncy balls—you want me kicked out of a kids’ party?” “Wait, what if the birthday kid wails when we take his presents? His mom’ll scream, and the clown’ll unicycle right into me!” Bill yelped, yanking the crown back into place. “Stick to the script, none of that happens,” Molly sighed, tapping the fake glitter bomb tucked in her hoodie pocket. “I’ll wave it over by the grill, Misty the clown will do her unicycle balloon routine. Everyone’ll look, including the birthday boy—he nailed the neighbor’s cat with a piñata stick last month.” “Wait, the clown’s a real paid employee? I thought that was just Linda smearing face paint! What if she recognizes me? I knocked over her face paint table at the block party last month!” “You won’t get recognized, you wore a different hoodie then,” Molly corrected, nudging him with her elbow. “Dart the second I wave the bomb, grab all the velvet-bow gifts, stuff ‘em in the bag, haul ass to the van. No cotton candy, no sticker favors, no petting the birthday dog.” “You knew I wanted blue raspberry cotton candy!” Bill protested, ears pink under the crown. “Forget the cotton candy. Plan’s non-negotiable.” Molly primed the glitter bomb’s plastic pin. “3… 2… 1—”
created_at
"2026-03-03T23:07:25.682Z"
3/3/2026, 11:07:25 PM
model
"bytedance-seed/seed-2.0-mini-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4728
moderation_latency
(null)
generation_time
164634
tokens_prompt
77
tokens_completion
6437
native_tokens_prompt
101
native_tokens_completion
6267
native_tokens_completion_images
(null)
native_tokens_reasoning
5721
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0025169
router
(null)
provider_responses
0
endpoint_id
"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id
"021772579246096db3bb9a137825ab7489fb7012a5d616fd8d020"
is_byok
false
latency
4728
model_permaslug
"bytedance-seed/seed-2.0-mini-20260224"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1772579245-sgJwdwFGIxsG00eAHSeY"
upstream_id
"021772579246096db3bb9a137825ab7489fb7012a5d616fd8d020"
total_cost
0.0025169
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
389
99.92%
Dialogue to Total Word Ratio
Ratio: 70.13%, Deviation: 0.13%
neededClean
false
wordsTotal
395
wordsDialogue
277
49.9600%