NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJV01S9EAZRG3GDP1R8HCKW6
Run Details
Model:
ByteDance Seed 2.0 Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
4m 25s
Cost:
$0.003947
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
20.22%
Dialogue to Total Word Ratio
(details)
59.4671%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Huddled behind an oak bush on a sunlit cul-de-sac, Molly and Bill eye the suburban home where a neon bounce house blares Baby Shark to a cluster of giggling toddlers. Bill squeezes a crumpled plastic clown nose, sweat beading on his forehead as he watches a ragtag piñata hang from the deck railing. The stack of wrapped birthday gifts perched by the back porch rail catches Molly’s eye, and she plucks a stick-on mustache from a discarded party goodie bag to press to her lip. “You promised easy work,” Bill grumbles, jerking his chin toward the backyard. “Last month’s dog walker gig? A golden retriever chased me all the way to the grocery store.” “This is simple,” Molly says, nodding at the gift pile. “We dress as clowns, grab all the presents, and leave. No one will spot us—they’re all shoveling cupcake frosting off their fingers.” Bill’s eyes dart to the harried birthday mom swigging from a plastic sangria pitcher. “What if she catches us? She’s already staring right at me.” “We say we’re the freelance face painters she booked,” Molly replies, tapping her fanny pack full of balloon animal wands. “She’ll buy it. I practiced a wonky kitty cat on my forearm this morning.”
created_at
"2026-03-03T23:20:37.947Z"
3/3/2026, 11:20:37 PM
model
"bytedance-seed/seed-2.0-mini-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
481
moderation_latency
(null)
generation_time
264733
tokens_prompt
77
tokens_completion
9088
native_tokens_prompt
101
native_tokens_completion
9841
native_tokens_completion_images
(null)
native_tokens_reasoning
9562
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0039465
router
(null)
provider_responses
0
endpoint_id
"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id
"021772580038078f6ba3ba09f8d739ff3017972d8947b2ea29e9f"
is_byok
false
latency
481
model_permaslug
"bytedance-seed/seed-2.0-mini-20260224"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1772580037-Ma83LmW5hG2vF8IIxRqw"
upstream_id
"021772580038078f6ba3ba09f8d739ff3017972d8947b2ea29e9f"
total_cost
0.0039465
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
206
20.22%
Dialogue to Total Word Ratio
Ratio: 38.76%, Deviation: 11.24%
neededClean
false
wordsTotal
209
wordsDialogue
81
59.4671%