NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-03-01KJTYKKRN3YKDYBZ0KKFCNE84
Run Details
Model:
ByteDance Seed 2.0 Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4m 20s
Cost:
$0.003798
Provider:
Seed
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
89.36%
Dialogue to Total Word Ratio
(details)
44.6799%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The mid-June twilight settled over the quiet cul-de-sac, where the McNallys’ backyard party glowed with pink string lights and echoed with delighted toddler screams. Molly and Bill huddled ten feet back in an overgrown privet hedge, Molly wearing a frayed pink princess sash and a sparkly fake tiara, Bill clutching a roll of duct tape and a lopsided fake birthday hat. “Molls, are you sure this is a good idea?” Bill whispered, voice cracking. “Last time we borrowed a bake sale cookie tin, we got chased by a scruffy terrier.” “Relax. We’re here for the presents, not cookies,” Molly said, twisting a bobby pin in her fingers. “The mom’s too busy mopping icing to notice two volunteer gift movers. I picked the back gate lock five minutes ago—it’s flimsy as a kid’s excuse for forgetting homework.” “You promised cake,” Bill grumbled, patting his empty stomach. “My growl was so loud a kid heard it through the sliding glass.” “After we grab the best presents, we’ll split the cake and piñata candy,” Molly promised, yanking him to his scuffed sneakers. “We’re Lila and Jake, here to deliver gifts to the children’s hospital down the road. No stealing, just donations. Got it?” “Got it,” Bill mumbled, fumbling his tape roll. “Wait, what if we get caught? The cops have our prints from the garden fountain heist last month!” A piñata thudded open inside the backyard, sending a shower of colorful candy spilling across the grass. “Can we grab a sour cherry lollipop first? Please?” Bill begged, nodding toward the patio table stacked with party treats. “Bill, the neighbor left a $250 Lego art set on the back porch ten minutes ago,” Molly hissed, yanking open the creaky garden gate. “That’s all that matters. Stop whining and move.” They crept toward the stack of brightly wrapped boxes piled tight against the porch rail, Bill fumbling two smaller glitter-covered packages in his shaking hands. The gate squeaked loud enough to make them both flinch, and a warm voice called from the kitchen sliding glass door: “Hello? Is someone out there helping with the party?” Molly waved wildly toward the glass, yelling, “Just picking up leftover streamers!” She hauled Bill toward the heaviest wrapped gift first, and they darted back toward the hedge, their arms stuffed tight with stolen presents, Bill fumbling one glittery package until he clutched it tight.
created_at
"2026-03-03T22:55:24.961Z"
3/3/2026, 10:55:24 PM
model
"bytedance-seed/seed-2.0-mini-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
837
moderation_latency
(null)
generation_time
260360
tokens_prompt
77
tokens_completion
9768
native_tokens_prompt
101
native_tokens_completion
9471
native_tokens_completion_images
(null)
native_tokens_reasoning
8920
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0037985
router
(null)
provider_responses
0
endpoint_id
"9265aa87-9657-4d40-b9d7-2a70318e57d9"
id
"021772578525428f6062c0432612ee8d02c106a4a445010bcda6d"
is_byok
false
latency
837
model_permaslug
"bytedance-seed/seed-2.0-mini-20260224"
provider_name
"Seed"
status
200
api_type
"completions"
id
"gen-1772578524-WiVulhLcjE1UIGYHOBDH"
upstream_id
"021772578525428f6062c0432612ee8d02c106a4a445010bcda6d"
total_cost
0.0037985
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Seed"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
396
89.36%
Dialogue to Total Word Ratio
Ratio: 48.50%, Deviation: 1.50%
neededClean
false
wordsTotal
400
wordsDialogue
194
44.6799%