NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5XFXR5CDVJFF5W1946F5T
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
23.3s
Cost:
$0.000827
Provider:
Novita
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dim glow of a single bulb flickered above Molly and Bill as they hunched over a crudely drawn map of the neighborhood. The air smelled of stale coffee and bad decisions. “Okay, listen,” Molly said, tapping the paper with a chewed-up pencil. “The Johnson kid’s birthday party starts at noon. That’s when we strike.” Bill scratched his scruffy chin. “A kid’s party? Really? That’s the big score?” “Think about it,” Molly said, leaning in. “No security. No alarms. Just a bunch of sugar-high toddlers and distracted parents. And presents—so many presents.” Bill frowned. “What kind of presents are we talking? Legos? Action figures?” “Better,” Molly said, grinning. “The Johnsons are loaded. Rumor has it little Timmy’s getting a mini Tesla.” “A what?” “A tiny electric car for kids. Worth a fortune on the black market.” Bill’s eyes widened. “Okay, I’m in. But how do we get in? We can’t exactly blend in with the parents.” Molly pulled out two oversized clown costumes from a duffel bag. “Ta-da.” Bill groaned. “You’ve got to be kidding me.” “Clowns are the perfect disguise,” Molly insisted. “No one questions clowns. They’re like the ninjas of the party world.” “I hate clowns.” “You hate jail more.” Bill sighed, grabbing the polka-dotted monstrosity. “Fine. But if I get recognized, I’m blaming you.” Molly rolled her eyes. “Relax. We’ll be in and out. Distraction first—I’ll trip the piñata, cause chaos. You grab the presents, load ‘em into the getaway wagon.” “The what?” She pointed out the window at a rusty ice cream truck parked crookedly on the curb. Bill’s jaw dropped. “You stole an ice cream truck?” “Borrowed,” Molly corrected. “And it’s genius. Who’s gonna question an ice cream truck at a kid’s party?” Bill rubbed his temples. “This plan has more holes than my last pair of socks.” “Trust me,” Molly said, clapping him on the back. “By the time they realize the presents are gone, we’ll be halfway to Mexico.” “We’re not going to Mexico.” “Figure of speech, Bill.” He sighed, pulling on the clown shoes. “If we get caught, I’m telling the cops this was all your idea.” Molly adjusted her rainbow wig. “Noted. Now let’s go steal some birthday joy.” Bill muttered under his breath as they headed out, the ice cream truck’s jingle playing ominously in the distance.
created_at
"2026-02-16T21:33:09.782045+00:00"
2/16/2026, 9:33:09 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1498
moderation_latency
(null)
generation_time
23214
tokens_prompt
77
tokens_completion
694
native_tokens_prompt
66
native_tokens_completion
616
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0008272
router
(null)
provider_responses
0
id
"8bf452d61be0499ccf734e4d01dce429"
status
200
is_byok
false
latency
1498
endpoint_id
"cefb5a1e-9fea-4496-9a64-4e1b11a7cc8c"
provider_name
"Novita"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277565-cLQwRGmlAohnfoMjRtxv"
upstream_id
"8bf452d61be0499ccf734e4d01dce429"
total_cost
0.0008272
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
392
4.9436%
Dialogue to Total Word Ratio
Ratio: 62.24%, Deviation: 7.76%
neededClean
false
wordsTotal
392
wordsDialogue
244
2.4718%