NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5DBFS787G7BZA15WXR7PX
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
23.3s
Cost:
$0.000932
Provider:
Novita
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse was drafty, the kind of place where even the rats wore tiny sweaters. Molly tapped her foot against a crate labeled *Fragile: Probably Just More Crates Inside*, while Bill squinted at a napkin covered in scribbles that may or may not have been a map. "You're telling me," Molly said, "that the biggest score of our careers is a six-year-old's birthday party?" Bill folded the napkin carefully, then unfolded it when he realized it was upside down. "Not just any party, Moll. *Timmy Thompson’s* party. Kid’s dad owns half the city. You think those presents are just finger paints and teddy bears? We’re talking solid gold rattles, diamond-encrusted sippy cups—" "—and a security detail that probably includes ex-Navy SEALs in clown costumes," Molly finished. She rubbed her temples. "Bill, we’ve stolen from banks, museums, even that one guy’s collection of rare toenail clippings—" "Which was a *mistake*," Bill interjected. "—but this? This is low." Bill gasped, clutching his chest. "Low? Molly, we’re *modern-day Robin Hoods*." "Robin Hood stole from the rich to give to the poor. We’re stealing from a *kindergartener* to give to ourselves." "Semantics." Bill waved a hand. "Besides, it’s not like Timmy’s gonna remember. Kids that age think a cardboard box is the height of luxury. We’re doing him a *favor*. Teaching him early that life’s unfair." Molly sighed. "Fine. What’s the plan?" Bill grinned and unfurled the napkin with a flourish. "We go in disguised as the entertainment." "Please don’t say clowns." "Better." He paused. "*Magicians*." Molly stared. "You can’t even shuffle a deck without dropping half the cards." "Details! All we need is a little misdirection. You’ll be the lovely assistant—" "I’m not wearing sequins." "—and I’ll do the classic ‘saw the kid in half’ bit—" "Absolutely not." "—*symbolically*, Moll. Then, while the parents are busy pretending to be amazed, we grab the gifts and vanish!" He mimicked an explosion with his hands. Molly pinched the bridge of her nose. "And how do we get out?" Bill blinked. "Hadn’t thought that far." "Of course not." She grabbed the napkin and turned it right-side up. "Here’s the *real* plan. We pose as caterers. Load the presents into a cake box. Walk out like we’re just taking the trash." Bill frowned. "Where’s the pizzazz?" "Pizzazz gets you caught. Stealth gets you paid." He sighed. "Fine. But I’m still bringing the magic wand." Molly rolled her eyes. "Just don’t wave it near the guard dogs." Bill paused. "There are guard dogs?" Molly smiled sweetly. "Life’s unfair, Bill."
created_at
"2026-02-16T21:24:21.060886+00:00"
2/16/2026, 9:24:21 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1495
moderation_latency
(null)
generation_time
23235
tokens_prompt
77
tokens_completion
729
native_tokens_prompt
66
native_tokens_completion
697
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0009325
router
(null)
provider_responses
0
id
"6d8eeb07174d234458c7dc4dced81b16"
status
200
is_byok
false
latency
1495
endpoint_id
"cefb5a1e-9fea-4496-9a64-4e1b11a7cc8c"
provider_name
"Novita"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277037-Ppz7bKWJcsJktG30Flca"
upstream_id
"6d8eeb07174d234458c7dc4dced81b16"
total_cost
0.0009325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
428
0.0000%
Dialogue to Total Word Ratio
Ratio: 63.55%, Deviation: 33.55%
neededClean
false
wordsTotal
428
wordsDialogue
272
0.0000%