NC Bench
Run: 2026-02-16-01KHM5HBXX43C75W8PYGNE8TC8
Run Details
Model: DeepSeek V3 (2024-12-26)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 22.9s
Cost: $0.000666
Provider: DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dimly lit room smelled vaguely of stale coffee and desperation. Molly leaned over the table, her hands gripping the edges of a scribbled map of a suburban neighborhood. Bill sat across from her, twirling a pen between his fingers like it was the greatest threat to mankind. He squinted at the map, then at Molly, then at the map again. “So,” Molly began, her voice sharp enough to cut through the tension, “the plan is simple. We hit the Johnson’s house during little Timmy’s birthday party. Kids will be distracted by cake, parents by wine, and we’ll be in and out with the loot before anyone notices.” Bill blinked slowly, like a cat contemplating a particularly unimpressive mouse. “You think little Timmy won’t notice us *stealing* his presents? Kid’s six, not blind.” Molly rolled her eyes so hard it looked like they might detach. “Timmy’s too busy stuffing his face with frosting to care. Besides, kids are easily distracted. Throw a toy car at him and he’ll forget we exist.” Bill leaned back in his chair, the legs creaking under his weight. “And the parents? They’re not exactly known for their observational skills, but they might notice two grown adults hauling presents out the front door.” “That’s why we’re not going out the front door,” Molly snapped, jabbing a finger at the map. “We go in through the kitchen, grab the presents from the living room, and exit through the backyard. Easy. Clean. No one catches us.” Bill raised an eyebrow. “What about the dog?” Molly froze mid-gesture. “The dog?” Bill nodded smugly. “Yeah. The Johnsons have a golden retriever. Name’s Max. Loves to bark at everything. Thought you’d done your homework.” Molly’s face flushed red. “Fine. We bring dog treats. Max gets a snack, we get the presents. Everyone wins.” Bill snorted. “You really think Max is gonna be fooled by a couple of Milk-Bones? That dog’s smarter than you.” Molly glared at him, her jaw tightening. “You got a better idea, genius?” Bill grinned, leaning forward. 
“Actually, I do. We dress up as clowns.” Molly stared at him, her expression blank. “Clowns.” “Yeah,” Bill said, gesturing wildly. “Kids love clowns. Parents tolerate clowns. Dogs... well, dogs hate clowns, but that’s beside the point. We blend in, entertain the kids, and while everyone’s distracted, we slip the presents out.” Molly opened her mouth, closed it, then opened it again. “That’s the dumbest thing I’ve ever heard.” Bill shrugged. “Got a better plan?” Molly sighed, pinching the bridge of her nose. “Fine. Clowns it is.” Bill clapped his hands together, grinning like he’d just won the lottery. “Great. I’ll get the costumes. You figure out how to keep Max from biting us.” Molly muttered something under her breath that sounded suspiciously like “I hate my life.” Bill ignored her, already scrolling through clown costume websites on his phone. The plan was ridiculous, dangerous, and almost certainly doomed to fail. But, Molly thought as she stared at the map, it might just be crazy enough to work.
created_at: "2026-02-16T21:26:32.031907+00:00" (2/16/2026, 9:26:32 PM)
model: "deepseek/deepseek-chat-v3"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 407
moderation_latency: (null)
generation_time: 22873
tokens_prompt: 77
tokens_completion: 832
native_tokens_prompt: 66
native_tokens_completion: 725
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00066637
router: (null)
provider_responses:
0:
id: "chatcmpl-R42EGAlDt4onTmx90PUUhI2g"
status: 200
is_byok: false
latency: 44
endpoint_id: "5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name: "DeepInfra"
model_permaslug: "deepseek/deepseek-chat-v3"
api_type: "completions"
id: "gen-1771277168-Gt4omZGzdvQAnedw0nKo"
upstream_id: "chatcmpl-R42EGAlDt4onTmx90PUUhI2g"
total_cost: 0.00066637
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
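The timing and token fields above can be combined into a rough throughput figure. The sketch below is illustrative only: it assumes `generation_time` is in milliseconds (22873 lines up with the 22.9s Duration shown under Run Details), and `tokens_per_second` is a hypothetical helper, not part of NC Bench or the provider API.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Completion tokens divided by generation time, with time given in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata above.
GENERATION_TIME_MS = 22873          # generation_time (assumed to be milliseconds)
NATIVE_TOKENS_COMPLETION = 725      # native_tokens_completion
TOKENS_COMPLETION = 832             # tokens_completion

native_rate = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
normalized_rate = tokens_per_second(TOKENS_COMPLETION, GENERATION_TIME_MS)

print(f"native tokens/s:     {native_rate:.1f}")
print(f"normalized tokens/s: {normalized_rate:.1f}")
```

The two rates differ only because the native and normalized completion token counts differ (725 versus 832); the generation time is the same in both cases.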
Evaluation details
Result: 93.6496%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 509

Result: 0.0000%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 53.63%, Deviation: 23.63%
Meta Data: neededClean: false, wordsTotal: 509, wordsDialogue: 273
46.8248%
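The reported ratio and deviation can be reproduced from the word counts in the evaluation metadata. The sketch below is a minimal illustration, not the evaluator's actual implementation: `dialogue_words` assumes dialogue is whatever sits between curly double quotes, `count_words` assumes whitespace tokenization, and all function names are hypothetical. It also checks that the final figure, 46.8248%, equals the arithmetic mean of the two evaluator results; the page does not state how that figure is derived, so treat this as an observation.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated words (an assumed tokenization)."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside curly double quotes (an assumed definition of dialogue)."""
    spans = re.findall(r"“([^”]*)”", text)
    return sum(len(span.split()) for span in spans)


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of all words that are dialogue, as a fraction between 0 and 1."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


# Counts reported in the evaluation metadata; target taken from the scenario.
WORDS_TOTAL = 509
WORDS_DIALOGUE = 273
TARGET_RATIO = 0.30

ratio = dialogue_ratio(WORDS_DIALOGUE, WORDS_TOTAL)
deviation = abs(ratio - TARGET_RATIO)
mean_result = (93.6496 + 0.0) / 2  # mean of the two evaluator results

print(f"Ratio: {ratio * 100:.2f}%, Deviation: {deviation * 100:.2f}%")
print(f"Mean of evaluator results: {mean_result:.4f}%")

# Counting on a short excerpt of the response.
sample = "“The dog?” Bill nodded smugly. “Yeah.”"
print(dialogue_words(sample), "of", count_words(sample), "words are dialogue")
```

At a 30% target, a 500-word scene would carry about 150 words of dialogue; the response contains 273, which is why the deviation is large even though the total word count is close to the target.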