NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM63D058HQ64TAZRGQ8RE29
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
26.8s
Cost:
$0.000732
Provider:
DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned back in her folding chair, twirling a pen between her fingers. “Okay, Bill, listen up. This is our biggest job yet, and I don’t want you screwing it up like last time.” Bill frowned, scratching his scruffy beard. “Last time wasn’t my fault. Who knew the cat would set off the laser alarm?” “Everyone, Bill. Everyone knew that.” Molly sighed, flipping open a notebook. “Now, focus. We’re hitting a children’s party tomorrow. The client wants all the presents. Every last one.” Bill’s eyes widened. “A kid’s party? Molly, that’s... that’s low.” “Low pays the bills,” Molly snapped. “Besides, these kids aren’t even gonna remember this party in five years. Trust me, we’re doing them a favor.” Bill squinted at her. “How’s that?” Molly grinned. “Teaching them early that life’s unfair. They’ll thank us later.” Bill shook his head but didn’t argue. “Fine. What’s the plan?” Molly tapped her notebook. “The party’s at 3 PM. We go in disguised as clowns. No one suspects clowns.” Bill groaned. “Clowns? Really? I hate clowns.” “You’re wearing the nose, Bill. No arguments.” Molly flipped a page. “We blend in, do a few tricks to distract the kids, and then slip into the gift room. We load up the presents into our getaway car—an ice cream truck, by the way—and vanish.” “Ice cream truck?” Bill blinked. “Where’d you get an ice cream truck?” “Don’t ask questions you don’t want the answers to,” Molly said, waving him off. “Now, here’s the important part: we can’t let the kids see us taking the presents. If we get caught, it’s game over.” Bill scratched his head. “How do we avoid the kids?” Molly smirked. “We distract them with candy. I’ve got bags of it. Toss it in the opposite direction, and while they’re scrambling, we grab the loot.” Bill nodded slowly. “That... could work. But what if the parents notice?” “Parents at kids’ parties are too busy gossiping or pretending to care about finger painting,” Molly said dismissively. “Trust me, they won’t notice a thing.” Bill hesitated. “What about the clown costumes? Are they... scary?” Molly sighed. “They’re perfectly normal clown costumes, Bill. Bright colors, big shoes, the whole shebang. You’ll look ridiculous, but that’s the point.” Bill grimaced. “I don’t like this.” “You don’t have to like it,” Molly said, standing up. “You just have to do it. Now, let’s go rehearse your balloon animal skills.” Bill groaned again. “Balloon animals? Molly, I can’t even tie my own shoes properly.” “Then you’d better learn fast,” Molly said, tossing him a bag of balloons. “Because tomorrow, we’re stealing Christmas—or birthday, whatever—and there’s no room for mistakes.” Bill stared at the balloons like they were a death sentence. “This is gonna be a disaster.” Molly patted his shoulder. “Probably. But hey, if it works, we’re rich. If it doesn’t, we’re screwed. Either way, it’ll be entertaining.” Bill sighed. “Remind me why I work with you again?” Molly grinned. “Because no one else would put up with you. Now, blow up that balloon and start practicing. We’ve got a party to ruin.”
created_at
"2026-02-16T21:36:26.822789+00:00"
2/16/2026, 9:36:26 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
320
moderation_latency
(null)
generation_time
26714
tokens_prompt
77
tokens_completion
896
native_tokens_prompt
66
native_tokens_completion
799
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00073223
router
(null)
provider_responses
0
id
"chatcmpl-RXFm7vzWdAcD6pJpSZxj4v1k"
status
200
is_byok
false
latency
54
endpoint_id
"5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name
"DeepInfra"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277759-6x7gwCfCYv5p1xkGbvrI"
upstream_id
"chatcmpl-RXFm7vzWdAcD6pJpSZxj4v1k"
total_cost
0.00073223
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
35.0022%
Matches word count
n/a
neededClean
false
words
518
0.0000%
Dialogue to Total Word Ratio
Ratio: 78.38%, Deviation: 28.38%
neededClean
false
wordsTotal
518
wordsDialogue
406
17.5011%