NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5YK1R5XHWMN1GMJEHCPYZ
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
27.3s
Cost:
$0.000657
Provider:
DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dingy basement of an abandoned toy store smelled of mildew and broken dreams. Molly sat cross-legged on a cracked beanbag chair, twirling a lockstick in her fingers. Bill paced the room, his neon green sneakers squeaking with every step. “Alright, Molly, here’s the plan,” Bill said, pointing to a hastily drawn diagram on a pizza box. “We infiltrate the party disguised as clowns. You’ll be ‘Sprinkles,’ and I’ll be ‘Bozo the Bomb.’ We blend in, distract the kiddos, and make off with the loot.” Molly raised an eyebrow. “Clowns? Really? You know clowns are terrifying, right? You’re going to scare the kids into calling the cops before we even get near the presents.” Bill waved his hand dismissively. “Clowns are classic. Kids love clowns! Besides, have you got a better idea?” Molly leaned forward, resting her elbows on her knees. “Yeah, actually. We go in as magicians. I’ll be ‘The Great Mollyini,’ and you can be ‘Billy the Magnificent.’ We do a few tricks, wow the crowd, and then—abracadabra—the presents disappear.” Bill scratched his chin. “Hmm. Magic does have flair. But do we even know any magic tricks?” Molly grinned and pulled a deck of cards from her pocket. “Watch and learn.” She shuffled the deck with a flourish, then fanned it out in front of Bill. “Pick a card, any card.” Bill rolled his eyes but plucked a card from the deck. Molly shuffled again, cut the deck, and pulled out—a banana. Bill stared at it. “What the—?” Molly shrugged. “Magic. It’s unpredictable.” Bill sighed. “Alright, fine. Magicians it is. But we need a distraction for the parents. They’re not going to just let us walk out with a pile of gifts.” Molly snapped her fingers. “I’ve got it. We hire the world’s worst ventriloquist to perform during the party. The parents will be so baffled by his terrible act, they won’t notice us slipping out the back with the goods.” Bill narrowed his eyes. “Where are we going to get a terrible ventriloquist on such short notice?” Molly smirked. “Don’t worry. I know a guy.” Bill groaned. “Please don’t tell me it’s your ex.” Molly’s grin widened. “Ding ding ding! You win a prize.” Bill threw his hands in the air. “Unbelievable. This plan is a disaster waiting to happen. Why do I let you talk me into these things?” Molly stood up and patted him on the shoulder. “Because, Bill, I’m the brains of this operation. And because no one else is dumb enough to team up with you.” Bill crossed his arms. “Fine. But if this goes south, I’m blaming you.” Molly winked. “Deal. Now, let’s go rehearse some magic tricks. And for the love of all that’s holy, don’t pull out another banana.” Bill muttered under his breath, “I’m surrounded by lunatics.” He followed Molly up the creaky stairs, already regretting every decision that had led him to this moment.
created_at
"2026-02-16T21:33:50.32701+00:00"
2/16/2026, 9:33:50 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
379
moderation_latency
(null)
generation_time
27258
tokens_prompt
77
tokens_completion
788
native_tokens_prompt
66
native_tokens_completion
715
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00065747
router
(null)
provider_responses
0
id
"chatcmpl-Rb6HtZWBXpKJRvYNGwhQCqtt"
status
200
is_byok
false
latency
43
endpoint_id
"5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name
"DeepInfra"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277601-aKqQxLbb0t3hbejMLVvk"
upstream_id
"chatcmpl-Rb6HtZWBXpKJRvYNGwhQCqtt"
total_cost
0.00065747
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
86.3803%
Matches word count
n/a
neededClean
false
words
489
0.2008%
Dialogue to Total Word Ratio
Ratio: 61.15%, Deviation: 11.15%
neededClean
false
wordsTotal
489
wordsDialogue
299
43.2906%