NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5Y1SXG3RYDPQN774N8T0N
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
26.7s
Cost:
$0.000693
Provider:
DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her black beanie and leaned over the blueprints spread across the rickety coffee table. “Alright, Bill. This is it. The biggest score of our lives.” Bill squinted at the paper, scratching his scruffy chin. “A children’s birthday party? Molly, are you out of your mind? Last time we tried a kid’s thing, we got gum stuck in my hair, and I still haven’t recovered emotionally.” “This isn’t just any party, Bill,” Molly said, tapping the blueprint with her pen. “This is *the* party. Little Timmy’s seventh birthday. Do you know what that means? iPads, LEGO sets, and—get this—a brand-new Nintendo Switch. We’re talking a goldmine.” Bill crossed his arms, raising an eyebrow. “Goldmine? Molly, we’re talking about a bunch of screaming kids and cake. How are we even supposed to pull this off?” Molly grinned, her eyes gleaming. “Easy. We blend in. I’ll go as the balloon artist, and you’ll be the magician. No one will suspect a thing.” Bill groaned. “Balloon artist? Magician? Molly, I don’t know how to do magic tricks!” “You don’t have to, Bill! Just wave your hands around, say ‘abracadabra,’ and distract them while I grab the goods.” Bill threw his hands up. “Distract them? With what? A disappearing rabbit? I don’t even have a rabbit!” Molly rolled her eyes. “You’re overthinking this. Just pretend. Look, I’ll teach you a simple trick. You put a coin behind your ear, and then—poof!—it’s gone. Kids love that.” Bill mimicked the motion half-heartedly. “Poof? That’s your genius plan? What if they realize I’m faking it?” Molly smirked. “They’re seven, Bill. They’re not exactly master detectives. Trust me, this’ll work.” Bill sighed, running a hand through his hair. “Fine. But what about the parents? They’re not gonna fall for this.” “That’s where the balloons come in,” Molly said, holding up a brightly colored balloon. “Parents love balloons. They see me twisting these things into poodles and swords, and they’ll be too busy reminiscing about their childhood to notice us sneaking off with the presents.” Bill shook his head. “This is insane. What if someone catches us?” “No one’s catching us,” Molly said confidently. “We’ll be in and out before they even notice the cake’s gone stale.” Bill groaned again. “This is a disaster waiting to happen. What happens when the parents realize the magician and balloon artist didn’t actually get hired?” Molly shrugged. “We’ll say we were volunteers. Charity work. People love charity.” Bill stared at her, dumbfounded. “You’ve thought of everything, haven’t you?” “That’s why I’m the brains of this operation,” Molly said with a wink. “Now, let’s practice your coin trick. And for the love of all things holy, don’t drop it.” Bill reluctantly reached for a coin, muttering under his breath. “Balloon artist and a magician. What’s next? Clowns?” Molly grinned. “Don’t tempt me. Now, let’s get to work. Timmy’s party waits for no one.”
created_at
"2026-02-16T21:33:31.457122+00:00"
2/16/2026, 9:33:31 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
300
moderation_latency
(null)
generation_time
26613
tokens_prompt
77
tokens_completion
832
native_tokens_prompt
66
native_tokens_completion
755
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00069307
router
(null)
provider_responses
0
id
"chatcmpl-RZvmld7mKKwnzGYciapIcPOE"
status
200
is_byok
false
latency
63
endpoint_id
"5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name
"DeepInfra"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277584-OH8unUou9nXBI9IJdGVW"
upstream_id
"chatcmpl-RZvmld7mKKwnzGYciapIcPOE"
total_cost
0.00069307
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
75.1556%
Matches word count
n/a
neededClean
false
words
487
11.3914%
Dialogue to Total Word Ratio
Ratio: 76.59%, Deviation: 6.59%
neededClean
false
wordsTotal
487
wordsDialogue
373
43.2735%