NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM60TXZAPP1NK169CXR6Y2B
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
32.3s
Cost:
$0.000774
Provider:
DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dimly lit basement smelled of mildew and old pizza boxes. Molly tapped a pencil against a crumpled piece of paper labeled *Operation Toy Grab*. Bill leaned back in a rickety chair, chewing gum with the intensity of a man who had just discovered flavor. “So,” Molly began, “the target’s a kid’s birthday party. Six-year-old Timmy. Theme: pirates. Cake shaped like a treasure chest. Sounds adorable, right?” Bill popped a bubble. “Adorable? Molly, we’re not here to critique their party-planning skills. We’re here to rob them.” “Exactly.” Molly pointed to the map. “The presents will be stacked near the cake table. That’s our gold mine. All we need is a distraction.” Bill frowned. “What kind of distraction? Balloon animals? Clown noses? I’m not wearing a wig, Molly.” Molly rolled her eyes. “Relax, Bill. No wigs. We’re going in as caterers. You’ll handle the drinks; I’ll handle the cake. When the time’s right, I’ll fake a cake collapse. Chaos ensues. You grab the presents. Easy.” Bill raised an eyebrow. “A cake collapse? That’s your master plan? What if the cake doesn’t collapse?” Molly smirked. “Oh, it’ll collapse. I’ll make sure of it. I’ve got a buttercream recipe that’s basically structural sabotage.” Bill shook his head. “You’re a menace. What if someone catches me hauling a pile of gifts out the door?” “You’ll say you’re moving them to a ‘safe location’ because of the cake disaster,” Molly replied without missing a beat. “Parents love safety. They’ll eat it up.” Bill crossed his arms. “What if Timmy’s dad is a cop? Or worse, a lawyer?” Molly shrugged. “Then we’ll say we’re from the toy company and we’re recalling defective gifts. Everyone hates recalls.” Bill groaned. “This is insane. We’re stealing toys from a kid’s party. What’s next? Robbing a lemonade stand?” “Don’t knock lemonade stands,” Molly said, wagging her pencil. “Those kids are sitting on a goldmine. But focus, Bill. This is about the presents. The loot. The shiny, wrapped booty.” Bill leaned forward. “Alright, say we pull this off. How do we even carry all the presents? There’s gotta be, what, twenty of them?” Molly grinned and pulled out a drawing of a makeshift sled. “We’ll rig up a sled with wheels. Attach it to your belt. You pull, I’ll push. We’ll be in and out before anyone notices.” Bill stared at her. “A sled? Molly, we’re not Santa Claus.” “No,” Molly said, folding her arms triumphantly. “We’re better. Santa brings toys. We take them.” Bill sighed, chewing his gum harder. “You know, when I got into this life, I pictured robbing banks, jewelry stores—big scores. Not a pirate-themed birthday party.” Molly patted his shoulder. “Every master thief starts small, Bill. This is our training wheels. Besides, think of the resale value. Toys are practically currency.” Bill muttered, “I hope Timmy gets socks.” Molly stood, clapping her hands. “Positive thinking, Bill. Now, let’s rehearse the cake collapse. You’ll be the cake; I’ll be gravity.” Bill groaned again but stood up. “If this goes sideways, I’m blaming you.” Molly grinned. “And if it works, we’ll be legends. The Robin Hoods of the toy underworld.” Bill rolled his eyes. “More like the Grinches of preschool.” Molly laughed. “Same difference. Now, let’s go steal ourselves a treasure chest.”
created_at
"2026-02-16T21:35:08.44725+00:00"
2/16/2026, 9:35:08 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
308
moderation_latency
(null)
generation_time
32241
tokens_prompt
77
tokens_completion
939
native_tokens_prompt
66
native_tokens_completion
846
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00077406
router
(null)
provider_responses
0
id
"chatcmpl-RsbNC5esjDag7ivhSmT4brCH"
status
200
is_byok
false
latency
49
endpoint_id
"5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name
"DeepInfra"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277675-86hgTIxGDlMxEkmJ3p8z"
upstream_id
"chatcmpl-RsbNC5esjDag7ivhSmT4brCH"
total_cost
0.00077406
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
548
56.9102%
Dialogue to Total Word Ratio
Ratio: 73.36%, Deviation: 3.36%
neededClean
false
wordsTotal
548
wordsDialogue
402
28.4551%