NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5VW1WB37M6C1NY1SVBAN6
Run Details
Model:
DeepSeek V3.1
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
53.4s
Cost:
$0.000632
Provider:
SambaNova
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rusted lawn chair groaned under Bill’s considerable weight as he squinted at the invitation. “It says ‘No gifts, please.’ Right there. In italics.” Molly, perched on the edge of a plastic patio table, didn’t look up from polishing a grappling hook with a silk handkerchief. “Of course it says that, Bill. It’s a lie parents tell each other to feel better. It’s a social nicety, like saying ‘How are you?’ when you don’t actually want to hear about my deviated septum.” “Seems risky,” Bill grumbled, folding the colorful card. “What if they listened? We go through all the trouble of neutralizing the bouncy castle attendant, and all we get is a handful of gluten-free cupcakes and a disappointed six-year-old.” “Bill, look at the address. This is the Pemberton-Smythe estate. Their little ‘Baxter’ or ‘Bentley’ or whatever horrid name they’ve chosen is not having a gift-less party. The other parents will ignore the request out of pure, unadulterated social panic. It’ll be a smorgasbord of overpriced plastic and educational toys that will teach the child nothing except resentment.” Bill considered this. “So… it’s a target-rich environment?” “It’s the Fort Knox of fifth birthdays,” Molly said, her eyes gleaming. “Now, the real security issue isn’t the parents. They’ll be distracted by sangria and comparing private preschools. It’s the entertainer.” “The clown?” “Worse. A licensed, certified, and fully-credentialed ‘Childhood Enrichment Coordinator’ named Professor Pumpernickel. His focus is rhythmic cognitive engagement. He’s a one-man distraction machine.” Bill snorted. “A guy in a bad hat with a bunch of scarves. What’s he gonna do, harmonize at us?” “He’s the key. He’s our unknowing accomplice.” Molly leaned in, her voice dropping to a conspiratorial whisper. “His entire act is built around a ‘Magical Treasure Chest’ he uses for his finale. The kids all put their ‘happy thoughts’ inside, and he A slow, ugly grin spread across Bill’s face. “He uses a lock?” “A flimsy theatrical padlock. You could beat it with a stern look. Our play is simple. You create a diversion during the ‘Hokey Pokey’ remix—I suggest a catastrophic ice cream meltdown near the main patio. While all eyes are on the sticky tragedy, I switch his treasure chest with our identical, empty one. He performs his finale, locks it up, and wheels it right to his van, which we will have conveniently parked in the shade for him.” “We steal his van? Molly, that’s grand theft auto!” “No, you oaf. We *offer* to park his van. We are considerate party guests. He’ll be grateful. He’ll load the chest himself. We drive away with the entire haul in one neat, professionally handled package. It’s not a heist, Bill. It’s a logistical reassignment.” Bill stared into the middle distance, imagining it. The chaos, the switch, the clean getaway. “It’s… beautiful. But what about the happy thoughts? What happens to them?” Molly sighed, slipping the grappling hook into her duffel bag. “Bill, in our line of work, you can’t afford to get sentimental about the happy thoughts of strangers. Besides,” she said, standing up and brushing off her trousers, “we’re giving those parents a far more valuable gift.” “What’s that?” “A wonderful reason to never speak to each other again. Now, go get your party hat. We’ve got a birthday to crash.”
created_at
"2026-02-16T21:32:46.606144+00:00"
2/16/2026, 9:32:46 PM
model
"deepseek/deepseek-chat-v3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1517
moderation_latency
(null)
generation_time
53317
tokens_prompt
95
tokens_completion
927
native_tokens_prompt
67
native_tokens_completion
829
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0006318
router
(null)
provider_responses
0
id
"6ab46c23-52aa-4b1a-865c-f9a37dac4426"
status
200
is_byok
false
latency
1517
endpoint_id
"1dda4bb9-c626-4816-bb54-99665323fef0"
provider_name
"SambaNova"
model_permaslug
"deepseek/deepseek-chat-v3.1"
api_type
"completions"
id
"gen-1771277512-Ow4FKYbwkZFYNktdfOwy"
upstream_id
"6ab46c23-52aa-4b1a-865c-f9a37dac4426"
total_cost
0.0006318
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SambaNova"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
554
0.1069%
Dialogue to Total Word Ratio
Ratio: 58.30%, Deviation: 11.70%
neededClean
false
wordsTotal
554
wordsDialogue
323
0.0535%