NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5YT273VNV9BGYM5JYJ34N
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
14.4s
Cost:
$0.000710
Provider:
SiliconFlow
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dim glow of a single bulb flickered above the rusted kitchen table where Molly and Bill huddled over a hastily drawn floorplan. Empty coffee cups and half-eaten donuts littered the surface between them. Molly tapped the paper with a chipped fingernail, her brows furrowed. "You’re telling me the only way in is through the *bounce house*?" Bill squinted at the diagram. "That’s gotta be the dumbest plan we’ve ever had." Molly smirked. "No, the dumbest plan was when you tried to rob a pet store and got chased out by a parrot." "It *mocked me*, Molly. Traumatizing." Bill crossed his arms. "But this? Kids’ parties are like military operations. Helicopter parents, sugar-crazed toddlers—it’s a warzone." "Exactly." Molly leaned in. "No one expects thieves at a six-year-old’s birthday. We show up in clown costumes, blend in, and—" "—get stuck inflating the bounce house for two hours while Karen from accounting micromanages the juice boxes. No thanks." Molly rolled her eyes. "Fine. Plan B: The piñata." Bill perked up. "Now we’re talking. Smash and grab?" "Better." She grinned. "We *become* the piñata." Bill stared. "... You wanna *wear* the piñata." "Think about it," Molly said, thrusting a hand into her backpack. She pulled out a giant, glittery donkey head. "We sneak in as the entertainment. Kids whack us with sticks, we ‘accidentally’ burst open, and while everyone scrambles for candy—" "—we nab the presents and vanish like ninjas." Bill scratched his chin. "I like it. But what if the kids whack too hard?" "Wear a cup." "Fair." Bill sighed, then frowned. "Wait. How do we carry the loot? Piñatas don’t have pockets." Molly tossed him a fanny pack. "Stash it in here. We’ll duct tape it inside the piñata." Bill eyed the pack. "This is the least cool thing I’ve ever worn." "You robbed a geriatric bingo night dressed as a nurse. Your coolness hit rock bottom years ago." "Low blow." Bill snapped the fanny pack around his waist. "Alright, clown or donkey—either way, we’re committing felonies in festive attire. What’s the exit strategy?" Molly pointed to the back of the diagram. "After the ‘accident,’ we ditch the costumes in the bounce house, slip out the side gate, and meet at the getaway car—which, by the way, is a minivan with ‘Soccer Mom’ stickers." Bill groaned. "We’re gonna die uncool and covered in glitter, aren’t we?" Molly clapped him on the back. "Welcome to the big leagues, Billy." He sighed, resigned. "Just promise me one thing." "What?" "If we get caught, we tell the cops our names are Santa’s Elves." Molly grinned. "Deal."
created_at
"2026-02-16T21:33:43.804178+00:00"
2/16/2026, 9:33:43 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1745
moderation_latency
(null)
generation_time
14168
tokens_prompt
77
tokens_completion
722
native_tokens_prompt
66
native_tokens_completion
694
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0007105
router
(null)
provider_responses
0
id
"019c685f6a90879a3b3e75c0fe65e94b"
status
200
is_byok
false
latency
1745
endpoint_id
"d562908f-c5eb-4a32-9a91-24d53b176fd8"
provider_name
"SiliconFlow"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277609-Dl67yp3tHmX3Jh05798m"
upstream_id
"019c685f6a90879a3b3e75c0fe65e94b"
total_cost
0.0007105
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
441
10.3539%
Dialogue to Total Word Ratio
Ratio: 63.27%, Deviation: 6.73%
neededClean
false
wordsTotal
441
wordsDialogue
279
5.1770%