NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2HZSM62XR9A9DDPAMKH5YY
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
16.5s
Cost:
$0.000298
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the oversized sunglasses she’d bought at a gas station, peering through the hedge at the inflatable bounce house. “Okay, Bill, run it by me again. Slowly. Like I’m five.” Bill, crammed into a lime-green clown costume with a frilly collar that was slowly strangling him, shifted his weight. “We waltz in. I’m ‘Bubbles the Magic Clown.’ You’re my assistant, ‘Sparkles.’ We do balloons. Bad ones. Shapes that look like misshapen kidneys.” “And then?” “Then we hit the present table. We switch the real ones with decoy boxes I filled with rocks and old VHS tapes of ‘Barney.’” Molly snorted. “Barney? That’s a war crime.” “It’s a distraction. The kids find decapitated dinosaur heads, they cry. Chaos. We slip the real loot into my exploding flower.” “Your flower is a duffel bag, Bill.” “A thematic duffel bag. It squirts purple juice. A kid licked it earlier. He’s fine.” Molly rubbed her temples. “Fine. But what about the mother? The one with the ‘mom-rage’ eyes and the Bluetooth earpiece? She looks like she’d taser us for using the wrong kind of glitter.” “That’s where the glitter bomb comes in.” “We don’t have a glitter bomb.” “We do now.” Bill pulled a small canister from his pocket. “Industrial-grade. The kind they use at weddings for spite. One pop, she’s a disco ball. We run.” “Run where? Into a wall of screaming six-year-olds? They’ll hug us to death.” “We zigzag. I’ve got a route mapped.” He unfolded a napkin with a crude drawing of a castle cake and an arrow labeled “ESCAPE VIA THE PONY.” Molly stared. “The pony is a man in a horse costume named Kevin. He’s seventy. He has a heart condition.” “Kevin’s a professional. He’ll cover our retreat with a sad trot.” “This is the stupidest plan since you tried to steal a vending machine by tipping it into a U-Haul.” “That was a success until the chips fell out the back.” “You lost two hundred bags of Cool Ranch.” “We still got the Snickers.” Bill held up a hand. “Look. The presents are in a pile. Unsupervised. The cake is out. The adults are drinking white wine from sippy cups. We have a window.” Molly sighed, peering again. A child ran by, face smeared with frosting, screaming about a missing dinosaur. “Fine. But if I end up with glitter in my hair for a week, I’m putting your name on the police report as ‘The Clown Who Cried.’” “Deal. On my signal.” “What’s the signal?” Bill pointed. “When the bounce house deflates.” “It’s not going to deflate.” “It will if I put this in it.” He held up a small, sharp rock. Molly grabbed his wrist. “You are not stabbing the bounce house with a rock in front of twenty mothers.” “It’s a safety rock.” “There’s no such thing.” “Fine. New plan. We wait for the magic show. When the magician turns a bunny into a scarf, we act.” “Why?” “Because that rabbit always gets loose, and chaos is our friend.” From inside the hedge, a small voice said, “Are you guys stealing the presents?” Bill froze. Molly turned slowly. A little girl in a princess dress stood two feet away, holding a juice box. Molly smiled. “No, sweetie. We’re the entertainment.” The girl squinted. “You look like my mom’s ex-boyfriend.” Bill dropped the rock. “We’re heroes, actually. We’re testing the presents for cooties.” “Cooties aren’t real.” “They are today,” Molly said, grabbing Bill’s arm. “Run.” They ran. The girl took a long sip of her juice box, then shouted, “Mom! The weird clowns are stealing the good gift bags!” The mom with the Bluetooth earpiece turned. She did not look amused. “We’re doing the glitter! Now!” Bill yelled. Molly yanked the canister from his hand, tossed it over her shoulder, and said, “I am quitting this partnership tomorrow.” “You said that last week.” “I meant it this time.” The glitter bomb exploded behind them, coating a pony named Kevin in a shimmering rainbow. He did a sad trot. Everything went quiet. “That was a distraction?” Molly whispered. “No,” Bill said. “That was a tragedy.”
created_at
"2026-04-25T14:51:35.436Z"
4/25/2026, 2:51:35 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
527
moderation_latency
(null)
generation_time
16273
tokens_prompt
77
tokens_completion
1168
native_tokens_prompt
67
native_tokens_completion
1031
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00029806
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"90518881-d5ca-489a-b845-c6a59333288b"
is_byok
false
latency
314
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777128695-xOCrvGpyDnT5joZYQiGT"
session_id
(null)
api_type
"completions"
id
"gen-1777128695-nkswk28Iu5R4mXgDVwVt"
upstream_id
"90518881-d5ca-489a-b845-c6a59333288b"
total_cost
0.00029806
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
688
0.00%
Dialogue to Total Word Ratio
Ratio: 68.45%, Deviation: 18.45%
neededClean
false
wordsTotal
691
wordsDialogue
473
0.0000%