NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2HZ66RCRP4K240XPHJEG39
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
21.0s
Cost:
$0.000783
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted through the grimy window of the van, her breath fogging the glass. On the other side of the manicured lawn, a bouncy castle shaped like a vomiting dragon was vibrating with the screams of tiny, over-sugared humans. She tapped a manicured nail against a blueprint that was actually just a napkin with a cartoon of a cake drawn on it. “Alright, Bill, walk me through the layers of this beautiful, beautiful onion one more time,” she said. Bill, who was struggling to pull a pair of oversized yellow overalls over his regular clothes, grunted. “The outer layer is the character. I am ‘Boopsy the Clown,’ a professional of whimsy. My mirth is a shield.” “Your trousers are on backwards,” Molly observed. Bill looked down. The embroidered name tag that was supposed to sit on his chest was flapping sadly against his left buttock. “It’s a deconstruction of the classic clown silhouette,” he said, wrenching them around with a dull ripping sound. “I’m subverting expectations before I even fall over.” “The real subversion is in the gift pile,” Molly said, tapping the napkin again. “That’s the creamy centre. While you’re making balloon poodles and traumatising the introverts with your aggressively friendly magic tricks, I’ll be a ghost in the wrapping paper.” “A ghost in a cat suit,” Bill corrected, pulling a frizzy rainbow wig over his head. It looked like a startled, candy-floss creature had died on his scalp. “A spectral, feline presence. You’re Clawdette, the Cat Burglar. It’s thematically tight. The kids get a clown, we get their presents.” Molly checked the compact mirror, attaching a set of pointy, felt-covered ears to her head. “And where is the alleged security for this juvenile Fort Knox?” “Kevin,” Bill said, the name dripping with professional contempt. “The birthday father. He’s the muscle. Apparently, he got a BBQ smoker for Father’s Day and has been insufferably territorial ever since. My intel says his weak point is a craft IPA and any conversation about the perfect smoke ring on a brisket. I’ll occupy him with a bit about how my seltzer bottle can also brine a chicken.” “What if the other kids get grabby with the loot?” she asked. Bill secured a bulbous red nose over his own, his voice taking on a muffled, honking quality. “That’s the genius. The piñata is our Trojan horse. I’ve already switched the real one with our decoy. It’s filled with healthy snacks and sugar-free lollipops. Absolute chaos. While the little monsters are rioting over a bag of carrot sticks, you consolidate the primary assets into your cat-suit’s expandable tail. It’s a reverse heist. We give them the most disappointing party memory of their lives, and we leave with a haul of instant pots and scented candles.” A minivan pulled up, disgorging a pack of children clutching primary-coloured envelopes. The air filled with the pre-instinctual shriek of a party in its fetal stage. Molly adjusted her tail, feeling the weight of the empty, specially sewn compartments within. “Earmuffs,” she said, pointing a sleek, cat-clawed glove at a toddler who was already trying to eat a decorative pebble. “That’s our critical failure point,” Bill agreed, his clown suit squeaking as he slid the van door open. “A rock-eating kid won’t care about a sugar-free lollipop black market. He’ll just eat the piñata stick. We must be faster than the feral ones.” Molly slithered out of the van, her body moving with a liquid grace that belied the absurdity of her ears. “You’re the distraction, Boopsy. I’m the resolution. Make them laugh so hard they don’t notice their new air fryer growing legs and crawling away inside a fluffy tail.”
created_at
"2026-04-25T14:51:15.549Z"
4/25/2026, 2:51:15 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
894
moderation_latency
(null)
generation_time
20936
tokens_prompt
77
tokens_completion
982
native_tokens_prompt
67
native_tokens_completion
866
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000782565
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"8605bdb1-8097-4af9-8d5e-aa3dd681399b"
is_byok
false
latency
278
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777128675-fzlje4HPQOQcnDUg5blr"
session_id
(null)
api_type
"completions"
id
"gen-1777128675-Xtp9Fe2SCQi0YPzNzJXf"
upstream_id
"8605bdb1-8097-4af9-8d5e-aa3dd681399b"
total_cost
0.000782565
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
612
0.00%
Dialogue to Total Word Ratio
Ratio: 54.59%, Deviation: 24.59%
neededClean
false
wordsTotal
621
wordsDialogue
339
0.0000%