NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2HY1X8WYR3CNTQSJ7PK8PR
Run Details
Model:
DeepSeek V4 Pro (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
42.9s
Cost:
$0.001277
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her binoculars against the car’s fogged window, the rubber eyecups leaving twin suction rings around her sockets. Across the street, a bouncy castle shuddered gelatinously in a suburban backyard, vomiting a stream of glitter-smeared toddlers onto the grass. A banner reading HAPPY 6TH BIRTHDAY CHAOS GREMLIN flapped limply above a gift table stacked high with boxes wrapped in paper so shiny it looked like a unicorn had been mulched. Bill sucked his teeth and rotated a lollipop from one side of his mouth to the other like a safecracker spinning a dial. “Run it again,” he said. Molly lowered the binoculars. “I’ve run it six times. The ventilation system is a Fisher-Price bouncy house. We insert via the clown car, neutralize the piñata, extract the haul.” She tapped a crayon-drawn blueprint on her knee. The getaway route was marked with a dotted line ending in a stick figure holding a very large cake. “Neutralize the piñata,” Bill repeated, flatly. “It’s a papier-mâché donkey stuffed with Fun-Dip, not a laser grid.” “And yet it’s the linchpin. You saw the intel. That piñata drops at three o’clock sharp, triggering a sugar-crazed stampede. We use that forty-second window to hit the gift pile while every chaperone is swatting blindfolded kids with a broom handle.” Bill flipped open a notebook dense with paranoid scrawl. “What’s the clown-car payload capacity? If I’ve gotta fold myself into a twelve-inch cube again, my left knee will formally secede from my body.” “I’ve upgraded. It’s a pedal-powered fire truck. Two clown slots, generous legroom, a functional water cannon we can fill with apple juice for a diversionary spill.” He considered this. “Acceptable. And the mark? The birthday kid?” Molly’s face hardened with professional gravity. “Codename: Sticky Griffin. Ringleader of this operation. She has a known allergy, a weak grasp of object permanence, and a nap scheduled for two forty-five. We must strike while she’s unconscious and the piñata’s still swinging. Once she awakens and the paper rips, all her little guests become a feral distribution network for stolen property rights.” Bill pulled a second lollipop from his vest, an emergency backup, and stuck it in the side of his mouth next to the first, chipmunking his cheeks. “And if the nap finishes early? If Sticky Griffin wakes up mid-heist and sees me upending her loot pile into a sack shaped like a rainbow trout?” “We pivot to Plan B,” Molly said, producing a small, laminated card. It featured a cartoon bear weeping into a handkerchief. “You claim to be a present inspector, here to check for faulty sound chips and loose bow ribbons. Children respect plausible bureaucracy. I’ll be outside, revving the get-away wagon and subtly eating a large distracting hot dog.” “Subtly eating a hot dog isn’t a tactic, it’s a lunch order.” “Not the way I do it. I’ll use jazz hands.” Bill extracted a tiny stopwatch from his waistcoat. “We approach at fourteen hundred hours, disguised as licensed party-goers. You’ll breach security—meaning you’ll compliment the host’s seven-layer dip—while I establish a blind spot behind the inflatable slide. At the piñata signal, we converge, load the birthday economics into the trout sack, and exfiltrate through the hedge shaped like a dolphin. Departure vehicle: a rented minivan with a ‘My Child Is an Honor Student’ decal for cover. ETA at our fence’s basement: exactly one viewing cycle of an unskippable ad on a tablet dropped by a toddler.” “That’s seventy-three seconds. You’re cutting it tight,” Molly said. “I’m cutting it real. The ad ends, some kid’s gonna notice his new plastic dinosaur combine harvester has vanished from his lap.” Molly nodded slowly, stowing her crayon map in a manila envelope labeled TOP SECRET DO NOT EAT. “One last complication. The clown shoes. You’re wearing them this time.” Bill recoiled, his lollipops clacking like antlers. “Unacceptable. The honking undermines my stealth.” “That’s the genius, Bill. When every step you take emits a jubilant squeak, nobody believes you’re there to commit a felony. You’re just a jolly man with enormous feet, and jolly men don’t steal paw patrol towers.” She slapped the dashboard, hard. A little pine-scented tree fell off the mirror. “Now let’s move. Sticky Griffin only naps once.”
created_at
"2026-04-25T14:50:38.38Z"
4/25/2026, 2:50:38 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
704
moderation_latency
(null)
generation_time
42825
tokens_prompt
77
tokens_completion
1592
native_tokens_prompt
67
native_tokens_completion
1434
native_tokens_completion_images
(null)
native_tokens_reasoning
387
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001276725
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"17dd12bc-ce97-4f91-8f02-687f189e4d6e"
is_byok
false
latency
275
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777128638-Bh8JqBSy3KpQG9LY4VQt"
session_id
(null)
api_type
"completions"
id
"gen-1777128638-0gRvI13fhvjglDjDevma"
upstream_id
"17dd12bc-ce97-4f91-8f02-687f189e4d6e"
total_cost
0.001276725
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
705
0.00%
Dialogue to Total Word Ratio
Ratio: 66.62%, Deviation: 36.62%
neededClean
false
wordsTotal
722
wordsDialogue
481
0.0000%