NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2DQFTPFX0956JTAEE3SQEA
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
25.5s
Cost:
$0.000729
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly had the look of a woman who was about to weaponize a bouncy castle. She was staring at the hand-drawn map on the coffee table, a crayon-scribbled nightmare of arrows and Xs that Bill had produced with a worrying amount of pride. The X marked the spot not of buried treasure, but of a folding table groaning under the weight of gift-wrapped plastic and sugar-based bribes. Bill, a man whose physical description demanded the word "schnozz," tapped the map with a licorice whip he was using as a pointer. "Right, so. Operation Gummy Bear Gauntlet. Phase one: ingress. We blend, Moll. We become the party." He was dressed in a pair of cargo shorts that had enough pockets to hide a small monarchy's crown jewels and a t-shirt that read 'World's Okayest Uncle'. Molly's upper lip curled. "Your entire plan is to wade into a sea of over-sugared five-year-olds and hope they don't notice a man built like a pallbearer helping himself to a princess castle dollhouse." "It's the 'World's Okayest Uncle' shirt," Bill said, completely missing the point. "It disarms. It says, 'I might be a stranger, but I probably have a lightly humorous anecdote about a barbecue.' Nobody questions the uncle." "They will when you try to leave with a stuffed giraffe the size of a Shetland pony strapped to your back." "Phase two," Bill continued, undeterred, using the licorice to point at a spot on the map marked 'Punch Pit'. "The diversion. I’ve paid a dozen grade-schoolers from the local park. Five bucks each to re-enact the storming of the beaches of Normandy using juice boxes and cupcake sprinkles." Molly blinked slowly, a gesture of profound, spiritual exhaustion. "You've retained a child militia." "A small-business owner retains. I've just incentivized some messy, clumsy chaos. While little Timmy is screaming about a Capri Sun grenade in his eye, I'm filling a garbage bag shaped like a birthday clown with enough Lego to build a reasonably sized panic room." He smiled, a stretch of lips that suggested a sheer-drop cliff face. "That's where you come in. Phase three: extraction and denial." "Denial," Molly repeated, the word tasting like ash and cheap icing. "You're the grieving clown." Bill threw the licorice whip over his shoulder, where it stuck to the lampshade. "You burst in five minutes after the juice box apocalypse, sobbing. Honking your nose in a rhythm of pure despair. You tell the shell-shocked parents that your hamster, Mister Hobbs, just died and you're too heartbroken to make balloon animals. I'll use the emotional paralysis to waddle out with the haul." "Let me dissect this," Molly said, her voice dangerously quiet. "Your master plan hinges on a troop of feral mercenaries you recruited for the price of a large pizza, and my ability to weaponize a grief-stricken honk to cover our escape." "It's lean. It's efficient. It weaponizes pathos." "The giant giraffe, Bill." She pointed a finger sharp enough to pop a balloon at fifty paces. "What about the impossible-to-conceal giant giraffe?" Bill’s face split into a grin of pure, unadulterated idiocy. "Inflatable decoy. While they're trying to figure out how a grieving clown and a clog-dancing giraffe know each other, we're already three blocks away, splitting the proceeds from a Mega-Chunk Blaster 9000." He leaned back, smugness radiating from him like heat off a summer sidewalk. "You're thinking like a thief. You need to start thinking like a clown."
created_at
"2026-04-25T13:37:08.954Z"
4/25/2026, 1:37:08 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1172
moderation_latency
(null)
generation_time
25457
tokens_prompt
77
tokens_completion
883
native_tokens_prompt
67
native_tokens_completion
805
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000729495
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"fef23068-02bb-41df-a1f6-96083ea468b7"
is_byok
false
latency
274
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777124228-O0mTKHLwjpbplS9sfNtk"
session_id
(null)
api_type
"completions"
id
"gen-1777124228-yhg8Yhjrh81susUkrwGx"
upstream_id
"fef23068-02bb-41df-a1f6-96083ea468b7"
total_cost
0.000729495
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
572
0.00%
Dialogue to Total Word Ratio
Ratio: 60.71%, Deviation: 30.71%
neededClean
false
wordsTotal
588
wordsDialogue
357
0.0000%