NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2FCAH281Z6SJ22JTYCJ1F8
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
21.0s
Cost:
$0.000737
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her earpiece, the absurdly large lollipop she was using as a prop nearly taking out a potted fern. "So what's the official story on the bouncy castle?" Bill asked, his voice crackling through the comms. "Are we treating it as an obstacle or an asset?" "I'm treating it as a threat, Bill. A vinyl-coated threat. Have you ever tried to move stealthily in an inflatable death trap filled with sugar-crazed gremlins? It's like running in a nightmare made of air." "Noted. I'll attach a small distraction device to the generator. On your signal, the inflatable giraffe will make a sound that suggests it has a severe gastric issue. That'll clear the ethical dilemma of pushing a five-year-old out of the way." "An ethical heist, that's what I'm aiming for," Molly said, peering around a giant cardboard cutout of a cartoon puppy. "Okay, the present table is a mountain of glittering temptation. My target is the big one in the galaxy-print wrapping paper. That's the one little Timmy’s mum was bragging about at the school gates. A 'limited-edition gaming console'. She actually mouthed the words 'you can't buy this anymore'. The arrogance is dizzying." "Dizzying, she says. I'm currently dressed as a party clown. My dignity is a distant speck. My target is the three-foot-tall plush unicorn. Don't ask why, it's for a very specific, very wealthy collector with a perverse sense of interior design. What's your extraction method for the console?" "Decoy present. I've got a box wrapped in identical paper, filled with artisanal fruitcake. It weighs a ton and smells of despair." "Diabolical. Nobody checks a heavy present. They assume it's just an educational encyclopaedia. What about the kid, though? Little Timmy. He'll be devastated." "Bill, the kid is four. By 3 PM his brain will be a soup of cake and cartoon-induced mania. He'll open the fruitcake, his face will do that crumpled thing for exactly six seconds, and then a relative will give him a drum kit and he'll forget it ever existed. We're not stealing joy, we're slightly delaying it on a minor scale. It's redistribution of temporary euphoria." "A regular Robin Hood, you are. My intel says the magician is running late. That's our window. A room full of children with no scheduled entertainment is a powder keg. I'll suggest an impromptu game of 'Sleeping Lions'. It's the only thing that buys us silence." "That's genius. Thirty kids flat on the floor, faking sleep. We can just waltz in. What's your cover name as the clown?" "Bubbles the Silent. My gimmick is mime. A silent, desperately sad mime who only makes balloon poodles that look like they've seen terrible things. Parents love the existential dread. Keeps them from asking questions." "I'm entering the gift zone now. The birthday boy is crying because he squeezed his juice box too hard. This is the chaos we prayed for. I'm making my approach. The fruitcake is in my hand. How's the giraffe's stomach, Bubbles?" "Primed and ready. On my mark... Wait, abort. Abort! The mother, she's staring at me. She's mouthing something. She's asking me to make a balloon hat for a child who is currently wiping his nose on my oversized shoe. This is not in the dossier." "Improvise, Bubbles! Create a masterpiece of a hat. A tragic, beautiful hat. While his head is encased in latex, I make the switch. Ready? One... two... unleash the giraffe!"
created_at
"2026-04-25T14:06:00.23Z"
4/25/2026, 2:06:00 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
659
moderation_latency
(null)
generation_time
20923
tokens_prompt
77
tokens_completion
869
native_tokens_prompt
67
native_tokens_completion
814
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000737325
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"bb12fa79-c65a-411d-bf44-66ee454d7057"
is_byok
false
latency
288
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777125960-1rgNXq4HRg3RksckhAV5"
session_id
(null)
api_type
"completions"
id
"gen-1777125960-SJuI8q1gJ9lq3tYgkdgW"
upstream_id
"bb12fa79-c65a-411d-bf44-66ee454d7057"
total_cost
0.000737325
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
572
0.00%
Dialogue to Total Word Ratio
Ratio: 93.12%, Deviation: 23.12%
neededClean
false
wordsTotal
581
wordsDialogue
541
0.0000%