NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2DVA93QJ5V2PXN27DYX4D0
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
25.5s
Cost:
$0.000736
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly knelt on the fire escape, pigeon droppings soaking into her designer slacks, and peered through the rain-streaked window at a mountain of glittering gift boxes. “I count forty-three presents,” she whispered, not taking her eye from the diamond loupe she’d pressed against the glass. “Forty-four if you include that one shaped like a depressed hippopotamus.” Bill was below on the street, disguised unconvincingly as a hot dog vendor. A single, sad sausage rolled lazily on his grill. “Forty-four presents at a six-year-old’s birthday party? Who is this kid, a tiny Saudi prince?” “Worse. An influencer’s child. The goodie bags have monogrammed silk pajamas, Bill. Silk pajamas. I haven’t owned silk pajamas since the Copenhagen Sapphire Affair.” “That wasn’t an affair, Molly, you fell into a canal and we had to burn our clothes to avoid Interpol.” He flipped the sausage with aggressive boredom as a mother pushing a stroller glanced his way. “So we go in during the magician’s act. Standard misdirection.” Molly retracted the loupe with a practiced flick. “No good. Magician cancelled. Apparently, a fluffy white rabbit is a ‘choking hazard’ and an ‘outdated trope of problematic wonder.’ It’s a petting zoo instead.” “A petting zoo. Inside an Upper West Side duplex.” “A miniature therapy donkey named Sashimi and three profoundly anxious pygmy goats. The children are encouraged to share their feelings with the livestock, which I believe puts them squarely in our professional wheelhouse. We will be the petting zoo handlers.” Bill abandoned his hot dog cart entirely, letting a tiny drifter of steam signal his distress. “I am not dressing as a farmhand, Molly. My union has rules about pastoral cosplay.” “You’ll be the donkey handler. The costume includes a straw hat so large it functions as a personal privacy screen. I’ve already forged the veterinary credentials. We’re from an outfit called Bleats & Brays Emotive Animal Therapy. Our business card uses a font called ‘Neutraface’ which, I’m told, implies a non-threatening yet clinically effective affect.” A long pause drifted up the fire escape, filled only by the faint, bleating cry of an emotionally unprepared goat from inside the party. “How do we extract forty-four presents under the cover of a sad donkey?” Bill finally asked. “We’re not extracting presents, we’re extracting ‘biohazardous organic enrichment materials.’” Molly said, smiling for the first time. “That’s what we’ll call the presents on our manifest. The donkey will be wearing a specially designed therapeutic weighted blanket with dozens of hidden, oversized pockets. A bit like a quadrupedal shoplifter’s trench coat. We load Sashimi up, we process the children’s experiential trauma from the animal visit, and we walk out the service elevator. The parents will be so terrified of their own children’s emotional fragility they’ll hold the door for us.” Bill sighed the sigh of a man who had, in a past life, stolen a live penguin for similar reasons. “And if the donkey brays at the wrong moment? Or the pygmy goats stage a psychological mutiny?” Molly stood up, dusting the urban filth from her knees. “Then we pivot to our exit strategy. I’ve hidden a smoke machine in the bounce house. It’s set to simulate a catastrophic, yet enchantingly whimsical, structural failure. In the ensuing panic of glitter and tears, we are simply two sad clowns fleeing with a donkey full of loot. Now, are you ready to meet your spirit animal, or should I call my second cousin who owes me a favor involving a stolen hot air balloon?”
created_at
"2026-04-25T13:39:14.343Z"
4/25/2026, 1:39:14 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
910
moderation_latency
(null)
generation_time
25386
tokens_prompt
77
tokens_completion
961
native_tokens_prompt
67
native_tokens_completion
812
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000735585
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"83aece41-69e9-46a6-a7f7-aa18dd44a43a"
is_byok
false
latency
287
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777124354-Gt8geqjO0Ndw1Kk0NqGP"
session_id
(null)
api_type
"completions"
id
"gen-1777124354-6STSoPaVc0bmYdDnfRsV"
upstream_id
"83aece41-69e9-46a6-a7f7-aa18dd44a43a"
total_cost
0.000735585
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
583
0.00%
Dialogue to Total Word Ratio
Ratio: 71.48%, Deviation: 21.48%
neededClean
false
wordsTotal
589
wordsDialogue
421
0.0000%