NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2JEC7PB2AS9NYVGWYNN26H
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
10.3s
Cost:
$0.000199
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.02%
Dialogue to Total Word Ratio
(details)
0.0124%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“It’s a birthday party,” Bill said, staring at the blueprints like they were a sacred text. “For a six-year-old. Named Cody.” “And Cody’s getting a zip-line,” Molly whispered back, tapping the map with a gloved finger. “A deluxe, battery-powered, ride-on zip-line from Uncle Rick. We’re not robbing a kid, Bill. We’re robbing a zip-line.” “From a kid.” “The kid’s not going to use it. He’s six. He’ll cry when the batteries die in twenty minutes. We’re providing a service.” Bill rubbed his temples. “And the unicorn piñata?” “Sugar-free candy. Nobody wants that. We’d be doing the dentists a favor.” Molly unfolded a party-favor list. “Look at this booty. A Lego Star Destroyer. A digital microscope. A signed copy of ‘The Man Who Walked Between the Towers.’ That’s liquid gold on the dark web.” “It’s a children’s book.” “It’s out of print! Parents pay a hundred bucks for that. We’re not stealing joy, Bill. We’re redistributing it.” Bill sighed, a long, defeated groan. “And the magician? Mortimer the Magnificent? He’s got doves. Live doves, Molly.” “He’ll be busy making balloon giraffes when we slip out the back. I’ve timed it. He does the dove trick at 2:15, then he’s stuck untangling a rabbit from a top hat for six minutes.” “What about the cake?” “We don’t touch the cake. That’s a line I won’t cross.” “No, I mean the cake is in the way. It’s on the gift table. He’s having a volcano cake. With dry ice. It’s going to smoke up the whole room.” Molly’s face lit up. “That’s not an obstacle. That’s cover. We’re ninjas in a fog machine.” She scribbled something on a napkin. “I’ll take the south approach, by the moon bounce. You come in from the kitchen, through the pantry. The mother’s name is Karen. She’ll be crying in the bathroom at 2:30 because the magician’s assistant is wearing a prettier hat.” “How do you know that?” “I follow her on Nextdoor. She’s a mess.” Bill picked up the zip-line brochure again. “It says here it needs three C batteries. We don’t have C batteries.” “We’ll grab some from the smoke detector in the hall closet.” “That’s a fire hazard.” “It’s a zip-line, Bill. Priorities.” Molly stood, dusted off her black turtleneck. “We go in as party planners. We go out as legends. And Cody gets a story to tell his therapist about the time the unicorn balloon popped and his presents vanished into a cloud of dry ice.” Bill was silent for a long moment. Then he smiled, slow and crooked. “Fine. But I’m keeping the microscope for myself.”
created_at
"2026-04-25T14:59:33.241Z"
4/25/2026, 2:59:33 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
499
moderation_latency
(null)
generation_time
10177
tokens_prompt
77
tokens_completion
725
native_tokens_prompt
67
native_tokens_completion
679
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001995
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"61b45886-1e45-4ca0-a6c5-c435f26fbec6"
is_byok
false
latency
278
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777129173-ARbWpv8930FyjQKpL8q3"
session_id
(null)
api_type
"completions"
id
"gen-1777129173-T5oS7nI5mAEa5A39cPsE"
upstream_id
"61b45886-1e45-4ca0-a6c5-c435f26fbec6"
total_cost
0.0001995
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
434
0.02%
Dialogue to Total Word Ratio
Ratio: 82.88%, Deviation: 12.88%
neededClean
false
wordsTotal
444
wordsDialogue
368
0.0124%