NC Bench
Run: 2026-04-25-01KQ2EAJR849Z22238KCD2W4YF
Run Details
Model: DeepSeek V4 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 11.2s
Cost: $0.000207
Provider: DeepSeek
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The fairy lights strung across the backyard cast a deceptive glow, twinkling like a hundred tiny, judgmental eyes. Molly crouched behind a decorative garden gnome that was inexplicably holding a tiny pitchfork, its ceramic grin frozen in mockery. She nudged Bill, who was wedged between a trampoline and a paddling pool shaped like a dragon. “Tell me again why we’re casing a five-year-old’s birthday party?” Bill whispered, his voice a low rumble that smelled faintly of last night’s gas station burrito. “Because, Bill, that’s where the money is. Or rather, the gift cards. And the cash-stuffed cards from Grandma. And the Nintendo Switches. Have you seen the haul at a modern kid party? It’s a liquid asset buffet.” Molly adjusted her binoculars. “Look at the table. Tinker Bell wrapping paper. That’s premium. Guaranteed contains something rechargeable.” “I still think we should have stuck with bank lobbies. Less chance of getting pelted by a bouncy ball.” “Banks have cameras. This place has a princess-themed bounce house. Which, by the by, is a perfect entry point. We go over the fence, under the bouncy house tarp, and we’re inside the present pile before they even call for cake. Easy.” Bill shifted, his knee cracking like a dry branch. “And the parents?” “Dads are grilling. Moms are on their third glass of sangria. They’re hyper-focused on preventing the clown from accidentally lighting himself on fire. We’re ghosts, Bill. Ghosts with pockets.” A child shrieked nearby, and they both flinched. “Fine,” Bill said. “But if I end up with rainbow sprinkles in my hair again, I’m billing you for dry cleaning.” “Noted. Now, the plan: I create a diversion. When I give the signal—a loud, off-key rendition of ‘Happy Birthday’—you slip in from the side. Grab the big ones first. The ones with the fancy bows. Those are from the rich cousin. Do not, under any circumstances, grab the one wrapped in newspaper. 
That’s from the broke uncle and contains socks.” “What’s our exit?” “Over the fence. We’ll use the spare pool float as a decoy. Leave it floating in the kiddie pool. They’ll think it’s a runaway helium tank.” Bill stared at the ball pit. “This is insane. We are orchestrating a heist at a party where the most threatening adult is wearing a tiara.” “And that’s why it’s genius. No one expects the magpie. Now, get ready. I see the pinata swoon. That’s our moment.” Molly slithered forward, her dark clothes blending into the shadows of the inflatable unicorn. The DJ started a remix of the “Baby Shark” theme. Somewhere, a balloon popped. It was time.
created_at: "2026-04-25T13:47:34.539Z" (4/25/2026, 1:47:34 PM)
model: "deepseek/deepseek-v4-flash-20260423"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 720
moderation_latency: (null)
generation_time: 11146
tokens_prompt: 77
tokens_completion: 775
native_tokens_prompt: 67
native_tokens_completion: 704
native_tokens_completion_images: (null)
native_tokens_reasoning: 65
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0002065
router: (null)
provider_responses:
  0:
    endpoint_id: "722e0746-2034-4d6e-a08f-e6271849ee9e"
    id: "9f17b3fe-2fe1-45ce-bd9e-4e634c5dbb2d"
    is_byok: false
    latency: 278
    model_permaslug: "deepseek/deepseek-v4-flash-20260423"
    provider_name: "DeepSeek"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777124854-xpEM7i52HVOHewjcmzb3"
session_id: (null)
api_type: "completions"
id: "gen-1777124854-mv7mqCqNOQl1hgOmEVE0"
upstream_id: "9f17b3fe-2fe1-45ce-bd9e-4e634c5dbb2d"
total_cost: 0.0002065
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepSeek"
response_cache_source_id: (null)
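The raw metadata can be turned into a couple of derived figures. The sketch below is illustrative only and rests on two assumptions that the page does not state: that generation_time is reported in milliseconds (it roughly matches the 11.2s duration shown under Run Details), and that native_tokens_reasoning is counted as part of native_tokens_completion. The function name summarize is hypothetical.

```python
# Values copied from the metadata listing for this run.
meta = {
    "generation_time": 11146,          # assumed to be milliseconds
    "native_tokens_completion": 704,
    "native_tokens_reasoning": 65,     # assumed to be included in the completion count
}

def summarize(meta: dict) -> dict:
    """Derive throughput and reasoning share from the run metadata."""
    seconds = meta["generation_time"] / 1000
    completion = meta["native_tokens_completion"]
    return {
        "tokens_per_second": completion / seconds,
        "reasoning_share": meta["native_tokens_reasoning"] / completion,
    }

if __name__ == "__main__":
    stats = summarize(meta)
    print(f"throughput: {stats['tokens_per_second']:.2f} tokens/s")
    print(f"reasoning share: {stats['reasoning_share']:.2%}")
```

If either assumption is wrong, the figures change accordingly, so treat them as a rough reading of the run rather than reported results.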
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 435
0.00% | Dialogue to Total Word Ratio | Ratio: 70.75%, Deviation: 40.75% | neededClean: false; wordsTotal: 441; wordsDialogue: 312
0.0000% | Overall
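The reported ratio and deviation follow directly from the word counts: 312 dialogue words out of 441 total is about 70.75%, which is 40.75 percentage points above the 30% target. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is not the benchmark's evaluator: the quote-matching rule, the whitespace tokenization, and the function names dialogue_ratio and deviation_points are all assumptions made for illustration.

```python
import re

def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a piece of prose.

    Assumption: dialogue is any text between paired double quotes
    (curly or straight), and words are whitespace-separated tokens.
    """
    total_words = len(text.split())
    spans = re.findall(r'[“"]([^”"]*)[”"]', text)
    dialogue_words = sum(len(span.split()) for span in spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio

def deviation_points(ratio: float, target: float = 0.30) -> float:
    """Absolute gap between the ratio and the target, in percentage points."""
    return abs(ratio - target) * 100

if __name__ == "__main__":
    # Reproduce the figures reported for this run from its word counts.
    reported = 312 / 441
    print(f"ratio: {reported:.2%}, deviation: {deviation_points(reported):.2f}")
```

A rule this simple also counts quoted titles in narration (such as “Baby Shark” in the response above) as dialogue, so its counts may differ from the evaluator's.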