NC Bench
Run: 2026-04-25-01KQ2E538WVBG9B5K1B140Z0E7
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
11.4s
Cost:
$0.000220
Provider:
DeepSeek
Evaluation summary
Score      Evaluator
98.71%     Matches word count
0.00%      Dialogue to Total Word Ratio
49.3562%   Overall
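The Overall figure looks consistent with an unweighted mean of the two evaluator scores. That is an inference from the numbers shown on this page, not something the page states. A minimal sketch under that assumption follows; the dictionary name `scores` is illustrative only.

```python
from statistics import mean

# Scores as displayed in the summary (the page rounds them to two decimals).
scores = {
    "Matches word count": 98.71,
    "Dialogue to Total Word Ratio": 0.00,
}

# Assumption: the overall score is the unweighted mean of the evaluator scores.
overall = mean(list(scores.values()))
print(f"Overall (from rounded scores): {overall:.3f}%")
```

The result from the rounded inputs is about 49.355%, slightly below the displayed 49.3562%. The gap would be explained if the word-count score is stored at higher precision than the two decimals shown, but that is also an assumption.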
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air in Dinky’s Party Palace smelled like stale cake and rubber. Molly was wedged behind a plastic palm tree, peering through its fronds at what she called the “Score-O-Mat.” Bill, who had been elbowed in the ribs twice for calling it the “Toddler-Trove,” was busy trying to pop a small, colorful balloon without looking like a giant man strangling a bubble. “Okay,” Molly whispered, her voice a taut wire. “Inventory. The purple mountain at table three is the Loot-Bot 3000, retail value three hundred dollars.” “It’s a pile of toy hovercrafts,” Bill grunted. The balloon finally yielded with a sad little phfft. “And it’ll be sticky.” “Sticky is profit, Bill.” He rubbed his ear. “And the little red one in the corner?” “Gift bag. Miss Poppet’s Custom Sewing Kit. Two hundred and fifty.” Bill squinted. “For a sewing kit? Do kids even sew?” “They glue felt to rocks, Bill. It’s a luxury felt-rock-gluing economy. We’re not here to judge capitalism, we’re here to liberate it.” She nudged him with her elbow. “Focus. The main haystack is the giant pink castle cake. That’s where they drop the envelopes. Cash.” “Ew,” Bill said. “The cake has a fondant moat. What if the envelopes are in the moat?” “Then we drink the moat,” Molly snapped. “The moat is for the brave.” A piercing shriek of laughter erupted from the play area, where a small child was trying to fit a fireman’s hat onto a dachshund. Bill flinched. “I don’t know, Moll. There’s a lot of… noise. And a clown. We didn’t budget for a clown.” “We are the clown now,” Molly said, tossing him a limp, rainbow-colored wig. “Put it on. You’re ‘Bubbles the Balloon Pony.’” Bill held the wig as if it were a dead rat. “I thought I was ‘Crusty the Exit Strategy.’” “You were, until the puppeteer canceled. Now you’re my spotter. When I shift the castle cake, you yell ‘Rainbow’ and drop a glitter bomb in the ball pit.” “And what do you do?” Molly grinned, her face half-lit by the disco ball above the bouncy castle. 
“I install the decoy.” From her pocket, she produced a cheap, pastel-colored sticky note that read: “Thank you for coming! Your gifts will be digitized at Fun Uncle’s Cloud Vault for maximum joy. Please enjoy your party cracker.” Bill stared at it. “They’ll know.” “They’ll be three,” Molly whispered. “They can’t read.” She paused. “And the parents will be three glasses deep into ‘hypoallergenic’ punch. Now, on my mark. The magician is about to do the dove trick. When he sneezes from the fake feathers, we move.” Bill sighed, pulling the wig over his balding head. One of the fake ponytails caught on his ear. “You know,” he said, “for being a genius, you have a remarkably weak plan.” Molly was already slithering towards the cake table. “No plan survives a birthday party, Bill. But a glitter bomb in the ball pit? That is timeless.”
created_at
"2026-04-25T13:44:34.848Z"
4/25/2026, 1:44:34 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
509
moderation_latency
(null)
generation_time
11275
tokens_prompt
77
tokens_completion
804
native_tokens_prompt
67
native_tokens_completion
751
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00021966
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"0f919f7d-3258-49be-9725-be5a2571ef46"
is_byok
false
latency
272
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777124674-iklwIZYQlexXU66Z02tf"
session_id
(null)
api_type
"completions"
id
"gen-1777124674-PM4kqvMtRzxMebdfzrYO"
upstream_id
"0f919f7d-3258-49be-9725-be5a2571ef46"
total_cost
0.00021966
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result     Evaluator                      Details                            Meta Data
98.71%     Matches word count             n/a                                neededClean: false; words: 494
0.00%      Dialogue to Total Word Ratio   Ratio: 59.40%, Deviation: 29.40%   neededClean: false; wordsTotal: 500; wordsDialogue: 297
49.3562%   Overall
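The reported ratio and deviation follow directly from the metadata: 297 dialogue words out of 500 total gives 59.40%, which is 29.40 percentage points above the 30% target in the scenario. The sketch below reproduces that arithmetic. It also includes a hypothetical quote-based counter to show one way such counts could be obtained. The regular expression, the whitespace tokenization, and the function names are all assumptions, not the evaluator's actual implementation, and the counter is not claimed to reproduce the 297/500 figures.

```python
import re

# Assumption: dialogue is any text between straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed tokenization)."""
    return len(text.split())


def dialogue_stats(text: str, target: float = 0.30):
    """Return (dialogue words, total words, ratio, absolute deviation from target)."""
    total = count_words(text)
    spoken = sum(count_words(a or b) for a, b in QUOTED.findall(text))
    ratio = spoken / total if total else 0.0
    return spoken, total, ratio, abs(ratio - target)


def stats_from_counts(words_dialogue: int, words_total: int, target: float = 0.30):
    """Recompute ratio and deviation from already-reported counts."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the Meta Data column above.
ratio, deviation = stats_from_counts(297, 500)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

Note that the two evaluators report different totals for the same response (494 words versus 500), which suggests they tokenize the text differently. The page does not say how.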