NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2EVSFZZ6NCD9D2V0PW39SN
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
23.2s
Cost:
$0.000241
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“You’re sure about this?” Bill whispered, peering through the rhinestone-encrusted binoculars at the bouncy castle. “I mean, sure-sure? Not ‘I-saw-it-on-a-cereal-box-and-it-looked-fun’ sure?” Molly adjusted the strap of her neon-pink fanny pack, which held a lockpick set and three juice boxes. “Bill, the target is a seven-year-old’s birthday party. The loot is a mountain of wrapped presents stacked near a piñata shaped like a unicorn. We are professionals.” “Professionals don’t usually hide behind a hedge wearing a giant balloon animal costume, Mol.” “It’s called a disguise. Blend in. The theme is circus. I’m a poodle. You’re a limping tiger.” Bill looked down at his own sagging orange-and-black bodysuit. “I wanted to be the magician. I had a whole spiel. ‘Now you see the present, now you don’t.’” “Your ‘spiel’ involves a smoke bomb and a rubber chicken. The magician is currently throwing up in the bathroom. Probably from too much cake. We strike now.” A shrill shriek erupted from the party. A child had fallen off the slide. “See?” Molly said. “Distraction. We go for the pile. Grab the big ones with the sparkly bows. Do not, under any circumstances, grab the one with the dinosaur wrapping. That’s for the birthday girl, and she’s got a kung-fu grip and a personal vendetta against anyone touching her stuff.” “How do you know that?” “I did reconnaissance. She bit the mailman last Tuesday.” Bill shook his head. “This is insane. We’re stealing from children. What if they cry? I can’t handle crying. Last time I tried to boost a toddler’s tricycle, I ended up buying it a juice box and apologizing to its mom.” “That was your fault. You offered to help it build a fort. We don’t build. We steal.” “Right. Steal. From a seven-year-old. While dressed as a tiger with one leg that’s twenty percent shorter than the other.” Molly sighed. “Fine. New plan. We use the bounce house as a smokescreen. You create a diversion by pretending to have a sudden allergy to face paint. I’ll crawl under the table and empty the gift bag into my fanny pack. If anyone asks, I’m a very small postal worker.” “You’re five-foot-nine.” “I’ll slouch. And wear the balloon poodle’s head.” Bill groaned. “We are the worst criminals. We once tried to rob a hot dog stand and forgot the ketchup.” “That was your fault! You wanted mustard!” “It’s a classic pairing!” A new scream pierced the air. The mother was heading toward the gift table. “Now or never!” Molly hissed. Bill took a deep breath. “Okay. But if I get glitter in my fur, you’re explaining this to my therapist.” “Deal. On three. One… two… three.” They charged. Bill immediately tripped over his own tail and face-planted into a bowl of fruit punch. Molly, undeterred, dove under the table, grabbed the sparkliest package, and rammed it into her fanny pack. It barely fit. The zipper screamed. “Freeze!” a voice boomed. They both looked up. The birthday girl stood over them, arms crossed, face smeared with blue frosting. “That’s my present,” she said. Molly grinned nervously. “It’s… a safety test. For pirates.” The girl narrowed her eyes. “You’re not pirates. You’re weird.” Bill sighed, pulling a juice box from his costume. “Want a Capri Sun?”
created_at
"2026-04-25T13:56:58.499Z"
4/25/2026, 1:56:58 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
571
moderation_latency
(null)
generation_time
12995
tokens_prompt
77
tokens_completion
922
native_tokens_prompt
67
native_tokens_completion
826
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00024066
router
(null)
provider_responses
0
endpoint_id
"934a69f9-bd54-474b-beca-24560f721e12"
is_byok
false
latency
60
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepInfra"
status
(null)
1
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"7e9198f1-ab0c-4a97-9aa2-dce4eeeced26"
is_byok
false
latency
282
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777125418-FX9d1Xiv38bx3UhyJ6nd"
session_id
(null)
api_type
"completions"
id
"gen-1777125418-Rd4XZSzZg5LG0ydHWRT0"
upstream_id
"7e9198f1-ab0c-4a97-9aa2-dce4eeeced26"
total_cost
0.00024066
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
541
0.00%
Dialogue to Total Word Ratio
Ratio: 69.86%, Deviation: 19.86%
neededClean
false
wordsTotal
554
wordsDialogue
387
0.0000%