NC Bench
Run: 2026-04-25-01KQ2FKAWHNFXV4DJDPH6F3WD8
Run Details
Model: DeepSeek V4 Flash
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 9.7s
Cost: $0.000188
Provider: DeepSeek
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted through the binoculars, her face pressed against the grimy window of the parked ice cream truck. “Okay, Bill. The objective is clear. The Jones-Bartholomew twins. Their ‘Frozen Dragons and Laser Unicorns’ party.” She lowered the binoculars. “Presents. Lots of them.” Bill scratched his chin, his eyes wide with dubious calculation. “We’re stealing from toddlers, Molly. Toddlers. They get upset. They cry.” “Crying is a reversible condition,” Molly said, spinning a purple balloon on her finger. “A penthouse is not. And that’s what we’re building, Bill. A penthouse made of gift cards and tiny, questionably-licensed superhero figurines.” Bill sighed, the sound of a man who had once tried to steal the crown jewels and ended up stealing a goat. “So, what’s the play? We dress as a giant bag of Cheez-Its?” “Better.” Molly tapped the truck’s engine. “The ice cream man is my cousin, Lenny. He’s inside, hooking up the disco ball for the ‘Dance of the Conquering Hamsters’ segment. We have his uniform.” “You’re going to waltz in, ring a little bell, and say, ‘Who wants a Choco-Taco?’” Bill asked, flatly. “No, you are,” Molly said, handing him a crisp white cap. “People trust the ice cream man. It’s a sociological fact. You’re the face. The comforting, slightly nervous face.” “What are you going to be?” Molly unzipped a duffel bag and pulled out a bright yellow rubber chicken the size of a Labrador. “General Klaxon of the Royal Flock of Yon. The twins’ birthday party is pirate-themed, but I’ve been assured the children are… flexible on lore.” Bill stared at the chicken. “That’s a puppet with a peg leg.” “It’s a character, Bill!” she hissed. “A pirate chicken who has turned to a life of plunder after his family was turned into chicken nuggets by the evil Baroness of Broccoli. It’s deep.” “That is insane.” “That is distraction,” Molly corrected. 
“You walk in, sell a few fifty-dollar popsicles to the parents, charm them, and while you’re filling the inflatable unicorn with helium, I’ll deploy General Klaxon from the cake table. The kids will be mesmerized. You then casually gather the gift bags. We slip out the garden gate.” Bill put the cap on, adjusting his reflection in the side mirror. “And what happens when the parents realize the cake is now wearing a patch and screaming for ‘sweet, sweet revenge’?” “Then we have a new problem,” Molly said, her voice a whisper. “The authenticity of the performance. Now, get the bell. And for the love of money, don’t smile. It looks like you’re planning a felony.”
created_at
"2026-04-25T14:09:49.973Z"
4/25/2026, 2:09:49 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
553
moderation_latency
(null)
generation_time
9586
tokens_prompt
77
tokens_completion
714
native_tokens_prompt
67
native_tokens_completion
639
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001883
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"ca3c3366-c3a7-402b-a107-89aafd54892a"
is_byok
false
latency
272
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777126189-fvgquijfocflRXAm1oU4"
session_id
(null)
api_type
"completions"
id
"gen-1777126189-3Ua6aNlHFTmCkprTCJiE"
upstream_id
"ca3c3366-c3a7-402b-a107-89aafd54892a"
total_cost
0.0001883
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 429
0.00% | Dialogue to Total Word Ratio | Ratio: 71.49%, Deviation: 41.49% | neededClean: false; wordsTotal: 435; wordsDialogue: 311
0.0000% | Overall
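The reported ratio and deviation can be reproduced from the wordsDialogue and wordsTotal values above. The sketch below is illustrative only: the 30% target comes from the scenario text, while the helper names, the whitespace-based word count, and the curly-quote definition of dialogue are assumptions, not the benchmark's actual evaluator. How the 0.00% score is derived from the deviation is not shown on this page and is not modeled here.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: "30% dialogue"


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed word definition)."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside curly double quotes (assumed dialogue definition)."""
    spans = re.findall(r"“([^”]*)”", text)
    return sum(count_words(span) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return the dialogue share and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Values reported in the evaluation details for this run.
ratio, deviation = ratio_and_deviation(words_dialogue=311, words_total=435)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 71.49%, Deviation: 41.49%"
```

The two evaluators report different totals (429 and 435 words), so they presumably tokenize the response differently; a whitespace split as used here will not necessarily match either count.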