NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2HED16M5ASZJ29WG0REZ9R
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
20.2s
Cost:
$0.000191
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.07%
Dialogue to Total Word Ratio
(details)
0.0371%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“You’re telling me we’re robbing a birthday party,” Bill said, tugging at the collar of his inflatable dinosaur costume. “A child’s birthday party.” “Specifically, the presents,” Molly said, adjusting the unicorn horn on her own costume. “We’re stealing the presents. Not the cake. Not the goodie bags. The presents.” “I’m asking the obvious question, Molly. Why?” “Because the father is Gerald Grimblethorpe. The watch guy. Each kid gets a Golden Ticket watch worth twelve grand.” Bill sighed. The inflatable tail behind him flopped against a folding chair. “We’re forty-two years old.” “Bill. Look at me. We are two wizards. You are a wizard in a dinosaur suit. I am a wizard in a unicorn suit. Any hesitance you feel is cowardice wrapped in nostalgia.” “Wizards don’t wear inflatable dinosaurs.” “They do tonight. The invitation says ‘Wizard Party.’ They have a bouncy castle shaped like a spellbook. We blend in.” Bill stared at a helium balloon tied to a mailbox. “We’re going to blend in with the children’s magician?” “We are not the magician. We’re parents.” “We don’t have kids.” “We’re the divorced parents of a sick kid. His name is Kevin. He’s at home with the flu. We’re picking up his present since he’s so sad.” “Kevin’s not real.” “Neither is my dignity, yet here I am, Bill. Here we are. In dinosaur and unicorn suits. Now, the plan. The gift table is next to the cupcake tower. You cause a distraction.” “What kind of distraction?” “You trip. You fall into the bouncy castle. You start crying.” “I’m not crying.” “You’re a lizard man in full emotional meltdown. You scream that you lost your magic wand. The kids will laugh. The parents will film. Gerald Grimblethorpe will look away from his watch collection for exactly seven seconds.” “And you?” “I scoop the presents into this.” Molly held up a large gift bag with “THANKS FOR COMING, KEVIN!” in glitter letters. “Then I walk out the side gate while you pretend to be revived by a juice box.” Bill’s dinosaur suit hissed as he shifted his weight. “And if someone recognizes me from that car wash job in ‘19?” “Then we run. But we run separately, and we meet at the clown car.” “There’s a clown car?” “I rented it for the getaway. It honks when you squeeze the horn. It honks.” Bill was silent for a long moment. A child ran past them, shrieking, covered in frosting. “I hate everything about this.” “Good. You’re in character. Now go fall into the bouncy castle and scream about your wand.” “What’s my name?” “Your name is Kevin’s dad. Now move.”
created_at
"2026-04-25T14:42:05.482Z"
4/25/2026, 2:42:05 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
507
moderation_latency
(null)
generation_time
9614
tokens_prompt
77
tokens_completion
745
native_tokens_prompt
67
native_tokens_completion
647
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00019054
router
(null)
provider_responses
0
endpoint_id
"934a69f9-bd54-474b-beca-24560f721e12"
is_byok
false
latency
45
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepInfra"
status
(null)
1
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"ce130974-23bb-4830-8afd-4e570ace317b"
is_byok
false
latency
277
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777128125-RORGCUR2hnTogDg4pBIJ"
session_id
(null)
api_type
"completions"
id
"gen-1777128125-2NKVT5a4ExmuUjuB31jX"
upstream_id
"ce130974-23bb-4830-8afd-4e570ace317b"
total_cost
0.00019054
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
438
0.07%
Dialogue to Total Word Ratio
Ratio: 82.00%, Deviation: 12.00%
neededClean
false
wordsTotal
439
wordsDialogue
360
0.0371%