NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2G7RXRWZ61HEPF0ATKDXH7
Run Details
Model:
DeepSeek V4 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
7.0s
Cost:
$0.000131
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.90%
Dialogue to Total Word Ratio
(details)
49.9515%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“So you’re telling me the mark is a six-year-old?” Bill adjusted his black beanie, which did nothing to hide his skepticism. “We’ve done banks, museums, even that one petting zoo—but a child’s birthday?” “It’s a high-value target,” Molly whispered, spreading a napkin diagram across the diner table. “The parents are rich. Those presents are unsecured, unsupervised, and wrapped in shiny paper that rips in three seconds.” “And the security? A woman named Grandma Carol with a handbag full of Werther’s Originals and a deadly glare.” “Exactly. She’s the muscle. But I have a plan.” Molly tapped the napkin. “You distract with a magic show. I go in through the bounce house.” Bill snorted. “Magic show? My rabbit is out on parole. I can juggle. Badly.” “Better. You drop one. The screaming will cover my entry.” “And if the kid cries?” “Then we’re not just thieves, we’re heroes—we taught them about disappointment.” Bill sighed. “Fine. But I’m not wearing the clown nose.” “You will wear the clown nose, or I’m telling the others about the alpaca incident.” “Low blow.” “High reward. Now, let’s run the con again. What’s your distraction?” “I make a balloon giraffe. It pops. Chaos.” “And then?” “I yell ‘Freeze tag!’ and point at grandma.” “And I?” “You empty the gift table into the trash bags. Exit via the dog door.” “The dog door? I’m not that flexible.” “Then don’t eat the cake beforehand. Remember, we’re professionals.” “We’re stealing from a five-year-old.” “Six. Get it right.”
created_at
"2026-04-25T14:20:59.709Z"
4/25/2026, 2:20:59 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
525
moderation_latency
(null)
generation_time
6880
tokens_prompt
77
tokens_completion
511
native_tokens_prompt
67
native_tokens_completion
435
native_tokens_completion_images
(null)
native_tokens_reasoning
46
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00013118
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"4423cc0c-e467-4d45-a165-b473ab32bfb3"
is_byok
false
latency
275
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777126859-ydrRH6yu6Vt6G0jCTjt3"
session_id
(null)
api_type
"completions"
id
"gen-1777126859-X2JMETn2NUTS3sFwy1DQ"
upstream_id
"4423cc0c-e467-4d45-a165-b473ab32bfb3"
total_cost
0.00013118
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
250
99.90%
Dialogue to Total Word Ratio
Ratio: 88.24%, Deviation: 1.76%
neededClean
false
wordsTotal
255
wordsDialogue
225
49.9515%