NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2G0BJD8DTQJ4QAHC7NWCQM
Run Details
Model:
DeepSeek V4 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
6.1s
Cost:
$0.000116
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
9.61%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
4.8041%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“You’re telling me we have to dress like clowns?” Bill held up the floppy red shoes with genuine horror. “I have a reputation, Molly.” “Your reputation as a guy who once tried to steal a forklift wearing a banana costume,” Molly said, adjusting her oversized polka-dot bow. “Besides, kids trust clowns. They hand you their presents voluntarily.” “They hand you their presents because you’re doing balloon animals,” Bill grumbled. “I don’t make balloon animals. I make trouble.” “Then make a balloon sword. Distract them while I work the loot table.” Molly checked her watch. “The party starts in ten minutes. Mrs. Henderson’s son is turning six. There’ll be at least thirty gifts. All unwrapped. Easy pickings.” “What if the birthday kid cries?” “What if you stop being a baby and put on the nose?” Molly shoved the red foam sphere into his palm. “We’re professionals. Act like one.” Bill sighed and squeezed the nose. It squeaked. “If anyone from the syndicate sees this, I’m blaming you.” “Relax. They’ll just see a sad clown with a balloon giraffe.” Molly grinned. “Now remember: we’re here for the presents, not the cake.” “What’s wrong with the cake?” “Nothing, if you want to get caught with frosting on your chin. Focus, Bill.” He shoved the nose onto his face. “Fine. But I’m keeping the balloon animals.”
created_at
"2026-04-25T14:16:56.658Z"
4/25/2026, 2:16:56 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
543
moderation_latency
(null)
generation_time
5953
tokens_prompt
77
tokens_completion
443
native_tokens_prompt
67
native_tokens_completion
382
native_tokens_completion_images
(null)
native_tokens_reasoning
59
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00011634
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"f4abc5b8-f047-46c4-ad87-bc9d18c6c7ec"
is_byok
false
latency
276
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777126616-wzAqBoQo8mtNQfFncdOM"
session_id
(null)
api_type
"completions"
id
"gen-1777126616-4Tm7R28tY1G9pQi6zUph"
upstream_id
"f4abc5b8-f047-46c4-ad87-bc9d18c6c7ec"
total_cost
0.00011634
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
9.61%
Matches word count
n/a
neededClean
false
words
222
0.00%
Dialogue to Total Word Ratio
Ratio: 77.58%, Deviation: 27.58%
neededClean
false
wordsTotal
223
wordsDialogue
173
4.8041%