NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2ET08ZNGX5KAD6FQ4QWJSY
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
11.8s
Cost:
$0.000322
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed, ushering in the scent of rain and damp wool. “You’ve been standing at that display case for five minutes. The bear claws haven’t changed in that time.” “I’m savoring the anticipation. Besides, it’s not the pastry I’m looking at, it’s the angle. See how the morning light hits that third tray? Perfectly obscures the kitchen door.” “You think I don’t know my own shop’s sightlines? That’s the point. Why do you think Mayor Calloway always takes the corner booth?” “Because he’s a creature of habit, which is exactly how someone sneaks a little something extra into his chamomile tea.” “A little something… you mean like gossip?” “I mean like powdered baneberry. The coroner’s calling the tennis club incident a heart attack, but I saw the dishes from that luncheon. His coffee cup had a residue.” “That’s a serious accusation. Baneberry doesn’t just wander into a scone on its own. You’re suggesting someone in my kitchen…” “I’m suggesting your new gluten-free flour supplier might be a front, and the person who delivered it last Tuesday has a key to the back alley. The same person who very conveniently found the body.” “Hand me that cinnamon shaker, will you? You’re going to tell me you’ve already taken a sample, aren’t you?” “I may have borrowed a bagel for forensic purposes. The one with the poppy seeds. Harder to spot the dusting.”
created_at
"2026-04-25T13:55:59.907Z"
4/25/2026, 1:55:59 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1453
moderation_latency
(null)
generation_time
11782
tokens_prompt
81
tokens_completion
391
native_tokens_prompt
65
native_tokens_completion
338
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000322335
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"775d7bd9-80e4-4410-b886-fa9ecbf914dc"
is_byok
false
latency
278
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777125359-NXUp3l9M0UKt5l2cGjwZ"
session_id
(null)
api_type
"completions"
id
"gen-1777125359-9TUjqruaJ08ikWTcIQmE"
upstream_id
"775d7bd9-80e4-4410-b886-fa9ecbf914dc"
total_cost
0.000322335
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)