NC Bench
Run: 2026-04-25-01KQ2GV9N31SEP7PE70GE90DS9
Run Details
Model: DeepSeek V4 Pro (Reasoning)
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 22.9s
Cost: $0.000529
Provider: DeepSeek
Evaluation summary
Score: 100.00%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The warmth of the ovens fought a losing battle against the chill from the propped-open back door. Flour dust motes danced in the morning sun. A tray of untouched cinnamon rolls sat on the cooling rack, their glaze already hardening into a dull, cracked shell.

“You don’t just leave a batch of your prizewinning snickerdoodles to burn and vanish before dawn.”

“Those weren’t for the bake sale and you know it. She was going to deliver them herself.”

“Exactly. And now she’s not here, and neither are the cookies. I found her apron folded on the flour bin. Folded. As if she’d just stepped out for a coffee refill.”

“Cora hasn’t taken a day off in seventeen years. When I came in to proof the sourdough at five, the back door wasn’t just unlocked—it was wide open. The alley was silent. Too silent, now that the mayor’s yappy terrier has gone missing.”

“Missing? I saw that dog tied to the bike rack outside the library not two hours ago.”

“Then you saw a ghost, because the library steps are where they found Mrs. Henderson’s knitting bag, with one needle missing. And a smear of cocoa powder on the railing.”

“Cocoa powder… dusted on her snickerdoodles. You’re saying Cora and the dog—no. That’s absurd. You’ve been reading too many of those paperback thrillers left in the lending library.”

“Perhaps. But explain the single cinnamon roll left untouched on the welcome mat of the closed-down post office. Bite marks too small for a human.”
created_at: "2026-04-25T14:31:44.546Z" (4/25/2026, 2:31:44 PM)
model: "deepseek/deepseek-v4-pro-20260423"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 940
moderation_latency: (null)
generation_time: 17735
tokens_prompt: 81
tokens_completion: 667
native_tokens_prompt: 65
native_tokens_completion: 575
native_tokens_completion_images: (null)
native_tokens_reasoning: 226
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.000528525
router: (null)
provider_responses:
  0:
    endpoint_id: "d084170b-dd99-4e53-9ae1-b63881f3cfbf"
    id: "8b2796cb-6bdc-49d5-8868-043a8f054ce2"
    is_byok: false
    latency: 276
    model_permaslug: "deepseek/deepseek-v4-pro-20260423"
    provider_name: "DeepSeek"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777127504-EeX9pWjQXHKv45Gca1HW"
session_id: (null)
api_type: "completions"
id: "gen-1777127504-NnZ3PXiq31epl5wML95W"
upstream_id: "8b2796cb-6bdc-49d5-8868-043a8f054ce2"
total_cost: 0.000528525
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepSeek"
response_cache_source_id: (null)
Evaluation details
Result: 100.00%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags: (empty)
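
The report does not show how the "Count dialogue tags" evaluator works, so the following is only a rough sketch of one way such a check could be written. It assumes a dialogue tag is a closing quotation mark followed, within two words, by a speech verb. The verb list, the regular expression, and the function name `find_dialogue_tags` are all hypothetical and are not taken from NC Bench.

```python
import re

# Hypothetical list of speech verbs; the real evaluator's list is not shown in the report.
TAG_VERBS = ("said", "asked", "replied", "answered", "whispered", "muttered", "shouted")
_VERBS = "|".join(TAG_VERBS)

# Assumption: a tag follows a closing quote that ends in punctuation,
# e.g. `,” he said` or `?” asked the baker`. Up to two words may sit
# between the quote and the verb (for example a name such as "Mr. Finch").
TAG_AFTER_QUOTE = re.compile(
    rf"[,.!?][”\"]\s+(?:[A-Za-z.]+\s+){{0,2}}(?:{_VERBS})\b",
    re.IGNORECASE,
)


def find_dialogue_tags(text):
    """Return every substring that looks like a dialogue tag under the assumptions above."""
    return [match.group(0) for match in TAG_AFTER_QUOTE.finditer(text)]


if __name__ == "__main__":
    tagged = "“She left,” he said. “Where?” asked the baker."
    untagged = "“Missing?” “I saw that dog.”"
    print(find_dialogue_tags(tagged))
    print(find_dialogue_tags(untagged))
```

Under this sketch, an empty list would correspond to the `dialogueTags: (empty)` entry in the evaluation metadata. A heuristic like this misses tags that precede the quotation and tags built on verbs outside the list, so it should be read as an illustration and not as the benchmark's scoring logic.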