NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2G3VVJMRK6HPVQ6BTNH5EQ
Run Details
Model:
DeepSeek V4 Pro (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
16.3s
Cost:
$0.000483
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The last of the morning’s cinnamon rolls sat untouched in the display case, a rare sight in a town where Addie’s baking usually vanished by ten. Outside, autumn leaves scraped against the window, restless in a breeze that carried the faint, sweet smell of woodsmoke from the distant hills. “You can’t honestly think the candied ginger just walked out of the pantry on its own.” “Stranger things have happened on a Tuesday. Remember when the Teagues’ cow turned up in the church pews?” “That was a prank. This is a very specific larceny. A whole jar of my crystallized ginger, gone, and the lid left perfectly tight. No fingerprints, no stray sugar. Who steals a single ingredient?” “Someone who needed a last-minute addition to a very particular recipe. Or someone who wanted to send a message.” “A message? It’s ginger, not a horse’s head. What could it possibly signify?” “The gingerbread competition at the harvest festival is in three days. And everyone knows Eliza Thornton’s secret weapon is a heavy hand with the candied stem ginger. Without it, her recipe falls flat as a pancake.” Silence draped the little bakery, thick as flour dust. The only sound was the old refrigerator humming its weary tune. Then, a soft, dry crunch—a stray sugar cube being crushed beneath an anxious heel. “You’re saying someone sabotaged her before she even started.” “I’m saying I’d check the grocery store receipts for anyone who bought a suspicious backup jar this morning. And I’d do it before the sheriff gets a slice of that pie and decides to call it a prank, too.”
created_at
"2026-04-25T14:18:51.638Z"
4/25/2026, 2:18:51 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
968
moderation_latency
(null)
generation_time
16166
tokens_prompt
81
tokens_completion
630
native_tokens_prompt
65
native_tokens_completion
523
native_tokens_completion_images
(null)
native_tokens_reasoning
168
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000483285
router
(null)
provider_responses
0
endpoint_id
"13bda129-f8a6-4a60-b1db-98082e5f7ccd"
is_byok
false
latency
107
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"Together"
status
429
1
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"67bb3bc6-308a-4e42-81ff-ac8f609e6a58"
is_byok
false
latency
281
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777126731-ohp0UYjLvDNIY8cK9ORi"
session_id
(null)
api_type
"completions"
id
"gen-1777126731-KXFR2MsHWA9KNYs98Eae"
upstream_id
"67bb3bc6-308a-4e42-81ff-ac8f609e6a58"
total_cost
0.000483285
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)