NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2E3G033NXKCFNTCF4Q2PSA
Run Details
Model:
DeepSeek V4 Pro (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
27.3s
Cost:
$0.001032
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bells above the door had long since gone silent, leaving only the hum of the pastry case and the heavy, yeasty warmth of cooling bread. A damp rag slapped against the glass countertop. “You’ve been quiet since the morning rush.” “I’m trying to figure out if I’m imagining things, or if someone’s playing a very expensive prank.” “Define expensive.” “A single vanilla bean left in the flour bin. Madagascar, perfectly plump, split right down the middle. Not ours. Our order hasn’t come in.” “Maybe a gift from the spice shop?” “Tessa would’ve left a card. She’d want a testimonial for the town newsletter. No, this was placed there after I locked up last night.” A chair scraped against the tile floor. Outside, the fog pressed against the window like a held breath. “The lock’s still stiff from when the deputy jimmied it for the stray cat.” “Exactly. They didn’t use the door. And the bean was sitting on top of a receipt I filed three days ago for two dozen orange-currant scones.” “The ones Delia Maplewood ordered for the book club.” “Delia Maplewood died last Tuesday.” Silence settled, thick as meringue. A teaspoon traced nervous circles in a forgotten latte. “The coroner said heart attack.” “He also said she’d eaten a very peppery meal. The same coroner who’s been buying double-chocolate muffins every day since she passed.” “And today? What did he buy today?” “Nothing. He just slid a vanilla bean across the counter and asked if I’d picked up any interesting new recipes.”
created_at
"2026-04-25T13:43:42.342Z"
4/25/2026, 1:43:42 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1239
moderation_latency
(null)
generation_time
27185
tokens_prompt
81
tokens_completion
1358
native_tokens_prompt
65
native_tokens_completion
1154
native_tokens_completion_images
(null)
native_tokens_reasoning
798
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001032255
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"997a7929-1b09-494c-9fa3-bce878f17229"
is_byok
false
latency
314
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777124622-9tR0dACBWAT20DOzeawH"
session_id
(null)
api_type
"completions"
id
"gen-1777124622-4bf7WJUJBlpp25BKMKdl"
upstream_id
"997a7929-1b09-494c-9fa3-bce878f17229"
total_cost
0.001032255
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)