NC Bench
Run: 2026-04-25-01KQ2FF6P8X1AD6SDJC89T1SW0
Run Details
Model: DeepSeek V4 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 6.3s
Cost: $0.000114
Provider: DeepSeek
Evaluation summary
Score: 100.00%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
“I swear, if I have to make one more lavender honey scone, I’m going to stuff the entire batch into Gertrude Higginbotham’s handbag.”
“She’d probably find that a compliment. Last week she told me my lemon bars were ‘pleasantly tart, but not nearly as tart as the look I gave her son at the library sale.’”
“Her son collects vintage doorknobs. Who am I to judge? But the scones—she orders three dozen every Tuesday, picks them up at 9:15 on the dot, and I’ve never seen her eat a single one.”
“Maybe she’s a secret hoarder. A scone dragon sitting on a glittering mountain of baked goods.”
“More likely she’s feeding them to that mangy cat that lives behind the feed store. I heard it turned down a piece of salmon last week. Too refined now.”
A pause, the soft scrape of a spatula against a baking sheet.
“So what’s the real reason you wanted me to come by this morning? Not because you needed help with the buttercream.”
“Because someone went through the trash behind my shop last night. Left the lid off, scattered flour bags everywhere. And I found this stuck to the dumpster.”
A rustle of paper.
“Is that… a recipe card? For my grandmother’s cinnamon rolls.”
“The ones only you and I know how to make. The ones that got your aunt run out of the county fair committee in ’98.”
“Oh, sugar.”
“Exactly.”
created_at: "2026-04-25T14:07:34.604Z" (4/25/2026, 2:07:34 PM)
model: "deepseek/deepseek-v4-flash-20260423"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 589
moderation_latency: (null)
generation_time: 5957
tokens_prompt: 81
tokens_completion: 430
native_tokens_prompt: 65
native_tokens_completion: 376
native_tokens_completion_images: (null)
native_tokens_reasoning: 50
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00011438
router: (null)
provider_responses:
  0:
    endpoint_id: "722e0746-2034-4d6e-a08f-e6271849ee9e"
    id: "78d244eb-c6e9-41cc-974a-c4e1edd19565"
    is_byok: false
    latency: 274
    model_permaslug: "deepseek/deepseek-v4-flash-20260423"
    provider_name: "DeepSeek"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777126054-WOU3dcIRGRllUFzHVbFD"
session_id: (null)
api_type: "completions"
id: "gen-1777126054-3iyH8xrdoq6KxH1sod5Y"
upstream_id: "78d244eb-c6e9-41cc-974a-c4e1edd19565"
total_cost: 0.00011438
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepSeek"
response_cache_source_id: (null)
Evaluation details
Result: 100.00%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags: (empty)
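This report does not show how the "Count dialogue tags" evaluator is implemented. As a rough illustration only, the sketch below counts attributions of the form closing quote, optional speaker, speech verb (for example: ” she said, or ” replied Tom). The verb list, the regular expression, and the function name count_dialogue_tags are all assumptions made for this example; they are not taken from NC Bench.

```python
import re

# Hypothetical verb list; the benchmark's actual list is not given in the report.
SPEECH_VERBS = ("said", "asked", "replied", "answered", "whispered", "muttered", "shouted")

# A closing quote, whitespace, up to two words (the speaker), then a speech verb.
# Matches both `” she said` and `” replied Tom`.
TAG_PATTERN = re.compile(
    r'[”"]\s+(?:\w+\s+){0,2}(?:' + "|".join(SPEECH_VERBS) + r')\b',
    re.IGNORECASE,
)


def count_dialogue_tags(text: str) -> int:
    """Return the number of attribution tags found directly after a closing quote."""
    return len(TAG_PATTERN.findall(text))


tagged = "“Hello,” she said. “Hi,” replied Tom."
untagged = "“Oh, sugar.” “Exactly.”"
```

Under these assumptions, count_dialogue_tags(tagged) returns 2 and count_dialogue_tags(untagged) returns 0. A result of zero would correspond to the empty dialogueTags field reported above. Narrative beats such as "A pause, the soft scrape of a spatula" are not counted because they contain no speech verb. A heuristic like this misses tags that precede the quote and any verb outside the list.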