NC Bench
Run: 2026-02-18-01KHQ92W67RHMNKF2BP5N1M25N
Run Details
Model: Claude Sonnet 4.6
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 8.8s
Cost: $0.004521
Provider: Anthropic
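The reported cost can be sanity-checked against the native token counts listed in the run metadata further down this page (77 prompt tokens, 286 completion tokens). The following is a minimal sketch, not the benchmark's or the provider's billing logic. The per-million-token rates ($3 input, $15 output) are an assumption that does not appear on this page, and the helper name `estimate_cost_usd` is hypothetical.

```python
# Token counts and reported cost copied from this run's metadata.
NATIVE_TOKENS_PROMPT = 77
NATIVE_TOKENS_COMPLETION = 286
REPORTED_COST_USD = 0.004521

# ASSUMPTION: prices in USD per million tokens; not stated on this page.
ASSUMED_INPUT_PRICE_PER_MTOK = 3.00
ASSUMED_OUTPUT_PRICE_PER_MTOK = 15.00


def estimate_cost_usd(prompt_tokens, completion_tokens, input_price, output_price):
    """Estimate the cost of one request from token counts and per-million prices."""
    return (prompt_tokens * input_price + completion_tokens * output_price) / 1_000_000


estimated = estimate_cost_usd(
    NATIVE_TOKENS_PROMPT,
    NATIVE_TOKENS_COMPLETION,
    ASSUMED_INPUT_PRICE_PER_MTOK,
    ASSUMED_OUTPUT_PRICE_PER_MTOK,
)
print(f"estimated: ${estimated:.6f}, reported: ${REPORTED_COST_USD:.6f}")
```

Under the assumed rates, the estimate equals the reported $0.004521. The same calculation with the normalized counts (81 and 291) gives $0.004608, which suggests, but does not prove, that the cost is computed from the native token counts.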
Evaluation summary
Score: 60.65%
Evaluator: Count dialogue tags
Details Link: (details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as the last customer shuffled out into the drizzle, leaving the warm bakery blissfully quiet. The smell of cinnamon and burnt sugar hung in the air like a comfortable secret. "You heard about Gerald Pemworth, I suppose." "Found him this morning, they say. Right there in his garden." A tray of sourdough loaves slid onto the cooling rack with a heavy thunk. "Terrible business." "Margaret at the post office says it wasn't natural causes." Flour dusted the countertop as a rolling pin pressed steadily back and forth. "Says the detective was there for three hours." "Margaret says a great many things." "She does. But she's right more often than people give her credit for." A pause, filled only by the soft creak of the oven door swinging open. "He owed money, you know. To half the town." "I know he owed money to me." The rolling pin stopped. "He owed you money?" "Thirty pounds. For the birthday cake I made his wife last spring." A dish towel snapped sharply against the counter. "Though I suppose I won't be seeing that now." "No. I don't suppose you will."
created_at: "2026-02-18T02:26:00.334022+00:00" (2/18/2026, 2:26:00 AM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 566
moderation_latency: (null)
generation_time: 8686
tokens_prompt: 81
tokens_completion: 291
native_tokens_prompt: 77
native_tokens_completion: 286
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.004521
router: (null)
provider_responses:
  0:
    id: "msg_01MAg5CnCpb974swG6hhnCU4"
    status: 200
    is_byok: false
    latency: 566
    endpoint_id: "6d408764-7dd7-4626-bb87-a6cc1589bc86"
    provider_name: "Anthropic"
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
api_type: "completions"
id: "gen-1771381551-oqVlN0qexSfBDTRbvHsG"
upstream_id: "msg_01MAg5CnCpb974swG6hhnCU4"
total_cost: 0.004521
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
Evaluation details
Result: 60.65%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags:
    0: "A dish towel snapped sharply"