NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ915344Y3CGG3VNDJ3CGJ7
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
8.4s
Cost:
$0.004626
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
60.65%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning rush had faded, leaving the bakery smelling of cinnamon and warm butter. A small bell chimed as the door swung open. "You heard about Gerald Finch, I suppose." "Who hasn't? Found face-down in the duck pond, they're saying." A paper bag crinkled as it slid across the glass counter. "Your usual sourdough." "Thank you, dear. And yes, though I'd hardly call that pond water deep enough to drown a man his size."
A coin purse snapped
open. "That's what I keep telling people." "You think someone helped him along?" "I think Gerald Finch had exactly three enemies in this town, and all three of them were at the harvest festival that evening." The older woman lowered her voice. "I saw them myself." The baker leaned forward, elbows on the counter. "You mention any of this to Sheriff Dobbins?" "Mention it? I told him outright. He smiled at me the way young people smile at old ones when they've stopped truly listening." "That's a shame." "It is." She tucked the bread under her arm and straightened her coat. "Which is why I intend to handle it myself." The bell chimed again as she left.
created_at
"2026-02-18T02:25:03.595318+00:00"
2/18/2026, 2:25:03 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
504
moderation_latency
(null)
generation_time
8214
tokens_prompt
81
tokens_completion
293
native_tokens_prompt
77
native_tokens_completion
293
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.004626
router
(null)
provider_responses
0
id
"msg_014pok7j6U6rxVDJDzeXjJ45"
status
200
is_byok
false
latency
502
endpoint_id
"6d408764-7dd7-4626-bb87-a6cc1589bc86"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381495-SZTC9EqQGs07zHk92nRI"
upstream_id
"msg_014pok7j6U6rxVDJDzeXjJ45"
total_cost
0.004626
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
60.65%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"A coin purse snapped"