NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ94SNQ0X10MW3GWTG910MP
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
8.9s
Cost:
$0.004641
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as the last customer shuffled out into the grey morning. The smell of cinnamon rolls and fresh coffee hung thick in the warm air. "You heard about Gerald, I suppose." "Found him right there in the park, they say. Still clutching his walking stick." A tray slid onto the counter with a soft clatter. Flour dusted the worn wooden surface like fresh snow. "Heart, the sheriff thinks. But Marvella down at the post office isn't so sure." "Marvella isn't sure about anything unless she's read it herself, opened it herself, and stamped it herself." A small laugh, quickly swallowed. "Still. Gerald hadn't missed a Tuesday morning here in eleven years. Always the blueberry scone, always the small black coffee." "I know it." The oven timer sang out from the back room. Neither of them moved. "He paid in advance last week. Said he wanted credit on his account for the month ahead." "Did he now." "Seemed in a hurry. Distracted, like." A pause settled between them, comfortable and heavy at once, the way silences only can between people who have known each other a very long time. "Save me a scone, will you."
created_at
"2026-02-18T02:27:03.674601+00:00"
2/18/2026, 2:27:03 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
786
moderation_latency
(null)
generation_time
8821
tokens_prompt
81
tokens_completion
309
native_tokens_prompt
77
native_tokens_completion
294
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.004641
router
(null)
provider_responses
0
id
"msg_01NUpSNLPVWsSUdk5JFRpVGR"
status
200
is_byok
false
latency
786
endpoint_id
"6d408764-7dd7-4626-bb87-a6cc1589bc86"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381614-OWL93IJBvEyrMfFUx4Is"
upstream_id
"msg_01NUpSNLPVWsSUdk5JFRpVGR"
total_cost
0.004641
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)