NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8VG7AZCB7HV76KF3R41937
Run Details
Model:
Claude Sonnet 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
8.0s
Cost:
$0.005382
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning rush had thinned, leaving the bakery smelling of cinnamon and warm bread. A glass display case stretched across the counter, crowded with loaves and pastries that hadn't yet sold. The door chimed as the last customer left. "Did you hear about the Whitmore place? Police were there again last night." "Third time this month." A tray slid onto the counter with a soft clatter. "Grace Whitmore came in yesterday, bought six of the almond croissants. Looked like she hadn't slept in weeks." "Can you blame her? Her brother missing, all that money unaccounted for." "She said something strange, actually. Said Harold had been asking about the old Fenwick property before he disappeared. Said he'd gotten real secretive about it." "The Fenwick land? Nobody's touched that place in twenty years." "Exactly what I thought." A cloth moved slowly across the glass case. "But then this morning, I found something when I opened up. Someone pushed it under the door overnight." "What was it?" A folded piece of paper appeared on the counter between them. Two words were written inside, in hurried, shaking handwriting. *Stop asking.* "Should we call Sheriff Daley?" "Already did."
created_at
"2026-02-24T22:14:48.429Z"
2/24/2026, 10:14:48 PM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
683
moderation_latency
210
generation_time
7741
tokens_prompt
81
tokens_completion
347
native_tokens_prompt
94
native_tokens_completion
340
native_tokens_completion_images
(null)
native_tokens_reasoning
33
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.005382
router
(null)
provider_responses
0
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
id
"24ba7e6c-b2b4-453b-9909-bfd4e5eb6b3e"
is_byok
false
latency
683
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771971288-766dDw71Kf6MwXLfjWfa"
upstream_id
"24ba7e6c-b2b4-453b-9909-bfd4e5eb6b3e"
total_cost
0.005382
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)