NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ94DEV7ZT5AF4FQFRMAKFV
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
6.7s
Cost:
$0.004521
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning rush had faded, leaving the bakery warm and quiet. The smell of cinnamon rolls still hung in the air as the last customer pushed through the door. "Did you hear about the missing necklace at the Hargrove estate?" "Everyone's heard. Margaret Hargrove hasn't stopped telling anyone who'll listen." A tray of sourdough loaves slid onto the cooling rack with a soft thud. "She came in yesterday, bought six croissants, and talked my ear off for forty minutes." "Do you think it was stolen?" "That necklace has been in their family for three generations. Things like that don't just wander off on their own." Flour dusted across the wooden counter as another batch of dough was turned out. "Besides, they had that dinner party Friday night. Twelve guests, all sitting pretty in that grand dining room." "Twelve suspects, you mean." A small laugh broke the quiet. "I didn't say that." "You were thinking it." "I was thinking these brioche need another ten minutes." The oven door creaked open. "Though I will say, Robert Finley looked awfully nervous when he picked up his birthday cake this morning." "Did he now." "He did."
created_at
"2026-02-18T02:26:48.670177+00:00"
2/18/2026, 2:26:48 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
582
moderation_latency
214
generation_time
6390
tokens_prompt
81
tokens_completion
301
native_tokens_prompt
77
native_tokens_completion
286
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.004521
router
(null)
provider_responses
0
id
"88b6b7e9-7d9f-49b1-852c-670a2c10832b"
status
200
is_byok
false
latency
581
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381601-QElQfGzebqhbpa9Hu2F1"
upstream_id
"88b6b7e9-7d9f-49b1-852c-670a2c10832b"
total_cost
0.004521
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)