NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8X7C1481NBNSEZ2TZ0NZDF
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
11.1s
Cost:
$0.009420
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning light filtered through the checkered curtains of Sweet Crumb Bakery, casting warm squares across the flour-dusted counter. The scent of cinnamon rolls hung thick in the air, mingling with fresh coffee and something unspoken. "You heard about Harold, I suppose." "Found him in the garden, they say. Face down in his prize-winning roses." A tray of scones slid across the display case, each one golden and perfectly round. "The sheriff's calling it an accident. Tripped over his own garden hose." "Harold Pemberton hadn't tripped over anything in seventy-two years of living. That man could navigate his property blindfolded." The coffee machine hissed and sputtered. A mug was filled and pushed forward, steam curling upward like a question mark. "I saw Margaret at the pharmacy yesterday. Buying antihistamines, she said. Funny thing is, Margaret doesn't have allergies." "That's a strange thing to notice." "It's a strange thing to lie about." Silence settled between them like flour dust. Outside, a church bell rang nine times. A knife cut through a warm loaf of sourdough, the crust crackling. "I saved you the last blueberry muffin." "You always do. Now tell me what else you saw."
created_at
"2026-02-24T22:44:55.468Z"
2/24/2026, 10:44:55 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1444
moderation_latency
141
generation_time
10891
tokens_prompt
81
tokens_completion
356
native_tokens_prompt
94
native_tokens_completion
358
native_tokens_completion_images
(null)
native_tokens_reasoning
33
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00942
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"2b87526f-33fe-47e0-8f59-8238f79088b2"
is_byok
false
latency
1357
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771973095-4CI08522Kx2JNWwccxIC"
upstream_id
"2b87526f-33fe-47e0-8f59-8238f79088b2"
total_cost
0.00942
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)