NC Bench
Run: 2026-02-24-01KJ8V7SYB26PZR9QBX13SBHXJ
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
9.8s
Cost:
$0.009195
Provider:
Amazon Bedrock
Evaluation summary
Score:
100.00%
Evaluator:
Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning light filtered through the checkered curtains of Sweet Crumbs Bakery, casting warm squares across the flour-dusted counter. The scent of cinnamon rolls hung heavy in the air, mingling with fresh coffee and something unspoken.

"You heard about Gerald, I suppose."

"Found him in his garden shed. Martha nearly fainted when she went to borrow the hedge trimmer."

"They're saying it was natural causes."

"They're saying a lot of things." A muffin was torn in half, steam curling upward. "Natural causes don't explain why his prize-winning roses were all dug up."

"People do strange things when they're not feeling well."

"People do strange things when they're looking for something buried."

The coffee machine gurgled into the silence. Outside, a blue jay landed on the windowsill and immediately thought better of it.

"You know, Gerald ordered a cake last Tuesday. Wanted it decorated with a very specific message."

"What kind of message?"

"'I know what you did in 1987.' Wanted it delivered to the town council meeting."

"And did you deliver it?"

"Never got the chance. He was dead by Wednesday morning."

Another piece of muffin disappeared. The coffee grew cold.

"I think I'll make that cake after all."
created_at
"2026-02-24T22:10:12.564Z"
2/24/2026, 10:10:12 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1427
moderation_latency
121
generation_time
9637
tokens_prompt
81
tokens_completion
363
native_tokens_prompt
94
native_tokens_completion
349
native_tokens_completion_images
(null)
native_tokens_reasoning
33
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009195
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"cef1490c-d13a-4c28-bcf5-e1c683aa6f3e"
is_byok
false
latency
1344
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771971012-VDEQE4TBlsCb0mjMwE4j"
upstream_id
"cef1490c-d13a-4c28-bcf5-e1c683aa6f3e"
total_cost
0.009195
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
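The reported cost can be sanity-checked against the token counts in the metadata above. The sketch below is only an illustration: the per-million-token rates are assumptions that do not appear in this report, the function name `estimate_cost_usd` is hypothetical, and the report does not say whether the 33 reasoning tokens are already included in `native_tokens_completion`.

```python
# Token counts copied from the run metadata above.
NATIVE_TOKENS_PROMPT = 94
NATIVE_TOKENS_COMPLETION = 349

# ASSUMPTION: illustrative rates in USD per million tokens.
# These values are not stated anywhere in this report.
ASSUMED_INPUT_USD_PER_MTOK = 5
ASSUMED_OUTPUT_USD_PER_MTOK = 25


def estimate_cost_usd(prompt_tokens: int,
                      completion_tokens: int,
                      input_rate: int = ASSUMED_INPUT_USD_PER_MTOK,
                      output_rate: int = ASSUMED_OUTPUT_USD_PER_MTOK) -> float:
    """Estimate request cost in USD from token counts and per-million rates."""
    # tokens * (USD per million tokens) gives micro-dollars, so the sum stays
    # an exact integer until the final division.
    micro_usd = prompt_tokens * input_rate + completion_tokens * output_rate
    return micro_usd / 1_000_000


estimated = estimate_cost_usd(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
print(f"estimated cost: ${estimated:.6f}")
```

Under these assumed rates the estimate is $0.009195, the same figure reported as `usage` and `total_cost`. The agreement is suggestive only; it does not confirm how the provider actually billed the request.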
Evaluation details
Result:
100.00%
Evaluator:
Count dialogue tags
Details:
n/a
Meta Data:
neededClean
false
dialogueTags
(empty)
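The report does not describe how the "Count dialogue tags" evaluator works. As a rough illustration of the idea only, the sketch below flags a closing quotation mark followed by a speech verb, optionally after one or two words (as in `" she said` or `" replied Martha`). The verb list, the regular expression, and the function name `find_dialogue_tags` are all assumptions and are not taken from NC Bench.

```python
import re

# ASSUMPTION: a small, hypothetical verb list; a real evaluator would likely
# use a much longer one.
SPEECH_VERBS = ("said", "asked", "replied", "whispered", "muttered",
                "shouted", "answered")

# A quote character followed by whitespace is treated as a closing quote
# (an opening quote is followed directly by text). After it, allow up to two
# words (e.g. a pronoun or name) before the speech verb.
TAG_AFTER_QUOTE = re.compile(
    r'["\u201d]\s+(?:\w+\s+){0,2}(?:' + "|".join(SPEECH_VERBS) + r")\b",
    re.IGNORECASE,
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return the matched attribution fragments found after closing quotes."""
    return [match.group(0) for match in TAG_AFTER_QUOTE.finditer(text)]


tagged = '"You heard about Gerald," she said. "Found him in his shed," replied Martha.'
untagged = '"You heard about Gerald, I suppose." "Found him in his garden shed."'

print(find_dialogue_tags(tagged))
print(find_dialogue_tags(untagged))
```

An empty result for a scene, like the empty `dialogueTags` field above, would mean no attribution of this kind was detected. This heuristic misses tags that precede the quote (for example `Martha said, "..."`) and can be fooled by narration that happens to use a listed verb right after a quote.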