NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WYN76HXP4N047WWWTWCBH
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
10.0s
Cost:
$0.009195
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed softly as morning light spilled across the flour-dusted counter. The scent of cinnamon rolls hung thick in the warm air, mingling with fresh coffee and something unspoken. "You heard about Margaret, I suppose." "Heard she didn't show up to open the library yesterday. First time in thirty-one years." "That's not the half of it. Sheriff found her back door wide open, kettle still on the stove. Whistling away to an empty house." A long pause filled the space between them, broken only by the gentle scraping of a butter knife across toast. "Her garden was torn up, too. Every last rosebush pulled out by the roots." "Margaret loved those roses more than most people love their children." "Exactly." The espresso machine hissed and sputtered. A tray of scones emerged from the back, golden and fragrant, set down with careful hands. "I don't like what you're implying." "I'm not implying anything. I'm stating facts. She's gone, her roses are destroyed, and that nephew of hers was spotted at the gas station Tuesday night with dirt under his fingernails." "Could be nothing." "Could be. But in this town, nothing has a funny way of turning into something."
created_at
"2026-02-24T22:40:09.966Z"
2/24/2026, 10:40:09 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1735
moderation_latency
233
generation_time
9695
tokens_prompt
81
tokens_completion
353
native_tokens_prompt
94
native_tokens_completion
349
native_tokens_completion_images
(null)
native_tokens_reasoning
33
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009195
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"9255508c-7039-4d70-93b7-f91013c2eb0c"
is_byok
false
latency
1649
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972809-cJFM4S6D8GtiPrvHOHc7"
upstream_id
"9255508c-7039-4d70-93b7-f91013c2eb0c"
total_cost
0.009195
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)