NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYXZM69DQ7554G9A2157BMC
Run Details
Model:
Claude Opus 4.6
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
9.0s
Cost:
$0.008085
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
# The Missing Recipe The morning rush had barely ended when the bell above the door chimed. Flour dust motes danced in the sunlight streaming through the front window of Sweet Crumbs Bakery, and the display case gleamed with fresh croissants and lemon tarts. "I heard you found something strange in Martha's kitchen." "Strange doesn't begin to cover it. Her recipe box was wide open on the counter, and three cards were missing." "Three? Which ones?" "The brown butter shortbread, the lavender scones, and that famous rum cake she always brought to the church social." "Someone broke into Martha Henderson's house to steal *recipes*?" "That's what Sheriff Daniels said. No sign of forced entry, though. Whoever did it had a key — or knew where she kept the spare." "Under the ceramic frog by the back steps. Half the town knows that." "Exactly. But here's the thing that's really eating at me." "What?" "Martha told me last week that someone offered her five thousand dollars for the rum cake recipe. She turned them down flat." The espresso machine hissed into the silence. Outside, a cardinal landed on the windowsill and pecked once at the glass. "Five thousand dollars is a lot of money in this town." "Enough to make someone desperate."
created_at
"2026-02-08T15:30:13.593017+00:00"
2/8/2026, 3:30:13 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1886
moderation_latency
171
generation_time
8735
tokens_prompt
61
tokens_completion
271
native_tokens_prompt
77
native_tokens_completion
308
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.008085
router
(null)
provider_responses
0
id
"9304b35f-3f21-4d02-8e1d-d3c70711ab6a"
status
200
is_byok
false
latency
1885
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770564604-6nFpfsSYlSRl87ufkdvU"
upstream_id
"9304b35f-3f21-4d02-8e1d-d3c70711ab6a"
total_cost
0.008085
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)