NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WE3GH6ATR68VW7JF5ZQEC
Run Details
Model:
Claude Sonnet 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
8.3s
Cost:
$0.005427
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The smell of cinnamon and warm bread drifted through Honey & Crumb as the morning light stretched across the checkered floor. The glass case gleamed with fresh pastries, and the old copper bell above the door had been quiet for a blessed ten minutes. "You heard about Gerald Finch, I suppose." "Hard not to. Half the town was whispering about it before I'd even unlocked the door this morning." "They're saying his shop was cleaned out. Every last piece of inventory, gone." "Cleaned out or just... moved?" A pause. The sound of tongs lifting a croissant onto wax paper. "What do you mean by that?" "Well, Gerald owed money to at least three people I can name without thinking too hard. Could be he moved it himself before anyone came collecting." "Or someone got there first." "Exactly what I said to my sister." The bell above the door chimed, and both women straightened instinctively, arranging their expressions into something more neighborly. "Morning, Deputy Reeves. The usual?" "Please. And make it two — Sheriff's in a mood today." He lowered his voice. "Don't suppose either of you knew Gerald Finch well?" The silence that followed was just a second too long.
created_at
"2026-02-24T22:31:07.545Z"
2/24/2026, 10:31:07 PM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
805
moderation_latency
146
generation_time
8087
tokens_prompt
81
tokens_completion
349
native_tokens_prompt
94
native_tokens_completion
343
native_tokens_completion_images
(null)
native_tokens_reasoning
33
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.005427
router
(null)
provider_responses
0
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
id
"6cdd2d9c-76cd-48ba-a9c9-2fab623da57a"
is_byok
false
latency
805
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972267-LLtEy1wmRSd9UStZU9CD"
upstream_id
"6cdd2d9c-76cd-48ba-a9c9-2fab623da57a"
total_cost
0.005427
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)