NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8W7EEV3FP7Z6AVYGEFCK8D
Run Details
Model:
Claude Sonnet 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
18.3s
Cost:
$0.012756
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
55.58%
Dialogue to Total Word Ratio
(details)
73.0315%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crayon-drawn map across the hood of the van and smoothed out the wrinkles with the focused intensity of a general preparing for war. The map had been confiscated from a seven-year-old named Derek, and it smelled faintly of fruit snacks. "Okay," she said, clicking her penlight on. "The party starts at two. We go in at two-fifteen." Bill squinted at the map. "Is that a drawing of a unicorn?" "That's the bouncy castle. Focus." "Right, right." He pulled his black beanie down further over his ears even though it was sixty-eight degrees outside. "What's our entry point?" "Side gate. Apparently Derek leaves it unlocked because he's been sneaking his hamster in and out." "We're relying on the security habits of a child and his hamster." "We are professionals, Bill. We adapt to the intelligence available." Bill stared at the map for a long moment. A small part of him, the part that still had a soul, whispered something uncomfortable. He told that part to be quiet. "How many kids are we looking at?" he asked. "Birthday boy's turning eight, so probably twelve to fifteen guests. Parents will be distracted by cake and the ongoing disaster that is group parenting." "And the presents?" "Piled on the gift table near the back fence." Molly tapped the map triumphantly. "Which Derek has helpfully labeled with a drawing of what I believe is either a treasure chest or a really sad house." Bill pulled a clipboard from the back seat and clicked his pen. "Okay. I'll back the van up to the fence." "Quietly." "Obviously quietly. I'll back it up quietly to the fence, we grab everything in under four minutes, and we're gone before anyone notices." "The clown is our biggest threat," Molly said gravely. "The clown." Bill stopped writing. "He's positioned near the gate according to Derek's intelligence report and he's described as, quote, very fast." "A fast clown." Bill set the clipboard down on the van hood and rubbed his temples. "Molly, this is the third time this month we've been outsmarted by someone's birthday party logistics." "We were not outsmarted last time. We made a strategic retreat." "We hid in a ball pit for forty minutes." Molly rolled up the map with crisp, dignified authority and tucked it under her arm. The afternoon sun caught the rhinestone on her all-black tactical jacket, which somewhat undermined the intimidating effect. She had bought it on clearance and refused to acknowledge this. "The ball pit was a calculated position," she said firmly. "There was a toddler in there with us." "He was very discreet." She opened the van door and climbed in. "Now get in. We've got thirty minutes to review the extraction route and eat our sandwiches." Bill looked up at the pale blue sky and wondered, genuinely and not for the first time, where exactly his life had taken its crucial wrong turn. He got in the van.
created_at
"2026-02-24T22:27:29.378Z"
2/24/2026, 10:27:29 PM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
850
moderation_latency
142
generation_time
18056
tokens_prompt
77
tokens_completion
866
native_tokens_prompt
92
native_tokens_completion
832
native_tokens_completion_images
(null)
native_tokens_reasoning
104
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.012756
router
(null)
provider_responses
0
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
id
"0ed51766-1701-4807-85db-b3dbb08990e4"
is_byok
false
latency
833
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972049-wZ6bMN6EtZtBpFTQKNat"
upstream_id
"0ed51766-1701-4807-85db-b3dbb08990e4"
total_cost
0.012756
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
490
55.58%
Dialogue to Total Word Ratio
Ratio: 53.43%, Deviation: 3.43%
neededClean
false
wordsTotal
496
wordsDialogue
265
73.0315%