NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ955Q945QNE48DPDNRTA7R
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
14.9s
Cost:
$0.010860
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
19.54%
Dialogue to Total Word Ratio
(details)
55.0124%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled like cigarettes and ambition, neither of which were particularly useful at nine in the morning outside a suburban house decorated with pink balloons. Molly pressed her binoculars against her face and watched a parade of seven-year-olds waddle up the driveway clutching wrapped boxes almost as big as themselves. "The target is well guarded," she said. "Those are children, Molly." "Children with gifts, Bill. Focus." Bill adjusted his party hat, which he had insisted on wearing for authenticity despite the fact that they were currently sitting in a surveillance van fifty feet from the house. He studied the floor plan Molly had sketched on a paper bag from a fast food restaurant. "Walk me through the entry point again," he said. "We go in through the side gate. There's a bouncy castle in the back garden providing excellent cover and also, frankly, an opportunity I personally intend to take advantage of." "We are not going on the bouncy castle." "We are absolutely going on the bouncy castle, Bill. That's not negotiable." He pinched the bridge of his nose. Twelve years of heists, six of which had been with Molly, and still every single job found a way to go sideways before it even started. He looked back at the house. A clown was now visible through the front window, juggling something that appeared to be on fire. "Why is there a clown?" "Birthday parties have clowns sometimes, Bill, keep up." "That changes the whole operation. Clowns are unpredictable. You can't account for a clown." "You can't account for a clown," Molly said, pointing a finger at him. "I have a whole section in my notes specifically about clown mitigation." He stared at her. "Who plans for clown mitigation?" "Professionals." She passed him a sheet of paper that did, genuinely, have the words CLOWN CONTINGENCY written at the top in red marker and underlined twice. He read it in silence for a moment. It said to offer the clown a sandwich. "We don't have a sandwich," he said. "We have half a granola bar." "A clown is not going to accept half a granola bar as a bribe, Molly." "Then we improvise." She clicked the cap onto her marker pen with the finality of someone closing a boardroom presentation. "The presents get stacked by the back door according to my intel, which came from a very reliable source." "The source was a child at the bus stop." "A very observant child. Eight years old and already excellent situational awareness. I was impressed." Bill looked out at the balloons swaying gently in the morning breeze, at the cheerful hand-painted banner reading HAPPY BIRTHDAY JESSICA, at the general atmosphere of innocent joy they were about to professionally disrupt. "For the record," he said, "I want it noted that I had serious reservations." "Noted." Molly opened the van door. "Now put your party hat on straight. We look suspicious."
created_at
"2026-02-18T02:27:21.971096+00:00"
2/18/2026, 2:27:21 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
559
moderation_latency
104
generation_time
14772
tokens_prompt
77
tokens_completion
759
native_tokens_prompt
75
native_tokens_completion
709
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01086
router
(null)
provider_responses
0
id
"daacf091-7bef-420b-a726-07cfcf19e24b"
status
200
is_byok
false
latency
557
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381626-q8kHp5ZXfRQ9nfAsUIpN"
upstream_id
"daacf091-7bef-420b-a726-07cfcf19e24b"
total_cost
0.01086
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
490
19.54%
Dialogue to Total Word Ratio
Ratio: 44.29%, Deviation: 5.71%
neededClean
false
wordsTotal
490
wordsDialogue
217
55.0124%