NC Bench
Run: 2026-02-24-01KJ8X73ZFNNCJMCJ03PK90CRP
Run Details
Model: Claude Sonnet 4.6 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 17.2s
Cost: $0.012276
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
0.00% | Matches word count
78.76% | Dialogue to Total Word Ratio
39.3784% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of the van and stabbed her finger at a wobbly rectangle she had labeled THE TARGET. "Okay, talk me through it," Bill said, squinting at the paper. "Right. So the party starts at two. We have approximately ninety minutes before the cake comes out, which is when every adult in that house will be distracted singing and taking photos." "What kind of cake?" "Bill." "I'm just asking. For the timeline." "It doesn't matter what kind of cake it is." "It matters to me personally." Molly closed her eyes for a moment. "Presumably a children's birthday cake. Can we continue?" "Sure, sure. Go ahead." "The presents are being staged in the conservatory at the back of the house. I counted fourteen boxes through the window this morning." "You were already scoping the place and didn't invite me?" "You were asleep." "I'm always asleep at seven in the morning, Molly. That's what seven in the morning is for." She ignored him and tapped the map again. "The side gate has a latch but no padlock. We go in during the cake moment, grab everything in the conservatory, and we're back in the van in four minutes." Bill picked up the map and examined it with great seriousness. "You've drawn the bouncy castle." "For reference." "You've given it a little face." "It helps me visualize the layout." "It's smiling." "Bill, I need you to focus." He set the map down. "What about the clown?" Molly went very still. "What clown?" "There's always a clown at these things. I'm not going anywhere near a clown, Molly. That's in my contract." "You don't have a contract." "It's a verbal contract. We agreed after the incident." "There was no incident." "He made a balloon animal that looked exactly like me. It was unsettling." Molly rolled up the map and tucked it under her arm. She breathed slowly through her nose the way her therapist had shown her. 
"If there is a clown," she said carefully, "you will walk past the clown, you will not make eye contact with the clown, and you will load the presents into the van." "And if he starts making animals again?" "Then I will personally deflate him." Bill nodded, seemingly satisfied. He zipped up his jacket and looked toward the quiet suburban street ahead of them. "So we're really doing this," he said. "Robbing a kid's birthday party." "We are absolutely robbing a kid's birthday party." "That's pretty low, Molly." "The presents are going to a resale charity. We talked about this." "Right, right." He paused. "Which charity?" "Us. We are the charity." Bill considered this. "Fair enough," he said, and climbed into the van.
created_at: "2026-02-24T22:44:47.223Z" (2/24/2026, 10:44:47 PM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 729
moderation_latency: 129
generation_time: 17005
tokens_prompt: 77
tokens_completion: 806
native_tokens_prompt: 92
native_tokens_completion: 800
native_tokens_completion_images: (null)
native_tokens_reasoning: 87
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.012276
router: (null)
provider_responses:
  0:
    endpoint_id: "83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
    id: "6bce58a9-3a4a-4946-bf32-bebe4a79794b"
    is_byok: false
    latency: 729
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
    provider_name: "Amazon Bedrock"
    status: 200
api_type: "completions"
id: "gen-1771973087-F7PYPssS0AGRS9sOA7d2"
upstream_id: "6bce58a9-3a4a-4946-bf32-bebe4a79794b"
total_cost: 0.012276
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
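The reported cost can be reproduced from the native token counts in the metadata above. The sketch below is illustrative only: the per-token prices ($3 per million input tokens, $15 per million output tokens) are assumptions that do not appear on this page, chosen because they reproduce the reported usage of 0.012276. The function and constant names are hypothetical.

```python
# ASSUMPTION: prices in USD per million tokens; not stated on the run page.
ASSUMED_INPUT_PRICE_PER_MTOK = 3.00
ASSUMED_OUTPUT_PRICE_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    total_micro_usd = (
        prompt_tokens * ASSUMED_INPUT_PRICE_PER_MTOK
        + completion_tokens * ASSUMED_OUTPUT_PRICE_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# native_tokens_prompt = 92, native_tokens_completion = 800 (from the metadata)
cost = estimate_cost(prompt_tokens=92, completion_tokens=800)
print(f"{cost:.6f}")  # prints 0.012276
```

Under these assumed prices, almost all of the cost comes from the 800 completion tokens (0.012000) rather than the 92 prompt tokens (0.000276).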
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 452
78.76% | Dialogue to Total Word Ratio | Ratio: 72.19%, Deviation: 2.19% | neededClean: false, wordsTotal: 453, wordsDialogue: 327
39.3784% | Overall
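The ratio and deviation in the details can be recomputed from the reported word counts. The sketch below assumes that the deviation is the absolute difference from the 70% target named in the scenario, and that the overall score is the unweighted mean of the two evaluator scores. Both assumptions match the figures shown but are not confirmed by the page. How the 78.76% score is derived from the deviation is not stated, so that value is taken as given. All names are hypothetical.

```python
TARGET_RATIO = 0.70  # from the scenario: 70% dialogue


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that appear inside dialogue."""
    return words_dialogue / words_total


# wordsDialogue = 327, wordsTotal = 453 (from the evaluator metadata)
ratio = dialogue_ratio(327, 453)
# ASSUMPTION: deviation is the absolute distance from the target ratio.
deviation = abs(ratio - TARGET_RATIO)

# ASSUMPTION: overall is the unweighted mean of the evaluator scores.
evaluator_scores = [0.00, 78.76]  # percent, as displayed (rounded)
overall = sum(evaluator_scores) / len(evaluator_scores)

print(f"ratio={ratio:.2%} deviation={deviation:.2%} overall={overall:.2f}%")
```

Because the displayed 78.76% is rounded, the mean comes out as 39.38%, which agrees with the reported 39.3784% to two decimal places.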