NC Bench
Run: 2026-02-24-01KJ8X7HWFVB351BJ7SJ1VMYNX
Run Details
Model: Claude Sonnet 4.6 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 18.6s
Cost: $0.013026
Provider: Amazon Bedrock
Evaluation summary
Score       Evaluator
43.38%      Matches word count
0.14%       Dialogue to Total Word Ratio
21.7578%    Overall
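The page does not say how the overall score is derived from the two evaluator scores. The sketch below checks one plausible reading, an unweighted mean, against the displayed figures. That aggregation rule is an assumption rather than something the page documents, and because the per-evaluator scores are shown rounded to two decimals, the estimate can only be expected to agree to within rounding.

```python
# Evaluator scores as displayed in the summary (percent, already rounded).
scores = {
    "Matches word count": 43.38,
    "Dialogue to Total Word Ratio": 0.14,
}

# ASSUMPTION: overall = unweighted mean of the evaluator scores.
# The page does not document the actual aggregation rule.
overall_estimate = sum(scores.values()) / len(scores)

reported_overall = 21.7578  # overall score as displayed on the page

difference = abs(overall_estimate - reported_overall)
print(f"estimate={overall_estimate:.4f}  reported={reported_overall}  diff={difference:.4f}")
```

The estimate lands within about 0.01 percentage points of the reported value, which is consistent with a simple mean computed on unrounded scores, though it does not rule out other rules.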
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of the van and jabbed her finger at a wobbly rectangle she had labeled THE TARGET. "Okay, here's what we're working with. Fourteen kids, two clowns, one bouncy castle, and a chocolate fountain. The presents are in the living room, northeast corner, unguarded." Bill squinted at the map. "Why is there a drawing of a duck?" "That's the dog." "Your dog drawing looks like a duck." "Bill. Focus." "Right, right." He crossed his arms. "So what's our entry point?" "Back garden gate. I've confirmed the latch is broken. We slip in during the pass-the-parcel round. Every single person in that house will be completely fixated on the music stopping." "Genius," Bill said. "What are we wearing?" "I have two clown costumes." Bill stared at her. "Why do we need clown costumes if we're sneaking in?" "Because there are already two clowns there. We blend in." "Molly, there aren't normally four clowns at a birthday party." "There are if two of them are criminals." Bill opened his mouth, then slowly closed it again. "I actually cannot argue with that." "I know. Now, the presents are mostly in bags, which is ideal. We take the backpacks, load up, and we're out in under three minutes. The whole operation runs like clockwork." "What about the chocolate fountain?" Molly turned to look at him very slowly. "What about it?" "I'm just saying, if we're already there." "We are not taking the chocolate fountain, Bill." "One quick dip of something? A biscuit? We bring our own biscuit?" "We are professional thieves." "Professional thieves who keep passing up a free chocolate fountain." Molly rolled up the map with considerable aggression. "I need you mentally present for this. What is your job the moment we get over that fence?" Bill straightened up. "Distraction detail. If any child spots us, I do a balloon animal." "And you practiced?" "I make an excellent sausage." "That's just a balloon that hasn't been twisted." "Kids love it. 
I tell them it's a snake." Molly pressed two fingers to her temple. "Fine. After distraction detail, you grab the bags on the left side of the pile. I take the right. We do not speak to any adults, we do not eat any cake, and we absolutely do not, under any circumstances, go near the bouncy castle." "Why would I go near the bouncy castle?" "Bill." "Last time was a different situation entirely." "We were robbing a warehouse." "A warehouse with a bouncy castle in it, which is unusual and therefore morally confusing." Molly climbed back into the van. "Just get in. We park two streets away, we walk in calmly, and by the time little Timmy blows out his candles, we are gone." Bill climbed in after her and clipped his seatbelt. "Do you think they'll have party bags?" "Get out of my van." "I'm already buckled."
created_at: "2026-02-24T22:45:01.466Z" (2/24/2026, 10:45:01 PM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 938
moderation_latency: 210
generation_time: 18256
tokens_prompt: 77
tokens_completion: 850
native_tokens_prompt: 92
native_tokens_completion: 850
native_tokens_completion_images: (null)
native_tokens_reasoning: 85
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.013026
router: (null)
provider_responses:
  0:
    endpoint_id: "83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
    id: "ff19e9fa-b6ba-4527-b571-4faa9ca09291"
    is_byok: false
    latency: 937
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
    provider_name: "Amazon Bedrock"
    status: 200
api_type: "completions"
id: "gen-1771973101-vpAQt8Y5JbW5pMSowFrI"
upstream_id: "ff19e9fa-b6ba-4527-b571-4faa9ca09291"
total_cost: 0.013026
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
Evaluation details
Result      Evaluator                       Details                             Meta Data
43.38%      Matches word count              n/a                                 neededClean: false, words: 483
0.14%       Dialogue to Total Word Ratio    Ratio: 81.48%, Deviation: 11.48%    neededClean: false, wordsTotal: 486, wordsDialogue: 396
21.7578%    Overall
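The ratio and deviation in the details follow directly from the reported word counts and the 70% target in the scenario. The sketch below reproduces that arithmetic and adds a simple way to count dialogue words in a text. The helper names are hypothetical, and the counting rule (whitespace-separated words inside straight double quotes) is an assumption. It is not the benchmark's actual evaluator, whose tokenization is not shown on the page, and it treats the deviation as an absolute difference, which is also an assumption.

```python
import re

TARGET_RATIO = 0.70  # from the scenario: 70% dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue rule)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in quoted_spans)


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Share of all words that are dialogue."""
    return words_dialogue / words_total


# Counts reported in the evaluation metadata.
ratio = dialogue_ratio(words_dialogue=396, words_total=486)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 81.48%, Deviation: 11.48%"

# Small usage example of the counting helpers on a short sample.
sample = 'She said, "Bill. Focus." He nodded.'
print(count_dialogue_words(sample), count_words(sample))  # prints "2 6"
```

The two evaluators report different totals (483 and 486 words) for the same response, so their word-splitting rules presumably differ slightly; a sketch like this one should not be expected to match either count exactly.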