NC Bench
Run: 2026-02-24-01KJ8X8WW6YYFQP8QK1CAQAP13
Run Details
Model: Claude Opus 4.6 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 23.2s
Cost: $0.021085
Provider: Amazon Bedrock
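The reported cost can be cross-checked against the native token counts listed in the response metadata on this page (native_tokens_prompt 92, native_tokens_completion 825). The sketch below is only a sanity check: the per-token rates are an assumption, not something this page states, chosen because they happen to reproduce the reported total. The function name `estimate_cost` is hypothetical.

```python
# Native token counts copied from the response metadata on this page.
NATIVE_TOKENS_PROMPT = 92
NATIVE_TOKENS_COMPLETION = 825

# ASSUMED rates in USD per million tokens. These are not stated anywhere
# on this page; they are illustrative values that reproduce the total.
ASSUMED_PROMPT_RATE = 5.00
ASSUMED_COMPLETION_RATE = 25.00


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  prompt_rate: float = ASSUMED_PROMPT_RATE,
                  completion_rate: float = ASSUMED_COMPLETION_RATE) -> float:
    """Return an estimated cost in USD from token counts and per-million rates."""
    return (prompt_tokens * prompt_rate
            + completion_tokens * completion_rate) / 1_000_000


estimated = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
print(f"Estimated cost: ${estimated:.6f}")  # compare with the reported $0.021085
```

If the estimate and the reported figure diverge for another run, the assumed rates (or the assumption that billing uses native token counts) are the first things to revisit.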
Evaluation summary
Score | Evaluator | Details Link
95.99% | Matches word count | (details)
0.45% | Dialogue to Total Word Ratio | (details)
48.2174% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of the van, smoothing out a wrinkle that bisected what appeared to be a bouncy castle. Bill leaned over her shoulder, squinting.
"Walk me through it again," Bill said. "And this time, try not to make it sound like we're storming Normandy."
"Fine. The party starts at two. We arrive at one forty-five disguised as balloon artists. Nobody questions balloon artists, Bill. They're like plumbers but happier."
"I can't make balloon animals."
"You don't have to make balloon animals. You just have to carry a bag of uninflated balloons and look whimsical."
Bill scratched his chin. "I don't think I've ever looked whimsical in my life."
"Then this will be a growth experience for you." Molly tapped the map with a pen. "The presents will be stacked here, on a folding table near the back fence. Based on the Facebook event page, we're looking at roughly thirty kids, which means thirty presents minimum. Some overachievers will bring two. We could be talking about forty gifts, Bill. Forty."
"What kind of gifts are we expecting here? Because last time you said the take would be impressive and I ended up with six copies of Candy Land and a hamster."
"The hamster was a bonus. You loved that hamster."
"The hamster bit me every single day for two years."
"That's how hamsters show love. Now focus. The birthday boy's father is a dentist, which means this is an upper-middle-class affair. We're talking premium Lego sets. Name-brand action figures. Possibly a bicycle with a bow on it."
Bill's eyes widened slightly. "A bicycle?"
"With a bow on it. You see why I brought this to you."
"Who's running security?"
Molly checked her notes. "The mom. Sandra. She's five-foot-two, runs a book club, and based on her online reviews, she has strong opinions about scented candles. I think we can handle Sandra."
"What about the dad?"
"He'll be manning the grill. No man has ever left a grill during a party, Bill. It's a biological impossibility. He is neutralized by charcoal."
"And the kids?"
"What about the kids?"
"They're witnesses, Molly."
"They're six. Who's going to believe them? Half of them still think the moon follows their car."
Bill folded his arms and stared at the bouncy castle on the map. "How do we extract the presents?"
"I back the van up to the fence at exactly two-thirty during the magic show. Every child will be watching a man pull a rabbit from a hat. You load the gifts. I drive. We're gone before the first kid cries."
"What if the magician is bad and the kids lose interest early?"
Molly paused. She hadn't considered this. "Then we improvise."
"Improvise," Bill repeated flatly.
"That's what separates professionals from amateurs."
Bill sighed and pulled a deflated balloon from the supply bag, holding it between two fingers like evidence at a crime scene. "I want the bicycle," he said.
"Done."
created_at: "2026-02-24T22:45:45.488Z" (2/24/2026, 10:45:45 PM)
model: "anthropic/claude-4.6-opus-20260205"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1408
moderation_latency: 161
generation_time: 22981
tokens_prompt: 77
tokens_completion: 828
native_tokens_prompt: 92
native_tokens_completion: 825
native_tokens_completion_images: (null)
native_tokens_reasoning: 61
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.021085
router: (null)
provider_responses:
0:
endpoint_id: "a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id: "048ac59e-a5f6-4728-b386-eef3b92e8e34"
is_byok: false
latency: 1373
model_permaslug: "anthropic/claude-4.6-opus-20260205"
provider_name: "Amazon Bedrock"
status: 200
api_type: "completions"
id: "gen-1771973145-QIPTPOTMLdcuARNifqh1"
upstream_id: "048ac59e-a5f6-4728-b386-eef3b92e8e34"
total_cost: 0.021085
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
95.99% | Matches word count | n/a | neededClean: false, words: 492
0.45% | Dialogue to Total Word Ratio | Ratio: 80.40%, Deviation: 10.40% | neededClean: false, wordsTotal: 500, wordsDialogue: 402
48.2174% | Overall | |
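The ratio and deviation reported for the dialogue evaluator follow directly from the word counts in its meta data and the 70% target in the scenario. The sketch below reproduces that arithmetic. It also checks one guess about the overall score: that it is the unweighted mean of the two evaluator scores. That is an assumption, not something this page documents, and the mapping from a 10.40% deviation to a 0.45% score is not reproduced here because the page does not state it.

```python
# Values copied from the evaluation details on this page.
TARGET_RATIO = 0.70    # "70% dialogue" from the scenario
words_total = 500      # wordsTotal
words_dialogue = 402   # wordsDialogue

# Share of words that are dialogue, and its distance from the target.
ratio = words_dialogue / words_total
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# ASSUMPTION: the overall score is the unweighted mean of the evaluator
# scores. The displayed scores are rounded, so only approximate agreement
# with the reported 48.2174% can be expected.
evaluator_scores = [95.99, 0.45]
overall_estimate = sum(evaluator_scores) / len(evaluator_scores)
print(f"Overall estimate: {overall_estimate:.2f}%")
```

The response overshoots the requested dialogue share by roughly ten percentage points, which is what drives the near-zero score on that evaluator.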