NC Bench
Run: 2026-02-24-01KJ8WMC492AVSSK6QKCVHT094
Run Details
Model: Claude Opus 4.6 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 41s
Cost: $0.246635
Provider: Amazon Bedrock
Evaluation summary
Score      Evaluator
99.38%     Matches word count
66.12%     Dialogue to Total Word Ratio
82.7460%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly spread the hand-drawn blueprint across the sticky table at Denny's, knocking over Bill's orange juice in the process. The map, rendered in colored pencil on the back of a placemat from another restaurant, depicted the suburban home of one Kevin Fletcher, age seven, whose birthday party was scheduled for Saturday at two o'clock. Bill dabbed at the juice creeping toward his lap and studied the blueprint with the seriousness of a man who had not, three weeks ago, gotten his belt loop caught on a fence during a botched warehouse job and dangled there until morning.
"Walk me through it again," he said, pulling a soggy napkin from beneath the map.
Molly tapped a purple circle near the back of the house. "This is the bouncy castle. They're inflating it at one thirty. Every kid at the party will be on that thing like flies on a carcass. That's our window."
"And the presents?"
"Front living room. Piled on a table next to a life-size cardboard cutout of some dinosaur from a cartoon I don't recognize. We go in through the garage. Kevin's dad always leaves it cracked because he smokes out there and his wife doesn't know."
Bill leaned back in the booth and crossed his arms. His leather jacket creaked in a way he clearly thought was intimidating but mostly sounded like a sad balloon.
"How do you know so much about this family?" he asked.
"I'm in Kevin's mom's book club," Molly said without shame.
"You joined a book club to case a seven-year-old's house?"
"I joined a book club because I enjoy reading, Bill. The casing was a bonus."
The waitress refilled their coffees and glanced at the blueprint. Molly flipped it over, revealing the placemat side, which advertised a kids' meal called the Buckaroo Burger. The waitress moved on, unimpressed.
Bill rubbed his temples. The last three jobs Molly had planned went sideways in spectacular fashion. There was the pet store incident. The museum debacle. The thing at the church they had both agreed never to speak of again.
But presents from a seven-year-old's birthday party represented a return to fundamentals. Low security. High reward. Probably a lot of gift cards.
"What about the clown?" he asked.
Molly's expression darkened. "What clown?"
"There's always a clown, Molly. You never account for the clown."
She hadn't accounted for a clown. This was evident from the way her left eye twitched and her hand moved instinctively toward the pepper spray in her purse. Clowns were an unpredictable variable. They roamed freely, had access to all areas of the party, and worst of all, they were already in disguise, which meant they were essentially working the same job from the other side.
"We'll handle the clown if there's a clown," she said firmly.
"That's not a plan."
"It's a philosophy. Plans come later. Finish your eggs."
Bill finished his eggs. He had a bad feeling about Saturday, but then again, he always had a bad feeling, and he was almost always right.
Meta
created_at: "2026-02-24T22:34:32.979Z" (2/24/2026, 10:34:32 PM)
model: "anthropic/claude-4.6-opus-20260205"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1572
moderation_latency: 167
generation_time: 100630
tokens_prompt: 77
tokens_completion: 4239
native_tokens_prompt: 92
native_tokens_completion: 9847
native_tokens_completion_images: (null)
native_tokens_reasoning: 3469
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.246635
router: (null)
provider_responses:
  0:
    endpoint_id: "a8708fb9-08ea-43bb-a3b1-165c87ef09df"
    id: "8109297c-5275-4043-a285-78265d8590ba"
    is_byok: false
    latency: 1534
    model_permaslug: "anthropic/claude-4.6-opus-20260205"
    provider_name: "Amazon Bedrock"
    status: 200
api_type: "completions"
id: "gen-1771972472-Pr0wlMJLYx5BZyuCi91R"
upstream_id: "8109297c-5275-4043-a285-78265d8590ba"
total_cost: 0.246635
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
Evaluation details
Matches word count
  Result: 99.38%
  Details: n/a
  Meta Data: neededClean: false, words: 505
Dialogue to Total Word Ratio
  Result: 66.12%
  Details: Ratio: 32.88%, Deviation: 2.88%
  Meta Data: neededClean: false, wordsTotal: 511, wordsDialogue: 168
Overall
  Result: 82.7460%
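The ratio and deviation reported for the dialogue evaluator can be reproduced from the wordsDialogue and wordsTotal values above. The following is a minimal sketch of that arithmetic only, not the evaluator's actual implementation. The function name and the 30% target constant are illustrative assumptions (the target is taken from the scenario text). How the evaluator turns the deviation into the 66.12% score is not stated on this page, so it is not modeled here. The last lines check the observation that the overall figure is close to the plain mean of the two evaluator scores, which is also an assumption about how it is derived.

```python
# Hypothetical helper: not part of NC Bench, written only to check the arithmetic.
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of total words."""
    return 100.0 * words_dialogue / words_total


# Assumed target, taken from the scenario "Write 500 words with 30% dialogue".
TARGET_PERCENT = 30.0

# Values reported in the evaluator meta data.
ratio = dialogue_ratio(words_dialogue=168, words_total=511)
deviation = abs(ratio - TARGET_PERCENT)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")

# Assumption: the overall score is the unweighted mean of the two evaluator scores.
# The rounded inputs give 82.75, within rounding distance of the reported 82.7460.
mean_score = (99.38 + 66.12) / 2
print(f"Mean of evaluator scores: {mean_score:.2f}%")
```

Note that the two evaluators report different totals for the same response (505 words versus 511), so they presumably tokenize the text differently; the ratio above uses the dialogue evaluator's own total.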