NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8W59YJZW4N0P7RHVQ99FWE
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 22s
Cost:
$0.292535
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
75.63%
Dialogue to Total Word Ratio
(details)
37.8142%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the hood of their dented minivan with the reverence most people reserved for ancient treasure maps. It had been sketched in crayon on the back of a Denny's placemat, but it depicted the Peterson household in alarming detail, right down to the inflatable bounce house that dominated the backyard like a castle made of nylon and poor decisions. Bill squinted at it sideways. He had the look of a man who had made a long series of bad choices and was gearing up to make another one. He'd already pulled on his balaclava despite the fact that it was two in the afternoon and they were parked in a gas station lot in broad daylight. "Take that off. We don't go in for another three hours." Molly snatched a cheese puff from the bag wedged between them and used it to tap the crayon outline of the backyard. She walked him through the timeline with the calm precision of a field general. The party began at four. Gift drop-off would happen in a frantic fifteen-minute window as parents deposited their children and expensive presents on the front hall table, then fled. Nobody lingered at a seven-year-old's birthday party voluntarily. That, she explained with visible satisfaction, was the opening. Bill peeled off the balaclava. It snagged on his ears, and his hair sprang up in wild tufts of static. He studied the placemat, tracing a thick finger along the crayon lines toward the garage. "We go in through the back gate?" "Exactly. The bounce house gives us a blind spot along the entire east wall. Kids too sugared up to notice a thing, one dead-eyed parent on watch duty staring into the abyss. We slip through the sliding glass door, grab the presents, and load the van." She made it sound elegant, which was precisely how Bill knew it would all go sideways. Their last job had involved stealing a wedding cake from a country club reception and had ended with Bill waist-deep in a koi pond while Molly drove away without him. Trust, in their partnership, had always been more of a suggestion than a binding principle. "What about the dog, though? The Petersons definitely have a dog." He said this with the gravity of a man identifying a final and fatal flaw. "It's a golden retriever, Bill. That breed hasn't been suspicious of anything in its entire evolutionary history." Bill accepted this with a slow nod. He fished out another cheese puff and chewed it thoughtfully, watching the quiet suburban street ahead of them like a soldier surveying foreign and hostile terrain. "Are the presents even going to be worth all this trouble?" "This is Highland Park. These people spend more on a child's birthday gift than we spend on rent. Last year someone gave a six-year-old a tablet worth more than your car." "Everything's worth more than my car," Bill muttered. Molly folded the placemat with surgical precision and slid it into her jacket. The afternoon sun caught the minivan's chipped paint and, for one brief and deeply misleading moment, made them both look almost heroic. "Three hours. Eat your cheese puffs and stay calm." Bill pulled the balaclava back on out of pure spite.
created_at
"2026-02-24T22:26:19.227Z"
2/24/2026, 10:26:19 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1480
moderation_latency
169
generation_time
141714
tokens_prompt
77
tokens_completion
2137
native_tokens_prompt
92
native_tokens_completion
11683
native_tokens_completion_images
(null)
native_tokens_reasoning
1327
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.292535
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"5d9d28e5-bc94-42b2-b5e3-80226465e138"
is_byok
false
latency
1453
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771971979-yJNRswPZlhoi6621nYjo"
upstream_id
"5d9d28e5-bc94-42b2-b5e3-80226465e138"
total_cost
0.292535
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
542
75.63%
Dialogue to Total Word Ratio
Ratio: 27.64%, Deviation: 2.36%
neededClean
false
wordsTotal
550
wordsDialogue
152
37.8142%