NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8X809ZJ0MDZ3HV004BT1B9
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
23.1s
Cost:
$0.021560
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
17.5018%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of the van and anchored one corner with a half-eaten granola bar. The map was done in crayon, which she felt added a certain poetic justice to the operation. "Alright, here's the layout," she said, tapping a wobbly blue rectangle. "The Patterson house. Party starts at two. We've got a bounce castle on the east lawn, a balloon artist working the patio, and a pile of presents roughly the size of a Fiat stacked in the living room." Bill leaned over the map, squinting. "Why is the house shaped like a dinosaur?" "I traced it from the kid's birthday invitation. Focus, Bill." "I am focused. I'm just saying, architecturally, that's a bold choice for a split-level ranch." Molly pinched the bridge of her nose. She had pulled off a jewel heist in Monaco with fewer logistical headaches than this conversation. "The presents will be unguarded between two fifteen and two forty-five," she continued. "That's when they do the piƱata. Every adult in that house will be outside watching children swing a bat, which frankly is the only thing worth watching at these events." "What about security?" Bill asked. "There's a golden retriever named Biscuit. He's old. He's slow. I've been feeding him deli meat through the fence for two weeks. He loves me more than he loves that family." Bill crossed his arms. "And the entry point?" "Side door off the kitchen. The lock is a joke. I've seen better security on a diary." "I had a diary with a lock once," Bill said, staring into the middle distance. "My sister picked it in four seconds." "Your sister should be on this crew instead of you." "She's a dentist now. Went straight. Real tragedy." Molly rolled up the map and stuffed it into her jacket. The afternoon sun bounced off the van's dented panels, and somewhere down the street a child screamed with the boundless, unhinged joy that only a sugar high could produce. She shuddered. "Once we're inside, we load everything into the duffel bags. We do not open any presents. We do not shake any presents. Last time you shook a box and set off a singing fish toy and we almost got caught at that retirement home." "That fish haunts me, Molly." "It haunts all of us. Now, getaway route. We go out the side door, cut through the neighbor's yard, and we're back at the van in ninety seconds." Bill nodded slowly, then frowned. "Do you ever feel bad about this? Stealing from kids?" Molly stared at him. "Bill, that pile contains at least fourteen gaming consoles, a drone, and something called a Squishmallow that resells for sixty bucks. These kids live better than we do." "Fair point." "Now put on your clown costume." Bill looked at the rainbow wig and red nose sitting on the passenger seat. He picked them up with the grim resignation of a man who had made every wrong decision available to him. "I want a bigger cut this time," he said. "Steal bigger presents," Molly said, and started the van.
created_at
"2026-02-24T22:45:16.232Z"
2/24/2026, 10:45:16 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1454
moderation_latency
204
generation_time
22849
tokens_prompt
77
tokens_completion
854
native_tokens_prompt
92
native_tokens_completion
844
native_tokens_completion_images
(null)
native_tokens_reasoning
75
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.02156
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"e879836f-3ad6-4451-9cb1-a0794f9f5692"
is_byok
false
latency
1421
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771973116-sCS674uDMxsTkxCGAHi2"
upstream_id
"e879836f-3ad6-4451-9cb1-a0794f9f5692"
total_cost
0.02156
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
518
0.00%
Dialogue to Total Word Ratio
Ratio: 64.94%, Deviation: 14.94%
neededClean
false
wordsTotal
522
wordsDialogue
339
17.5018%