NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WTFPF448YB49EZTZN0A13
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 50s
Cost:
$0.233135
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
96.25%
Dialogue to Total Word Ratio
(details)
96.1206%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn blueprint across the hood of the van, weighing down one corner with a half-eaten burrito and the other with a stapler she'd stolen from a dentist's office three weeks prior. The blueprint, rendered in purple crayon on the back of a Denny's placemat, depicted the Henderson backyard in meticulous detail. There was the bounce house on the north perimeter. The face-painting station to the east. And there, in the center of it all, like a dragon's hoard guarded by a dozen sugar-crazed six-year-olds, sat the present table. "We go in at fourteen hundred hours," Molly said, tapping the placemat. Bill squinted at his watch. "That's in twenty minutes. I haven't even had lunch." "Focus. This is the Henderson job. You know what kind of presents these rich kids bring? Name-brand toys. Gift cards. Possibly cash in little envelopes." She pulled a duffel bag from the back seat and unzipped it, revealing two matching clown costumes, a bag of balloon animals that came pre-twisted because neither of them had ever figured out the real ones, and a suspicious amount of glitter. Bill held up one of the clown wigs, a toxic shade of orange that seemed to vibrate against the afternoon light. He regarded it the way a man regards his own coffin. "I'm not wearing this," he said. "You are absolutely wearing it. The theme is clowns, Bill. We have to blend in." "The theme is princesses. I read the invitation." "What invitation?" Bill reached into his back pocket and produced a crumpled card decorated with cartoon tiaras. Molly snatched it from his hands and studied it, her left eye twitching the way it always did when a plan began to unravel. She recovered quickly. She was a professional, after all, or at least she told herself that during the quiet hours of the night when the doubts crept in and she questioned every life choice that had led her to robbing children's birthday parties in the suburbs. "New plan. We go in as entertainment. You juggle, I distract the parents, and we load the presents into the van during the piñata. Nobody pays attention to anything once there's a piñata." "I can't juggle," Bill said. "You had one job this week. One job. Learn to juggle." "I learned to ride a unicycle instead. I thought it'd be more impressive." Molly stared at him for a long moment, the silence broken only by distant children screaming in that unhinged way that could mean either pure joy or total anarchy. She pinched the bridge of her nose. "Do you have the unicycle with you?" "It's in the trunk." She considered this. A unicycling clown at a princess party was hardly ideal, but Molly had pulled off worse jobs with less. There was the pet store heist, executed with nothing but a trench coat and a convincing parrot impression. They'd gotten away with fourteen hamsters and their dignity mostly intact. "All right," she said, folding up the placemat. "We improvise. Like always." Bill smiled. "Like always."
created_at
"2026-02-24T22:37:53.239Z"
2/24/2026, 10:37:53 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1359
moderation_latency
212
generation_time
109676
tokens_prompt
77
tokens_completion
2325
native_tokens_prompt
92
native_tokens_completion
9307
native_tokens_completion_images
(null)
native_tokens_reasoning
1548
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.233135
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"cb70845b-da5a-4451-9b1b-a7b35770457d"
is_byok
false
latency
1337
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972673-6VeoJDrFGjwRn89RfFDc"
upstream_id
"cb70845b-da5a-4451-9b1b-a7b35770457d"
total_cost
0.233135
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
508
96.25%
Dialogue to Total Word Ratio
Ratio: 29.13%, Deviation: 0.87%
neededClean
false
wordsTotal
515
wordsDialogue
150
96.1206%