NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WR00Y0XTMQARF0AYT8VM3
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
18.7s
Cost:
$0.021285
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
71.70%
Dialogue to Total Word Ratio
(details)
85.8508%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of the minivan, smoothing out the crayon marks with her thumb. Bill leaned over her shoulder, squinting through his ski mask, which he had already put on despite the fact that the party was not until Saturday. "Take that off," Molly said. "It's Tuesday. You look like you're about to rob a gas station." "I'm getting into character," Bill said. He pulled the mask up so it sat on his head like a beanie. "Now walk me through the layout again." Molly tapped the map with a pen. The blueprint had been drawn by her seven-year-old nephew, which meant the kitchen was roughly the size of a football field and the bathroom appeared to be located inside of a tree. Still, it was the best intelligence they had. "The presents will be stacked on a table in the living room, here," she said, circling a lumpy rectangle near something labeled COUCH OR DOG. "The party starts at two. By two fifteen every child in there will be completely deranged on sugar. That's our window." Bill folded his arms and nodded slowly, the way he always did when he was pretending to understand something. "And the parents?" "Exhausted. Defeated. Emotionally absent," Molly said. "They won't notice a thing. We go in through the back door, grab the presents, and load them into the van. Ninety seconds, tops." "What about the birthday boy?" Bill asked. "Kids are perceptive, Molly. They notice stuff. Last week my neighbor's kid told me I had the posture of a shrimp." "He's turning six. He'll be face-deep in cake," Molly said. "Besides, I've arranged a distraction. I hired a clown." Bill's face went pale. "A clown?" "Don't start," Molly said. "You know how I feel about clowns," Bill said, his voice rising to a pitch usually reserved for tea kettles. "You promised after the Cincinnati job that there would be no more clowns." Molly sighed and pinched the bridge of her nose. The Cincinnati job had been a disaster for reasons entirely unrelated to the clown, but Bill had fixated on it like a dog with a grudge. "The clown will be in the front yard," she said. "You will be in the back of the house. At no point will you and the clown occupy the same space. I have drawn a diagram." She produced a second piece of paper from her jacket. Bill studied the diagram. It was considerably more professional than the map. "Fine," he said. "But if that clown so much as honks a horn in my direction, I'm out." "Noted," Molly said. She rolled up both documents and tucked them under her arm. "Now, we need to talk about the getaway vehicle. I'm thinking we ditch the minivan and use something less conspicuous." Bill looked at the minivan. It had a bumper sticker that read MY OTHER CAR IS ALSO STOLEN. "What did you have in mind?" he asked. Molly grinned. "Ice cream truck."
created_at
"2026-02-24T22:36:31.656Z"
2/24/2026, 10:36:31 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1382
moderation_latency
171
generation_time
18525
tokens_prompt
77
tokens_completion
822
native_tokens_prompt
92
native_tokens_completion
833
native_tokens_completion_images
(null)
native_tokens_reasoning
90
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.021285
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"f39ced28-708b-43d2-a54a-3f56cbc1b8bc"
is_byok
false
latency
1349
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972591-SzGZ2Y9am28j8JJuD4Tk"
upstream_id
"f39ced28-708b-43d2-a54a-3f56cbc1b8bc"
total_cost
0.021285
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
71.70%
Dialogue to Total Word Ratio
Ratio: 47.42%, Deviation: 2.58%
neededClean
false
wordsTotal
504
wordsDialogue
239
85.8508%