NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYZA091CFN2ZTJHGY7RHXD7
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled like stale coffee and regret, which Molly had come to associate with every job Bill dragged her into. She peered through binoculars at the suburban house across the street, where a inflatable bounce castle swayed ominously in the afternoon breeze like a primary-colored tumor growing from the manicured lawn. "Run it by me again," she said, not lowering the binoculars. "Because I want to make sure I understand exactly how far we've fallen." Bill smoothed out the hand-drawn map on the dashboard. He'd used crayons, which Molly found either ironically appropriate or deeply concerning. "Okay, so the birthday boy is turning seven. His parents are loaded. We're talking surgeon dad, lawyer mom. These gifts aren't going to be socks and underwear, Molly. We're looking at gaming consoles, tablets, probably some of those robot dogs that cost more than my car." "Everything costs more than your car, Bill." He ignored her. "Entry point is the sliding glass door on the east side. The presents are staged in the dining room, away from the main party action in the backyard. We slip in during the magic show, grab what we can carry, and we're out before anyone notices." Molly finally lowered the binoculars and turned to face her partner. Bill wore his serious heist face, which unfortunately looked identical to his confused-by-a-restaurant-menu face. She'd known him for fifteen years, and somehow he still managed to surprise her with the depth of his terrible ideas. "There are going to be thirty children in that backyard." "Witnesses make terrible witnesses when they're hopped up on sugar and focused on a man pulling scarves from his sleeve." "And their parents?" "Drinking wine and questioning their life choices. Trust me, I've been to these things. The adults check out mentally around hour two. We're hitting at hour three. They'll be practically comatose." Molly rubbed her temples. Six months ago, they'd been planning a job on a diamond exchange. Granted, that had fallen through spectacularly when Bill accidentally triggered the alarm by sneezing, but at least it had dignity. Now she was sitting in a rust-bucket van, watching a woman in yoga pants carry a cake shaped like a dinosaur into a house they were about to burgle. "The getaway route?" she asked, because apparently she was actually considering this. Bill's face lit up like Christmas morning, which was painfully ironic given their target. He traced a crayon line through the neighborhood streets with his finger. "Three blocks to the highway on-ramp. I've timed it. Four minutes if we hit the lights right." "And if we don't?" "Then we'll be stuck in traffic next to a minivan full of the kids whose birthday we just ruined." Molly stared at him for a long moment. The bounce castle continued its hypnotic swaying. Somewhere inside that house, a seven-year-old was about to have the worst birthday of his young life. She reached for her black gloves. "I want sixty percent."
created_at
"2026-02-08T15:53:30.795875+00:00"
Sunday, February 8th, 2026 at 3:53:30 PM GMT+00:00
model
"anthropic/claude-4.5-opus-20251124"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1627
moderation_latency
164
generation_time
17470
tokens_prompt
61
tokens_completion
625
native_tokens_prompt
75
native_tokens_completion
709
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0181
router
(null)
provider_responses
0
id
"d8006b3a-7457-4e6a-a9b7-5f2f2df0fe00"
status
200
is_byok
false
latency
1626
endpoint_id
"de32581c-2e11-4f54-8d29-e8a6e19d3038"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-opus-20251124"
api_type
"completions"
id
"gen-1770565992-RypT6y0GUHqVFLBbMJ3m"
upstream_id
"d8006b3a-7457-4e6a-a9b7-5f2f2df0fe00"
total_cost
0.0181
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
99.7443%
Matches word count
n/a
neededClean
false
words
504
0.0007%
Dialogue to Total Word Ratio
Ratio: 45.44%, Deviation: 15.44%
neededClean
false
wordsTotal
504
wordsDialogue
229
49.8725%