NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.5
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYZVRY1GR854CQ74K7V5JSQ
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crayon-drawn map across the sticky surface of the McDonald's table and jabbed her finger at a wobbly rectangle labeled PREZENTS in purple marker. "This is where they're keeping the goods," she said with the gravity of someone planning a diamond heist. "Little Timmy's birthday party. Saturday. Two p.m. We're looking at approximately fifteen to twenty wrapped packages, maybe more if the grandparents show up." Bill leaned forward, squinting at the map. He pushed his reading glasses up his nose and frowned deeply. "Where did you even get this intelligence?" "I bribed a kid at the playground with fruit snacks. He's in Timmy's class. Solid source." Bill nodded slowly, impressed despite himself. A nearby toddler threw a chicken nugget at his head, and he ignored it with the practiced patience of a professional. "What's our entry point?" "The bouncy castle," Molly said, tapping another misshapen blob on the map. "It's positioned right next to the back fence. We go over, use the castle as cover, and slip into the garage where they're staging the gifts before the big reveal." "A bouncy castle," Bill repeated flatly. "You want me to infiltrate via inflatable structure. I'm fifty-three years old, Molly." "You wanted back in the game. This is the game now." Bill sighed and rubbed his temples. Twenty years ago, they'd lifted a Picasso from a private collector in Monaco. Now they were targeting a suburban garage in New Jersey for what would probably amount to Legos and gift cards to Target. "What about security?" "One golden retriever named Biscuit. Extremely friendly. I've already established rapport during my reconnaissance walks. He'll be no problem." "And the parents?" Molly grinned, sliding a crumpled flyer across the table. It advertised a magician named The Amazing Roberto. "Entertainment starts at two-thirty. Every adult eye will be on Roberto pulling rabbits out of hats while we're loading the van." Bill studied the flyer, then looked back at his partner of three decades. She was wearing a disguise consisting of a sun hat and novelty sunglasses shaped like stars. They were sitting in a fast food restaurant planning the lowest-stakes robbery of their careers. "Why are we doing this again?" "Because Timmy's father is Ronald Beckwith." Bill's expression transformed instantly. "The Ronald Beckwith? The one who ratted out Jimmy the Fingers?" "The very same. This isn't about the presents, Bill. This is about justice. This is about sending a message." Bill straightened in his plastic seat, newfound purpose burning in his eyes. The chicken nugget fell from his shoulder unnoticed. "What's our exit strategy?" "Ice cream truck. I've rented one for the afternoon. We drive away playing Pop Goes the Weasel, and no one suspects a thing." Molly extended her hand across the table. Bill grasped it firmly, sealing their partnership anew over cold french fries and a children's placemat. "Saturday it is."
created_at
"2026-02-08T16:03:13.110589+00:00"
Sunday, February 8th, 2026 at 4:03:13 PM GMT+00:00
model
"anthropic/claude-4.5-opus-20251124"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1782
moderation_latency
150
generation_time
17410
tokens_prompt
61
tokens_completion
629
native_tokens_prompt
75
native_tokens_completion
725
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0185
router
(null)
provider_responses
0
id
"721cb0fd-4705-4684-8399-a246f0f206cc"
status
200
is_byok
false
latency
1782
endpoint_id
"de32581c-2e11-4f54-8d29-e8a6e19d3038"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-opus-20251124"
api_type
"completions"
id
"gen-1770566575-3y65dX9vFIPXj0aK5rqr"
upstream_id
"721cb0fd-4705-4684-8399-a246f0f206cc"
total_cost
0.0185
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
35.0022%
Matches word count
n/a
neededClean
false
words
482
96.6151%
Dialogue to Total Word Ratio
Ratio: 49.17%, Deviation: 0.83%
neededClean
false
wordsTotal
482
wordsDialogue
237
65.8087%