NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYWYGYQTCFMSD1CNFSN0MGA
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn blueprint across the sticky table at Denny's, pushing aside a plate of cold mozzarella sticks to make room. The schematic, rendered in what appeared to be purple crayon, depicted the interior of a suburban house with alarming tactical precision. Little arrows indicated entry points, and someone had drawn stars around what was labeled THE GIFT TABLE in blocky capitals. "We hit at fourteen hundred hours," Molly said, tapping the blueprint with a french fry. "That's when the magician starts his set in the backyard. Every kid, every parent, every sticky little eyewitness will be watching a man pull scarves out of his sleeves for twenty minutes." Bill leaned forward and squinted at the drawing. He had the physique of a retired linebacker who had discovered pasta and never looked back, and the booth creaked under his shifting weight. A bead of sweat formed on his forehead, though whether from nerves or the proximity of the mozzarella sticks was unclear. "What kind of presents are we talking here?" he asked. "The kid's father is a tech executive. We're talking high-end gifts. Premium LEGOs. Possibly a drone. One source confirmed a PlayStation." "Source?" "The birthday boy's Instagram. He has three thousand followers, Bill. The kid is eight." Bill absorbed this information with the quiet horror of a man who had never received more than a gift card to Applebee's for any occasion in his life. He studied the blueprint again and noticed a section near the back door marked with crosshatched shading and the words DANGER ZONE. "What's this area here?" "Bounce house," Molly said gravely. "It's positioned right next to the sliding glass door, which is our primary exit. We'll need to move fast because those things are unpredictable. Kids go in sober and come out feral. I saw a seven-year-old knock a grown man unconscious at a party in Scottsdale." "This feels wrong," Bill said, though his tone carried the resignation of a man who already knew he was going to do it anyway. Molly folded her arms and fixed him with the stare she usually reserved for parking enforcement officers and people who talked during movies. She had planned this operation for three weeks, cross-referencing the party invitation they had intercepted from a neighbor's mailbox with satellite imagery of the property and Yelp reviews of the magician, who averaged two and a half stars and was described by multiple parents as both disappointing and weirdly aggressive. "We'll disguise ourselves as balloon artists. I already bought the helium tank. You know how to make a poodle?" "I can make a snake." "Everyone can make a snake, Bill. That's just a balloon." He conceded the point with a shrug. Outside the Denny's window, suburbia sprawled in its beige indifference, utterly unaware that its most elaborate heist was being orchestrated between a Grand Slam breakfast and a refillable coffee. Molly rolled up the blueprint and tucked it into her fanny pack with the ceremonial precision of a general archiving battle plans. The mozzarella sticks had gone completely cold. Neither of them cared.
created_at
"2026-02-08T15:12:20.028825+00:00"
Sunday, February 8th, 2026 at 3:12:20 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1309
moderation_latency
178
generation_time
20089
tokens_prompt
61
tokens_completion
629
native_tokens_prompt
75
native_tokens_completion
737
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0188
router
(null)
provider_responses
0
id
"264235fa-e924-46ac-b03e-3635517a671a"
status
200
is_byok
false
latency
1309
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770563519-C3fwU5YSQD74DLfjpjCk"
upstream_id
"264235fa-e924-46ac-b03e-3635517a671a"
total_cost
0.0188
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
27.1658%
Matches word count
n/a
neededClean
false
words
519
53.7064%
Dialogue to Total Word Ratio
Ratio: 33.53%, Deviation: 3.53%
neededClean
false
wordsTotal
519
wordsDialogue
174
40.4361%