NC Bench
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGZ08CCX2Y18W4BMM911FWRA
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her back against the fence and peered through a knothole at the chaos unfolding in the backyard. Streamers dangled from trees, balloons bobbed in clusters, and approximately seventeen sugar-fueled seven-year-olds screamed with the intensity of air raid sirens. "This is your brilliant score?" Bill whispered, crouching beside her. "A kid's birthday party?" "Not just any party. Little Sophia Carmichael's party. Her parents are the Carmichaels." Molly raised her eyebrows meaningfully. "Am I supposed to know who that is?" "Tech billionaires. They just launched that new cryptocurrency thing. These presents aren't Lego sets and stuffed animals, Bill. Think solid gold toy cars, diamond-encrusted dollhouses, probably a pony wearing a Rolex." Bill squinted through the fence. "I see a piñata shaped like a unicorn." "Obviously a decoy." "It's leaking Tootsie Rolls." Molly waved dismissively. "The good stuff's on that present table by the gazebo. See that enormous box wrapped in gold paper?" "That could literally be anything." "It's opportunity, Bill. Opportunity wrapped in fourteen-carat dreams." She checked her watch. "The clown arrives in eight minutes for his forty-five-minute set. That's our window." "Wait, what clown?" "Zippy the Magnificent. I checked their event schedule on the family calendar app. Amateurs left it public." Bill rubbed his temples. "So your plan is to waltz into a backyard full of witnesses during a clown performance?" "Everyone will be watching Zippy. We'll blend in, grab the high-value targets, exit through the side gate." "Molly, we're thirty-four and twenty-nine years old respectively. We won't exactly blend with the elementary school crowd." "That's why I brought disguises." Molly unzipped her backpack, pulling out two party hats and stick-on mustaches. Bill stared. "You can't be serious." "Would you prefer the balloon animal costume?" "There's a balloon animal costume?" "I came prepared for multiple scenarios."
Molly affixed her mustache and party hat. "How do I look?" "Like a thirty-four-year-old woman wearing a party hat and a mustache." "Perfect. Exactly like Sophia's weird Aunt Bernard." "That's not a thing." "It is now." Molly straightened her shoulders. "Look, last week you said you wanted to get back to basics. Small jobs, low risk, quick turnaround." "I was thinking unlocked cars, not grand larceny at a seven-year-old's celebration." The sound of accordion music suddenly blasted from the backyard. Children shrieked with delight. "That's Zippy's entrance music. We're on." Molly adjusted her mustache one final time. Bill sighed deeply, accepting his party hat. "If we get arrested, I'm claiming you kidnapped me." "Deal. Now remember, if anyone asks, you're Uncle Kevin from Minneapolis." "Why Minneapolis?" "Too far to verify." Molly grinned. "Trust me, Bill. By this time tomorrow, we'll be drinking champagne and fencing luxury toys to specialized collectors." "Or eating prison cafeteria meatloaf." "That's the spirit. Now let's go steal some presents."
created_at
"2026-02-08T16:10:06.527214+00:00"
Sunday, February 8th, 2026 at 4:10:06 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1657
moderation_latency
198
generation_time
17637
tokens_prompt
61
tokens_completion
676
native_tokens_prompt
75
native_tokens_completion
809
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01236
router
(null)
provider_responses
0
id
"d9157cfb-4776-4772-8854-1842403944be"
status
200
is_byok
false
latency
1657
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770566988-dkpJRmzqF3DT2lFDFS2I"
upstream_id
"d9157cfb-4776-4772-8854-1842403944be"
total_cost
0.01236
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
6.0907% | Matches word count | n/a | neededClean: false; words: 477
0.0000% | Dialogue to Total Word Ratio | Ratio: 73.79%, Deviation: 23.79% | neededClean: false; wordsTotal: 477; wordsDialogue: 352
3.0453%
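The ratio and deviation reported above can be reproduced from the two counts in the metadata (352 dialogue words out of 477 total, against the 50% target in the scenario). The sketch below is only an illustration: the function names `ratio_and_deviation` and `dialogue_stats` are hypothetical, and the rule of counting words between straight double quotes is an assumption, since the page does not show how NC Bench's evaluator actually counts dialogue or converts the deviation into a score. The final 3.0453% figure is consistent with an unweighted mean of the two evaluator results, which is also an inference rather than something the page states.

```python
import re


def ratio_and_deviation(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, deviation from target) for the given word counts."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return ratio, abs(ratio - target)


def dialogue_stats(text: str, target: float = 0.5) -> dict:
    """Count words inside straight double quotes versus all words.

    Assumption: dialogue is whatever sits between pairs of '"' characters.
    Curly quotes and unbalanced quotes are not handled in this sketch.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_total = len(text.split())
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio, deviation = ratio_and_deviation(words_dialogue, words_total, target)
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": deviation,
    }


if __name__ == "__main__":
    # Counts taken from the evaluation metadata on this page.
    ratio, deviation = ratio_and_deviation(352, 477)
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

    # Unweighted mean of the two evaluator results (an inferred relationship).
    print(f"Mean of results: {(6.0907 + 0.0) / 2:.4f}%")
```

Running `dialogue_stats` on the model response should give counts close to the reported ones, though small differences are possible if the real evaluator tokenizes words or treats punctuation differently.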