NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYXCY6SGYY770JK8FKYXNRX
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her binoculars against the window of the stolen ice cream truck, watching children stream into the colonial-style house across the street. Balloons bobbed from every available surface, and a inflatable bouncy castle dominated the front lawn like a neon fortress. "This is it," she said. "The Henderson kid's birthday party." Bill adjusted his fake mustache in the rearview mirror, though Molly had told him three times it made him look like a divorced accordion player. "You sure the intel's good? Last time we broke into a kid's party, it was all homemade cookies and participation ribbons." "Trust me. Henderson's parents are tech millionaires. That gift table's gonna be loaded with iPads, gaming consoles, maybe even a hoverboard." "Hoverboards aren't real hoverboards. They're just expensive scooters that catch fire." "Even better. We can flip them as fire hazards to insurance fraudsters." Bill drummed his fingers on the steering wheel, eyeing the chaos of arriving minivans and harried parents. A clown car pulled up, and an actual clown emerged, armed with balloon animals and what appeared to be deep existential dread. "What's our cover story again?" Molly reached into the back and pulled out two cardboard boxes. "We're the entertainment. Magical Mike and his Mystical Assistant. We do the show, case the present situation, then make our move during the bouncy castle rush." "I hate being the assistant." "You lost the coin toss." "You used your weighted coin." "Prove it," Molly said, pocketing the binoculars. She slapped on a glittery top hat that had seen better decades. "Remember, we're professionals. We get in, we perform adequate magic, we grab the goods during the great bounce house migration, and we get out." They crossed the street carrying their boxes, nodding at the clown who was already looking defeated by a group of sugar-loaded seven-year-olds. The front door was open, and inside, the Henderson house was a shrine to excess. The gift table groaned under wrapped boxes, easily twenty or thirty presents stacked like a department store display. "Holy jackpot," Bill whispered. Mrs. Henderson appeared, wearing the desperate smile of someone who'd had too much coffee and not enough sleep. "Are you the magicians?" "Magical Mike and assistant," Molly said in what she hoped was a mysterious voice. "We're here to astound and amaze." "Great, you're early. Can you wait in the kitchen? We're still doing pizza." They shuffled toward the kitchen, passing the gift table. Bill's eyes were gleaming with the special kind of madness that only valuable electronics could inspire. Through the window, Molly could see the bouncy castle waiting, patient as a synthetic accomplice. "This is going to work," she muttered. "What was that?" "I said, let's hope we don't get caught by a bunch of eight-year-olds." Bill grinned, adjusting his terrible mustache one more time. "Where's your sense of adventure?"
created_at
"2026-02-08T15:20:07.967319+00:00"
Sunday, February 8th, 2026 at 3:20:07 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1757
moderation_latency
130
generation_time
15412
tokens_prompt
61
tokens_completion
626
native_tokens_prompt
75
native_tokens_completion
718
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.010995
router
(null)
provider_responses
0
id
"7effc825-ae41-47d2-ac5a-ae10fc01d6eb"
status
200
is_byok
false
latency
1757
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770563991-8Kj40aN6Mpzq1WF7r83c"
upstream_id
"7effc825-ae41-47d2-ac5a-ae10fc01d6eb"
total_cost
0.010995
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
9.6081%
Matches word count
n/a
neededClean
false
words
478
0.0545%
Dialogue to Total Word Ratio
Ratio: 42.26%, Deviation: 12.26%
neededClean
false
wordsTotal
478
wordsDialogue
202
4.8313%