NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYXW9GGR9M79QWQDDV154P2
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the hedge, binoculars pressed against her face, watching the chaos of screaming six-year-olds in the backyard. She'd seen maximum security prisons with less surveillance, though those didn't involve parents wielding smartphones like weapons. "This is insane," Bill whispered beside her, adjusting his fake mustache even though they were still fifty feet away. "It's brilliant," Molly corrected. "Look at that present table. I count at least twenty boxes." "We're robbing children. Children, Molly." She lowered the binoculars and shot him a look that could curdle milk. "We're robbing terrible parents who spend two hundred dollars on gifts for a kid who still eats paste. There's a difference." Bill peered through the fence slats, his expression somewhere between guilt and indigestion. The present table sat near the back door, unfortunately positioned between a bouncy castle and a clown making balloon animals. The clown had dead eyes that suggested he'd seen some things. Bill related to that clown. "The shift change is in ten minutes," Molly said, checking her watch. "When they bring out the cake, everyone focuses on the birthday boy. That's our window." "What shift change? It's a birthday party, not a guard rotation." "Parent attention spans. Keep up, Bill." She pulled out a rolled blueprint, which was actually just a napkin with crayon drawings. "You go in from the east, I'll create a distraction by the bouncy castle." "What kind of distraction?" "The unexpected kind. Just be ready." Bill sighed deeply, the sound of a man reconsidering every life choice that led to this hedge. Ten years ago, he was boosting cars and cracking safes. Now he was preparing to infiltrate a Paw Patrol themed birthday party. The next eight minutes crawled by like a DMV line. Then, right on schedule, two parents emerged carrying a cake the size of a small car, ablaze with candles. The crowd surged toward the picnic table. "Now!" Molly hissed. She darted toward the bouncy castle while Bill crept along the fence line. He watched as she did something near the castle's air pump, and suddenly the inflatable structure began deflating with a sad wheeze. Kids tumbled out like confused refugees. Bill reached the present table and started grabbing boxes, shoving them into his expandable duffel bag. His hands moved with practiced efficiency, even as his conscience screamed. A particularly heavy box nearly slipped from his grasp. "HEY!" someone shouted. Bill's heart stopped. He turned to see a toddler pointing at him, juice box in hand. "You're not the present man," the kid said. "Actually, I'm the present fairy," Bill improvised desperately. "Taking these to fairy headquarters for quality testing." The kid considered this. "Okay," he said, and wandered off. Bill stuffed the last box and ran.
created_at
"2026-02-08T15:28:30.46082+00:00"
Sunday, February 8th, 2026 at 3:28:30 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2304
moderation_latency
171
generation_time
15096
tokens_prompt
61
tokens_completion
598
native_tokens_prompt
75
native_tokens_completion
686
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.010515
router
(null)
provider_responses
0
id
"3d457ff5-0742-49c4-985d-c46141dfa5c2"
status
200
is_byok
false
latency
2303
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770564494-sgQudhQDt4z3vQcl9qPZ"
upstream_id
"3d457ff5-0742-49c4-985d-c46141dfa5c2"
total_cost
0.010515
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
458
73.3611%
Dialogue to Total Word Ratio
Ratio: 27.51%, Deviation: 2.49%
neededClean
false
wordsTotal
458
wordsDialogue
126
36.6805%