NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYXCJ1N3K8WWZV1ZWS7W873
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her face against the chain-link fence, watching a dozen screaming children chase each other around the backyard. A mountain of wrapped presents sat unguarded on a picnic table near the bouncy castle. "That's got to be at least two grand in toys," Bill whispered, squatting beside her. "Maybe more if the parents are loaded." "Look at that bouncy castle," Molly said. "These people have money." "So what's the plan?" Molly pulled out a crumpled piece of paper from her jacket pocket. "I've been watching the house for three days. The party started at two. Cake happens at four. That's when everyone goes inside." "And we just walk in and grab everything?" "Exactly." Molly folded the paper back up. "We'll have maybe ten minutes before someone notices." Bill scratched his stubble. "What if a kid comes back out?" "Then you distract them while I load the van." "How am I supposed to distract a kid?" Molly shrugged. "I don't know. Make balloon animals or something." "I don't know how to make balloon animals," Bill said. "Then learn in the next hour." Molly checked her watch. "We've got time." A woman emerged from the house carrying an elaborate princess cake with sparklers shooting from the top. The children swarmed toward her like moths to a flame. "That's our cue in about thirty minutes," Molly said. She stood up and brushed grass off her knees. "Let's get the van ready." "This feels wrong," Bill muttered as they walked back to their vehicle parked down the street. "Wrong? We're thieves, Bill. Nothing we do is right." "Yeah, but stealing from kids? That's low even for us." Molly opened the van's back doors. "We're not stealing from kids. We're stealing from their rich parents who bought overpriced toys to compensate for their emotional unavailability." "That's quite a rationalization," Bill said. "I'm good at those." Molly climbed into the driver's seat. "Besides, those kids will forget about this party in a week. We need this score." "We need it because you lost our last haul playing poker." "That was an investment that didn't pan out," Molly said, starting the engine. "You bet everything on a pair of threes." "I had a feeling about that hand." Molly drove the van closer to the target house. "Are you in or out?" Bill sighed heavily. He looked at the birthday party, then at Molly, then down at his worn-out shoes with holes in both soles. "I need new shoes," he finally said. "That's the spirit." Molly parked across the street and killed the engine. "Now remember, quick and quiet. In and out like professionals." "Professional children's party thieves," Bill said. "My mother would be so proud." "Your mother is in prison for tax fraud." "Still prouder than she'd be about this."
created_at
"2026-02-08T15:19:54.818054+00:00"
Sunday, February 8th, 2026 at 3:19:54 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1543
moderation_latency
153
generation_time
14991
tokens_prompt
61
tokens_completion
618
native_tokens_prompt
75
native_tokens_completion
711
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01089
router
(null)
provider_responses
0
id
"13597609-3676-4f7e-a0b1-c7ee6f37ec4f"
status
200
is_byok
false
latency
1543
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770563979-4PeyeHKwAxAiXZh80yOU"
upstream_id
"13597609-3676-4f7e-a0b1-c7ee6f37ec4f"
total_cost
0.01089
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0002%
Matches word count
n/a
neededClean
false
words
466
1.1590%
Dialogue to Total Word Ratio
Ratio: 59.44%, Deviation: 9.44%
neededClean
false
wordsTotal
466
wordsDialogue
277
0.5796%