NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYYF3N9596SX0PMC6JMAKGH
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her back against the fence, peering through a knot hole at the chaos of balloons and screaming six-year-olds beyond. "I can't believe we're doing this." "Twenty grand in gifts, Mol. That's what my intel says." Bill adjusted his unnecessarily large sunglasses. "Some tech mogul's kid. Probably got iPads in those gift bags." "Your intel is your nephew who delivers pizza to rich people." "Tommy's very observant." A particularly shrill shriek erupted from the bouncy castle. Molly winced. Children were everywhere, sticky-handed missiles of chaos wearing princess costumes and superhero capes. The present table sat near the back door, groaning under mountains of elaborate wrapping paper. "Okay, so what's the plan?" Molly asked. Bill pulled out a hand-drawn map on a napkin. "You go in through the side gate dressed as the pony." "I'm sorry, dressed as what?" "The pony. For pony rides." He gestured to a mangy costume crumpled in a duffel bag at their feet. "I called and canceled the real one." Molly stared at the costume. One of its button eyes hung by a thread. "That thing looks like it has diseases. Plural." "It's called commitment to the craft." "It's called a felony and a tetanus shot." Bill ignored her, continuing his briefing. "While you're ponying around, I'll slip in as the backup clown. Kids love clowns." "Kids are terrified of clowns. Everyone's terrified of clowns. There's been studies." "That's the beauty of it. They'll be too scared to notice me loading presents into my oversized clown duffel." Molly rubbed her temples. Last month it was stealing a dog from a pet show. The month before, replacing a city councilman's toupee with a ferret. She was really questioning her career choices. "Why can't we rob normal people?" she asked. "Adults. With insurance." "Where's the challenge in that?" "Where's the prison sentence in that, you mean." A woman's voice called out from the party. "Has anyone seen the entertainer? The pony should be here by now!" Bill shoved the costume at Molly. "That's your cue." "I hate you so much right now." "You'll thank me when we're splitting twenty grand." Molly grabbed the costume, examining its moth-eaten mane with disgust. "If I get kicked by a child, I'm keeping your share." "Deal. Now get in character." "Horses don't have character. They just stand there and poop." "Then you're perfect for the role." She punched his arm, but began pulling the costume on over her clothes. Bill applied his red nose and oversized shoes, transforming into something from a fever dream. "Ready?" he asked. Molly's voice came muffled from inside the horse head. "I can't see anything." "Perfect. That's method acting." They approached the gate as another chorus of "Happy Birthday" erupted from inside. Bill squared his shoulders. "Let's go steal some presents."
created_at
"2026-02-08T15:38:47.907988+00:00"
Sunday, February 8th, 2026 at 3:38:47 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1499
moderation_latency
194
generation_time
15641
tokens_prompt
61
tokens_completion
651
native_tokens_prompt
75
native_tokens_completion
748
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.011445
router
(null)
provider_responses
0
id
"66a61142-2d7a-4986-b5fe-3da393ead617"
status
200
is_byok
false
latency
1498
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770565111-gt0hncmd7BoqfH0jNe5i"
upstream_id
"66a61142-2d7a-4986-b5fe-3da393ead617"
total_cost
0.011445
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0304%
Matches word count
n/a
neededClean
false
words
470
92.1747%
Dialogue to Total Word Ratio
Ratio: 51.28%, Deviation: 1.28%
neededClean
false
wordsTotal
470
wordsDialogue
241
46.1025%