NC Bench
Model: Claude Sonnet 4.5
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2026-02-08-01KGYWZR9WNYHWP32A3VWGYYDJ
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her back against the minivan and peeked around the corner at the suburban house. A giant inflatable unicorn bobbed in the front yard. "Are you seeing this?" she whispered. Bill adjusted his sunglasses. "I'm seeing a Tuesday afternoon goldmine, that's what I'm seeing." "The intel said there'd be at least forty kids." "Forty rich kids," Bill corrected. "You know what rich parents bring to birthday parties? iPads. Nintendo Switches. Designer sneakers that cost more than my rent." "Your rent is four hundred dollars." "Exactly my point." Molly glanced at her watch. "The clown arrives in ten minutes. That's our window." "I still think we should've gone with the clown disguise." "Bill, you're terrified of clowns." "I'm terrified of a lot of things. Doesn't mean I can't dress like them." He pulled out a wrinkled diagram from his pocket. "Okay, so presents are in the dining room, east side of the house. We go through the back gate, in through the patio doors, grab and dash." "What about the parents?" "They'll all be outside watching the clown. That's literally what they paid him for." "And the birthday girl?" "Birthday boy. Trevor. Age seven. Only child. Which means spoiled, which means expensive gifts." Molly nodded slowly. "What's our exit strategy?" "Same way we came in. I'll drive, you navigate. We'll be three neighborhoods away before anyone notices the present pile is gone." "This feels wrong." "It's extremely wrong. That's why we're criminals." "No, I mean morally wrong. They're kids, Bill." "Rich kids who already have everything. Trust me, little Trevor isn't going to suffer because he only gets to keep the bicycle and the telescope instead of also getting the robotic dog and whatever else is in those gift bags." "You really think there's a robotic dog?" "Molly, there's always a robotic dog." A car pulled into the driveway. A man in rainbow suspenders and a purple wig emerged carrying a small suitcase. "Clown's here," Bill said. "That's our cue." 
They crept along the fence line toward the back gate. Molly's heart hammered against her ribs. "Wait," she hissed. "What if there's a guard dog?" "What kind of family with a seven-year-old has a guard dog?" "A rich one?" "They have security systems, not dogs. Dogs are messy." Bill eased the gate open. The backyard was empty except for a bouncy castle and a snack table covered in juice boxes. "See? Clear sailing." "This is insane." "That's what you said about the wedding cake job, and we made eight grand." "We also got chased by a bride with a champagne bottle." "Details." Bill grinned. "Now come on. Those presents aren't going to steal themselves." They moved toward the patio doors, where through the glass, a mountain of wrapped boxes glittered in the afternoon sun.
created_at: "2026-02-08T15:12:55.471123+00:00" (Sunday, February 8th, 2026 at 3:12:55 PM GMT+00:00)
model: "anthropic/claude-4.5-sonnet-20250929"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1588
moderation_latency: 191
generation_time: 14716
tokens_prompt: 61
tokens_completion: 626
native_tokens_prompt: 75
native_tokens_completion: 735
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.01125
router: (null)
provider_responses:
  0:
    id: "fe6ede9f-ed56-4d3d-af77-26c124a80a24"
    status: 200
    is_byok: false
    latency: 1587
    endpoint_id: "d8bee848-f020-454a-b2be-4a7ce868f99f"
    provider_name: "Amazon Bedrock"
    model_permaslug: "anthropic/claude-4.5-sonnet-20250929"
api_type: "completions"
id: "gen-1770563559-T4vRVgXgBH4c8IKVPBL2"
upstream_id: "fe6ede9f-ed56-4d3d-af77-26c124a80a24"
total_cost: 0.01125
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0002% | Matches word count | n/a | neededClean: false; words: 466
92.5468% | Dialogue to Total Word Ratio | Ratio: 71.24%, Deviation: 1.24% | neededClean: false; wordsTotal: 466; wordsDialogue: 332
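The reported ratio and deviation follow directly from the two counts in the meta data (332 dialogue words out of 466 total, against the 70% target in the scenario). The sketch below reproduces that arithmetic. The word-counting helper is an assumption for illustration only: it treats whitespace-separated tokens as words and anything between straight double quotes as dialogue, which may differ from how the NC Bench evaluator actually tokenizes text. The function names are hypothetical.

```python
import re


def count_words(text: str) -> int:
    # Assumption: a "word" is any whitespace-separated token.
    return len(text.split())


def dialogue_word_counts(text: str) -> tuple[int, int]:
    # Assumption: dialogue is whatever sits between straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(count_words(span) for span in quoted_spans)
    return dialogue, count_words(text)


def ratio_and_deviation(dialogue: int, total: int, target: float = 70.0) -> tuple[float, float]:
    # Ratio as a percentage, and its absolute distance from the target percentage.
    ratio = 100.0 * dialogue / total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation meta data on this page.
ratio, deviation = ratio_and_deviation(332, 466)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 71.24%, Deviation: 1.24%"

# Tiny illustrative sample (not from the evaluator).
sample = 'Molly nodded. "What is our exit strategy?" Bill shrugged.'
print(dialogue_word_counts(sample))  # (5, 9)
```

How the deviation of 1.24% is mapped to the 92.5468% result is not stated on the page, so that step is left out of the sketch.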
46.2735%
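The page does not label this final figure. It coincides with the unweighted mean of the two evaluator results above, which suggests (but does not confirm) that it is the overall score for the run. A minimal check of that arithmetic, with the dictionary keys copied from the evaluator names:

```python
# Evaluator results as shown on this page, in percent.
scores = {
    "Matches word count": 0.0002,
    "Dialogue to Total Word Ratio": 92.5468,
}

# Assumption: the overall figure is a plain, unweighted average.
mean_score = sum(scores.values()) / len(scores)
print(f"{mean_score:.4f}%")  # prints "46.2735%"
```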