NC Bench
Overview
About
Tests
Models
Model
Claude Haiku 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYZARWYQE4AVS34WM9WHVKW
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
# The Birthday Bandit Blueprint Molly spread the photographs across the kitchen table, each one capturing a different angle of the Henderson residence. The afternoon light caught the rim of her coffee mug as she tapped a manicured finger against a shot of the backyard. "The party starts at two," she said, not looking up. "Security camera covers the front entrance, but look at this side gate." Bill leaned in, his weathered face creasing into something resembling a smile. At sixty-three, he'd forgotten more about stealing than most people would ever know. He studied the image intently, already mapping routes through his mind like a chess grandmaster. "How many kids are we talking?" he asked. "Forty. Maybe fifty if they invited the whole soccer team," Molly replied, finally meeting his eyes. "But here's the beautiful part—complete chaos. The magician arrives at 2:15, which means everyone's distracted." Bill whistled low. "Professional." "I've been casing this for three weeks," Molly continued, a note of pride in her voice. "The presents are wrapped and stacked in the garage until the gift-opening ceremony at 3 o'clock. That's our window." "Forty-five minutes," Bill mused, rubbing his stubbled chin. "What about the parents?" "Mostly in the house. Kitchen, dining room. The mother hired a catering company, so they're running around like headless chickens. The father will be in the backyard filming everything on his phone like a maniac." Bill chuckled. "They always do." Molly stood and walked to the window, gazing out at the parking lot below. "We go in through the side gate at 2:17. You head straight to the garage while I create a distraction—something minor, nothing that brings serious attention." "What kind of distraction?" Bill asked, though experience told him not to ask too many questions about Molly's methods. "Slip-and-slide malfunction. The thing's going to mysteriously spring a leak right in front of the magician's illusion box. Parents will be so busy making sure nobody drowns, they won't notice anything else." Bill grinned despite himself. "You're devious." "You're just now figuring that out?" Molly spun back around. "The van will be parked on Elm Street. You get the presents loaded—we're talking video games, electronics, those expensive gaming headsets—and I'll be right behind you. In and out in thirty-five minutes." Bill studied her carefully. "Why children's birthday presents? You've got government secrets, bank vaults, museum pieces—and you want to rob a suburban six-year-old's party?" Molly's expression softened for just a moment. A flicker of something almost human crossed her face before she buried it again. "Because nobody expects it," she said simply. "And because somewhere in Henderson's garage is a limited-edition retro Nintendo set that kid's been wanting since birth. His mother couldn't find one anywhere." Bill stood slowly, understanding dawning. He extended his hand across the table. "I'm in," he said. Molly shook it firmly.
created_at
"2026-02-08T15:53:46.993531+00:00"
Sunday, February 8th, 2026 at 3:53:46 PM GMT+00:00
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
884
moderation_latency
180
generation_time
8213
tokens_prompt
61
tokens_completion
647
native_tokens_prompt
75
native_tokens_completion
729
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00372
router
(null)
provider_responses
0
id
"f93d4406-a088-4747-9a4b-5f4a1a3678d5"
status
200
is_byok
false
latency
882
endpoint_id
"d000261c-5aeb-47d1-ac87-ad64dbe1cdb6"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1770566017-qg8YGvsdqt7k1aSSRC0o"
upstream_id
"f93d4406-a088-4747-9a4b-5f4a1a3678d5"
total_cost
0.00372
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
68.1022%
Matches word count
n/a
neededClean
false
words
486
0.0000%
Dialogue to Total Word Ratio
Ratio: 58.44%, Deviation: 28.44%
neededClean
false
wordsTotal
486
wordsDialogue
284
34.0511%