NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGZ07P39TYH5Q2MVCB6VGP5N
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her back against the brick wall of the community center, watching a parade of minivans discharge their precious cargo of screaming children clutching gift bags. Each kid carried at least one brightly wrapped present, and her practiced eye caught the telltale shapes of electronics, toys, and what appeared to be a very expensive microscope. She adjusted her sunglasses and checked her watch. The party started in ten minutes. Bill shuffled up beside her, inexplicably wearing a full clown costume complete with rainbow wig and oversized shoes that squeaked with every step. "What are you wearing?" Molly hissed. "Camouflage. I'll blend right in." "It's a science-themed birthday party, Bill. There are no clowns." Bill looked down at his polka-dotted jumpsuit with what might have been wounded pride, though it was hard to tell under the white face paint. "The internet said kids love clowns." "The internet lied." Molly pulled out her phone and reviewed the floor plan she'd memorized. The presents would be stacked on a table in the main activity room, right next to the emergency exit. Simple grab and go. She'd done harder jobs at actual mansions. This was child's play. Literally. The sound of children singing happy birthday drifted through the propped-open door. She gestured for Bill to follow and crept along the building's perimeter. They reached the side entrance just as a harried mother rushed past them, dragging a crying child toward the bathroom. "Now," Molly whispered. They slipped inside. The hallway smelled like pizza and cake frosting. Molly's stomach rumbled traitorously. She hadn't eaten since breakfast, too focused on planning this ridiculous job. But rent was due, and apparently the birthday boy's parents had invited half the school. The haul would be worth thousands. Bill's shoes squeaked against the linoleum. She shot him a glare that could have melted steel. "Sorry," he mouthed. They rounded the corner and Molly's heart sank. The activity room door was wide open, and she could see straight through to the present table. But between them and their prize stood at least forty children wearing lab coats and safety goggles, enthusiastically mixing vinegar and baking soda in plastic bottles. A woman in a "Science is Magic" t-shirt stood at the front, demonstrating proper volcano-making technique. "We need a distraction," Molly muttered. Bill's painted face split into a grin. Before she could stop him, he honked his nose twice and cartwheeled into the room, bells jingling from his costume. "Who wants balloon animals?" he bellowed. Every child turned to stare. The science teacher's mouth fell open. In the chaos of delighted shrieks and stampeding children, Molly saw her chance. She darted along the wall, keeping low, fingers already reaching for the gift bags. This was either going to be her easiest score or her most embarrassing arrest.
created_at
"2026-02-08T16:09:41.719119+00:00"
Sunday, February 8th, 2026 at 4:09:41 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1547
moderation_latency
139
generation_time
15626
tokens_prompt
61
tokens_completion
603
native_tokens_prompt
75
native_tokens_completion
696
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.010665
router
(null)
provider_responses
0
id
"0cfc4625-728b-4491-b599-5431eba9bfde"
status
200
is_byok
false
latency
1540
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770566965-4CMwlDCW5wGIM6L6IFZy"
upstream_id
"0cfc4625-728b-4491-b599-5431eba9bfde"
total_cost
0.010665
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0848%
Matches word count
n/a
neededClean
false
words
471
0.0000%
Dialogue to Total Word Ratio
Ratio: 8.92%, Deviation: 21.08%
neededClean
false
wordsTotal
471
wordsDialogue
42
0.0424%