NC Bench
Model
Claude Haiku 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYXG0CATSVPFAJ8VGH6B0PX
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
# The Birthday Caper
Molly crouched behind the inflatable bounce castle, studying the gymnasium through her binoculars. The children's birthday party raged on before her, a cacophony of screaming six-year-olds and exhausted parents. Streamers hung from the rafters like limp spaghetti, and in the corner sat the treasure: a mountain of wrapped presents stacked haphazardly on a folding table.
Bill appeared beside her, breathing heavily, his black turtleneck riding up to reveal a rather unfortunate belly button lint situation. "How many security guards?" he whispered.
"Two. One by the cake, one by the bathrooms. Both distracted," Molly replied. "Though that clown keeps wandering around making balloon animals. That's our main obstacle."
"The clown? Really? We've robbed museums, Molly."
"Clowns are unpredictable, Bill. You know this."
He adjusted his fake mustache, which was already peeling off on one corner. "What's the plan?"
Molly pointed toward the gift table. "We go in through the south entrance. You create a diversion near the snack table. Grab some of those punch bowls, knock them over. Blame it on the birthday kid."
"That seems harsh," Bill said, watching a pigtailed child named Sophie brutalize a piñata with a plastic bat.
"Sophie doesn't even look sad. She looks vindictive. She can handle it." Molly checked her watch. "We have seven minutes during the present-opening ceremony. Once parents start filming, everyone's phones are out. Security gets lazy."
Bill scratched his chin. "What about the clown?"
"Leave him to me. I'll have him doing face paintings in the corner."
"How exactly?"
"I told him my face has an 'interesting bone structure.' He's already gotten artistic. You should go with it, Bill. Lean into the method acting of it all."
Bill looked at her like she'd suggested they rob an active volcano. "I'm not letting a clown paint my face."
"You're going to have to. It's part of the plan."
He sighed dramatically. "I became a criminal to avoid structured activities, Molly. Face painting feels incredibly structured."
She ignored him, continuing her reconnaissance. "Backup plan: if we get spotted, we hide in the ball pit. Nobody thinks to look in ball pits. It's the oldest trick in the book."
"That's not a trick, that's just... hiding in a children's ball pit. That's just sad."
"Do you have a better idea?"
Bill was quiet for a moment. "The presents are wrapped. We don't even know what we're stealing."
Molly lowered her binoculars and looked at him seriously. "That's the beauty of it, Bill. The mystery. The potential. That's what being a thief is really about."
They sat in silence, watching the chaos unfold before them. The clown twisted a balloon into what might have been a sword or possibly a very deformed giraffe. A child named Marcus knocked over a juice box directly onto another child's shoes, and nobody even reacted.
"Fine," Bill said finally. "I'll do the diversion. But I'm not happy about it."
"You're never happy, Bill."
"That's fair."
created_at
"2026-02-08T15:21:41.634887+00:00"
Sunday, February 8th, 2026 at 3:21:41 PM GMT+00:00
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
644
moderation_latency
153
generation_time
8307
tokens_prompt
61
tokens_completion
670
native_tokens_prompt
75
native_tokens_completion
768
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003915
router
(null)
provider_responses
0
id
"30d9eb77-1d5e-4709-9ae6-e325a3c421ba"
status
200
is_byok
false
latency
643
endpoint_id
"d000261c-5aeb-47d1-ac87-ad64dbe1cdb6"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1770564092-PMcth5tOepnFrVsifxjA"
upstream_id
"30d9eb77-1d5e-4709-9ae6-e325a3c421ba"
total_cost
0.003915
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
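The reported usage figure can be cross-checked against the native token counts in the metadata. The sketch below is illustrative only: the per-million-token rates (1.00 USD input, 5.00 USD output) are assumptions that do not appear on this page, as is the reading of generation_time as milliseconds. The helper names are hypothetical.

```python
# Values copied from the run metadata above.
native_tokens_prompt = 75
native_tokens_completion = 768
generation_time = 8307      # assumed to be milliseconds
reported_usage = 0.003915

# Assumed rates in USD per million tokens; not stated in the source.
INPUT_RATE_PER_MTOK = 1.00
OUTPUT_RATE_PER_MTOK = 5.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from token counts and the assumed rates."""
    total = (prompt_tokens * INPUT_RATE_PER_MTOK
             + completion_tokens * OUTPUT_RATE_PER_MTOK)
    return total / 1_000_000


def tokens_per_second(completion_tokens: int, elapsed_ms: float) -> float:
    """Completion throughput, given elapsed generation time in milliseconds."""
    return completion_tokens / (elapsed_ms / 1000)


estimated = estimate_cost(native_tokens_prompt, native_tokens_completion)
throughput = tokens_per_second(native_tokens_completion, generation_time)

print(f"estimated cost: {estimated:.6f} USD (reported usage: {reported_usage})")
print(f"throughput: {throughput:.1f} tokens/s")
```

Under these assumed rates the estimate agrees with the reported usage and total_cost values, which suggests billing follows the native token counts (75 and 768) and not the normalized ones (61 and 670).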
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 98.7124% | Matches word count | n/a | neededClean: false; words: 494 |
| 0.0000% | Dialogue to Total Word Ratio | Ratio: 57.09%, Deviation: 27.09% | neededClean: false; wordsTotal: 494; wordsDialogue: 282 |
49.3562%
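The dialogue ratio and deviation reported above follow from the two word counts and the 30% target in the scenario. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The quote-matching rule (text between straight double quotes counts as dialogue) and the function names are assumptions, not the benchmark's actual evaluator. The last check only notes that the final percentage is consistent with an unweighted mean of the two evaluator results.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: 30% dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (an assumed definition of dialogue)."""
    spoken = re.findall(r'"([^"]*)"', text)
    return sum(count_words(segment) for segment in spoken)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return the dialogue share and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation metadata: wordsDialogue and wordsTotal.
ratio, deviation = ratio_and_deviation(282, 494)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# A tiny hand-checkable sample for the counting helpers.
sample = 'Bill sighed. "What\'s the plan?" he asked. "We go in," Molly said.'
print(dialogue_words(sample), count_words(sample))

# The final figure matches the unweighted mean of the two results.
overall = (98.7124 + 0.0) / 2
print(f"{overall:.4f}%")
```

How the evaluator maps a 27.09-point deviation to a 0.0000% result is not stated on the page, so the sketch stops at the ratio and deviation.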