NC Bench
Model
Claude Haiku 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYWZ8SYX26P8R7P6W5WCE8W
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
# The Birthday Caper Molly adjusted her black turtleneck and peered through the chain-link fence at the suburban backyard. Multicolored streamers twisted in the wind like the intestines of some festive beast. A massive inflatable castle dominated the lawn, and beneath a striped canopy, mountains of wrapped presents sat unguarded. "This is insane," Bill whispered, crouching beside her. "We're stealing from children?" "We're not stealing from children. We're stealing from their parents' wallets." "That's the same thing, Molly." She turned to him with exasperation etched across her face. "Did you read the briefing or just skim the photos? These are billionaire kids. Their parents probably don't even know their names. One of those presents costs more than my first apartment." Bill squinted at a enormous box wrapped in holographic paper. "How much are we talking?" "The vintage Nintendo collection alone is worth seventy grand. There's a limited-edition drone, three gaming systems, and what I'm pretty sure is actual jewelry." Molly pulled out her phone, showing him detailed photos from the party planning website. "The mom posted everything. People are genuinely stupid." "So we just waltz in wearing what, party hats?" "Obviously." Molly produced two ridiculous cone-shaped hats from her bag. "We blend in. You're Uncle Bill. I'm Aunt Molly. We arrived early to 'help set up.' Nobody questions family at children's parties. They're too exhausted and caffeinated." Bill examined his hat skeptically. "This is the worst plan you've ever had." "Last week you suggested we rob a bank using nothing but a kazoo and confusion." "That would have worked!" Molly stood up, dusting off her knees. "Look, we go in, we grab the high-value items, we stuff them in the gift bags I've already prepared, and we leave through the side gate. In and out. Forty-five minutes, max." "What could possibly go wrong?" Bill stood beside her, already knowing the answer would be catastrophic. 
"The weather forecast predicted sunny skies, so nothing weather-related. The gate code is 8427, which I tested last week. The parents are apparently getting massages in the guest house because hiring entertainment 'wasn't their thing.'" Molly smiled, which Bill recognized as her signature expression before everything went sideways. "The only variable is whether little Timmy or Tiffany or whatever his name is actually cuts the cake at three o'clock like scheduled." Bill put on his ridiculous hat, already regretting his life choices. "When this goes wrong, and it will go wrong, I'm telling everyone you coerced me." "You're the one who needed money for your karate instructor obsession." "That's not—we're not discussing my personal life right now." Molly grabbed his arm and pulled him toward the fence opening. "Come on, Uncle Bill. Let's go make some terrible decisions at a child's birthday party."
created_at: "2026-02-08T15:12:32.581607+00:00" (Sunday, February 8th, 2026 at 3:12:32 PM GMT+00:00)
model: "anthropic/claude-4.5-haiku-20251001"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 594
moderation_latency: 184
generation_time: 7775
tokens_prompt: 61
tokens_completion: 609
native_tokens_prompt: 75
native_tokens_completion: 697
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00356
router: (null)
provider_responses:
  0:
    id: "4b20e403-c8d1-424c-b814-a84d5169c0fd"
    status: 200
    is_byok: false
    latency: 594
    endpoint_id: "d000261c-5aeb-47d1-ac87-ad64dbe1cdb6"
    provider_name: "Amazon Bedrock"
    model_permaslug: "anthropic/claude-4.5-haiku-20251001"
    api_type: "completions"
id: "gen-1770563543-m6kAOgpaRvcVKjXj5kdr"
upstream_id: "4b20e403-c8d1-424c-b814-a84d5169c0fd"
total_cost: 0.00356
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 464
0.0000% | Dialogue to Total Word Ratio | Ratio: 65.95%, Deviation: 35.95% | neededClean: false; wordsTotal: 464; wordsDialogue: 306
0.0000%
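The ratio and deviation reported above follow directly from the two word counts: 306 dialogue words out of 464 total is 65.95%, which is 35.95 percentage points away from the 30% target in the scenario. The Python sketch below reproduces that arithmetic. It is not the NC Bench evaluator, whose counting and scoring rules are not shown on this page. The names `dialogue_stats`, `ratio_from_counts`, and `DIALOGUE_RE` are hypothetical, and two things are assumptions: that "dialogue" means text between double quotes, and that words are split on whitespace.

```python
import re

# Assumption: dialogue is any text between double quotes (straight or curly).
# The real evaluator may define dialogue differently.
DIALOGUE_RE = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed definition of 'word')."""
    return len(text.split())


def ratio_from_counts(words_dialogue: int, words_total: int,
                      target_ratio: float = 0.30) -> dict:
    """Turn raw counts into the ratio and absolute deviation from the target."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio_pct": round(ratio * 100, 2),
        "deviation_pct": round(abs(ratio - target_ratio) * 100, 2),
    }


def dialogue_stats(text: str, target_ratio: float = 0.30) -> dict:
    """Count total and quoted words in a scene, then compute the ratio."""
    words_total = count_words(text)
    words_dialogue = sum(count_words(span) for span in DIALOGUE_RE.findall(text))
    return ratio_from_counts(words_dialogue, words_total, target_ratio)


if __name__ == "__main__":
    # Counts taken from the evaluation row above.
    print(ratio_from_counts(306, 464))  # ratio_pct 65.95, deviation_pct 35.95
    # A tiny hypothetical scene, to show the text-based path.
    print(dialogue_stats('Molly sighed. "We go in now," she said.'))
```

Running the text-based function on the full response may not return exactly 306 and 464, since the evaluator's tokenization and any cleaning step (see `neededClean`) are unknown.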