NC Bench
Overview
About
Tests
Models
Model
Claude Haiku 4.5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2026-02-08-01KGYWYD5ZMHKRAZE6XV9KAAP3
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"So we hit the birthday party at 2 PM Saturday," Molly whispered. "When all the kids are doing the piƱata thing?" Bill asked. "Exactly. Maximum chaos, minimum witnesses." "What's the security like?" "Amateur hour. One mom by the door, totally distracted by her phone." "Perfect. I'll slip through the back near the kitchen." "The presents are stacked in the living room. We need garbage bags, maybe four." "Four? How many presents are we talking about?" "It's a six-year-old's party, Bill. His parents are insane. I counted thirty-two wrapped boxes." "Thirty-two? What kind of haul are we looking at?" "Action figures, video games, that new talking robot thing." "The one that costs two hundred bucks?" "That's the one." "Molly, this is genius. Pure genius." "Don't celebrate yet. We need an exit strategy." "The side gate by the driveway?" "Too exposed. We go through the garage, out to the alley." "What about the family dog?" "Already handled. I'll have treats." "You thought of everything." "That's why I'm the brains here." "When do we start?" "Saturday at 1:45. Don't be late."
created_at
"2026-02-08T15:12:00.316362+00:00"
Sunday, February 8th, 2026 at 3:12:00 PM GMT+00:00
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
872
moderation_latency
193
generation_time
3730
tokens_prompt
61
tokens_completion
258
native_tokens_prompt
75
native_tokens_completion
313
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00164
router
(null)
provider_responses
0
id
"14b3720f-5d9e-4bce-ae6d-8adc05e236a9"
status
200
is_byok
false
latency
872
endpoint_id
"d000261c-5aeb-47d1-ac87-ad64dbe1cdb6"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1770563515-bAIss473DMzjqf9IaWFa"
upstream_id
"14b3720f-5d9e-4bce-ae6d-8adc05e236a9"
total_cost
0.00164
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
35.0022%
Matches word count
n/a
neededClean
false
words
182
69.0342%
Dialogue to Total Word Ratio
Ratio: 97.80%, Deviation: 7.80%
neededClean
false
wordsTotal
182
wordsDialogue
178
52.0182%