NC Bench
Model
Claude Haiku 4.5
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2026-02-08-01KGYY17WD7PQH4NZH1CQ7VQXS
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the inflatable bounce castle, studying the gymnasium through binoculars. A banner reading "Timmy's 7th Birthday Extravaganza" hung from the bleachers. Mountains of wrapped presents towered near the refreshment table like a glittering fortress. "This is insane," Bill whispered, adjusting his fake mustache. The thing kept slipping down his sweaty face. "Quiet. I'm counting the presents." They'd stolen from penthouses, museums, and high-security vaults. But nothing compared to the adrenaline rush of infiltrating a children's birthday party. The risk was enormous. A single child could recognize them as not belonging, and suddenly they'd be swarmed by seven-year-olds in Spider-Man pajamas. "How do we even carry all this?" Bill gestured at the haul. "We go in as party entertainers." Bill stared at her incredulously. "You want us to what?" "Clown and magician routine. Nobody suspects entertainers." The gymnasium doors burst open as parents began arriving with their offspring. Screaming erupted immediately. Bill's eye twitched. This was either their greatest heist or their most humiliating disaster. He straightened his oversized bow tie and followed Molly toward the chaos, hoping neither of them would accidentally do an actual magic trick.
created_at
"2026-02-08T15:31:02.157448+00:00"
Sunday, February 8th, 2026 at 3:31:02 PM GMT+00:00
model
"anthropic/claude-4.5-haiku-20251001"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
782
moderation_latency
231
generation_time
3982
tokens_prompt
61
tokens_completion
263
native_tokens_prompt
75
native_tokens_completion
313
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00164
router
(null)
provider_responses
0
id
"4fb2e109-eece-4f4c-b276-89dd7d6a65fb"
status
200
is_byok
false
latency
782
endpoint_id
"d000261c-5aeb-47d1-ac87-ad64dbe1cdb6"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-haiku-20251001"
api_type
"completions"
id
"gen-1770564657-kpN8QqjqogQnl9mj3arY"
upstream_id
"4fb2e109-eece-4f4c-b276-89dd7d6a65fb"
total_cost
0.00164
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
97.6276% | Matches word count | n/a | neededClean: false; words: 193
49.2925% | Dialogue to Total Word Ratio | Ratio: 19.17%, Deviation: 9.17% | neededClean: false; wordsTotal: 193; wordsDialogue: 37
73.4601%
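The reported ratio and deviation can be reproduced from the wordsDialogue and wordsTotal counts. The sketch below is an illustration only, not NC Bench's actual evaluator: the function names are hypothetical, and it assumes dialogue means any text inside straight double quotes, words are whitespace-separated, and the deviation is the absolute gap from the 10% target in the scenario. How the evaluator maps a deviation to a percentage result is not shown on this page, so that step is left out.

```python
import re


def dialogue_stats(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words) for a passage.

    Assumption: every span between straight double quotes counts as dialogue.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    return dialogue, total


def ratio_and_deviation(dialogue: int, total: int, target_pct: float = 10.0) -> tuple[float, float]:
    """Return the dialogue share in percent and its absolute gap from the target."""
    ratio = 100.0 * dialogue / total
    return ratio, abs(ratio - target_pct)


# Counts reported for this run: wordsDialogue = 37, wordsTotal = 193.
ratio, deviation = ratio_and_deviation(37, 193)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# Ratio: 19.17%, Deviation: 9.17%

# The final figure is consistent with the unweighted mean of the two results.
overall = (97.6276 + 49.2925) / 2
print(round(overall, 2))  # 73.46
```

Under the same quote-counting assumption, the six spoken lines in the response come to 33 words and the quoted banner text ("Timmy's 7th Birthday Extravaganza") adds 4, which matches the reported wordsDialogue of 37. This suggests the banner was counted as dialogue, although the page does not confirm it.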