Model

Claude Sonnet 4.5

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2026-02-08-01KGYZTEZCKYH5VZ0MNRB5QS9A

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly leaned against the van. "So we're really doing this?" "Twenty grand in gift cards, Moll. Parents go crazy at these rich kid parties." "It's a six-year-old's birthday party, Bill." "Exactly. They won't suspect professional thieves." "Because we're not professionals. We're idiots casing a bounce house." Bill unfolded his blueprint. "You go in as the clown." "Absolutely not." "The magician then." "I'm not doing children's entertainment!" "Fine. You're the pony." "The what?" "Someone ordered a pony. You're the pony." "How am I supposed to be a pony?" "Horse costume. I got one cheap online." "Why do you have a horse costume?" "Don't ask questions. Point is, you trot in, I follow as your handler, we grab the present table during the piƱata, and boom. Out in five minutes." "This is the dumbest plan you've ever had." "Remember the yacht?" "Okay, second dumbest." "So you're in?" Molly sighed. "Where's the costume?" "Trunk. But Moll? The head's a little wonky." "Of course it is." "And you might be a unicorn." "I hate you so much."
created_at"2026-02-08T16:02:19.53271+00:00" Sunday, February 8th, 2026 at 4:02:19 PM GMT+00:00
model"anthropic/claude-4.5-sonnet-20250929" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1399
moderation_latency143
generation_time6570
tokens_prompt61
tokens_completion260
native_tokens_prompt75
native_tokens_completion323
native_tokens_completion_images(null)
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"stop" 
usage0.00507
router(null)
provider_responses
0
id"46d80a20-2f22-4d0e-b7ae-b7f90d1882ae" 
status200
is_byokfalse
latency1399
endpoint_id"d8bee848-f020-454a-b2be-4a7ce868f99f" 
provider_name"Amazon Bedrock" 
model_permaslug"anthropic/claude-4.5-sonnet-20250929" 
api_type"completions" 
id"gen-1770566532-agCTQ5nNcideOlhVZEfz" 
upstream_id"46d80a20-2f22-4d0e-b7ae-b7f90d1882ae" 
total_cost0.00507
cache_discount(null)
upstream_inference_cost0
provider_name"Amazon Bedrock" 

Evaluation details

Result Evaluator Details Meta Data
2.0116% Matches word count n/a
neededCleanfalse
words175
98.1147% Dialogue to Total Word Ratio
Ratio: 93.71%, Deviation: 3.71%
neededCleanfalse
wordsTotal175
wordsDialogue164
50.0632%