Model

Claude Sonnet 4.5

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2026-02-08-01KGYZBHX8F83ANZP32907HEG2

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly leaned against the van. "So we're really doing this?" "Kids' birthday party, tons of presents, easy money." Bill adjusted his fake mustache. "I feel like we've hit a new low." "Low? This is genius! Security at a six-year-old's party? What are they gonna have, a bouncy castle guard?" "The parents will be there, Bill." "Drunk on mimosas and complaining about their mortgages. Trust me." Bill pulled out a deflated clown costume. "You're Giggles, I'm Sparkles." "I'm not wearing that." "Fine, you're Sparkles, I'm Giggles." "That's not better! Why are we clowns?" "Because nobody questions a clown at a party. It's science." "It's idiotic." "Remember the museum job? The yacht heist? This is just like those, except smaller victims and cheaper champagne." "Those victims could fight back." "Have you met a six-year-old? They're vicious!" Bill tossed her the costume. "Now hurry up. Party starts in twenty minutes." "I hate you." "You'll love me when we're splitting thousands in Lego sets." "We're not fencing Legos, Bill." "Not with that attitude."
created_at"2026-02-08T15:54:11.570862+00:00" Sunday, February 8th, 2026 at 3:54:11 PM GMT+00:00
model"anthropic/claude-4.5-sonnet-20250929" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1330
moderation_latency168
generation_time7068
tokens_prompt61
tokens_completion260
native_tokens_prompt75
native_tokens_completion313
native_tokens_completion_images(null)
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"stop" 
usage0.00492
router(null)
provider_responses
0
id"95fd9007-14dc-460e-bb6b-1dc5fbfc10b1" 
status200
is_byokfalse
latency1330
endpoint_id"d8bee848-f020-454a-b2be-4a7ce868f99f" 
provider_name"Amazon Bedrock" 
model_permaslug"anthropic/claude-4.5-sonnet-20250929" 
api_type"completions" 
id"gen-1770566043-tvCFyHahREq6B9GWegRf" 
upstream_id"95fd9007-14dc-460e-bb6b-1dc5fbfc10b1" 
total_cost0.00492
cache_discount(null)
upstream_inference_cost0
provider_name"Amazon Bedrock" 

Evaluation details

Result Evaluator Details Meta Data
0.4920% Matches word count n/a
neededCleanfalse
words173
99.4567% Dialogue to Total Word Ratio
Ratio: 87.28%, Deviation: 2.72%
neededCleanfalse
wordsTotal173
wordsDialogue151
49.9744%