Model

Claude Sonnet 4.5

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2026-02-08-01KGYYF2Q1PZKS26QCTQECQKQN

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly pressed her ear against the door. "How many kids are we talking?" "Twelve. Maybe fifteen." "Armed?" "With cake forks, yeah." "Don't be cute, Bill. What's the security situation?" "Two parents in the kitchen, one dad manning the grill outside, and somebody's gran doing face painting." "Face painting? That's actually perfect cover." "You want me to get my face painted?" "Butterfly. Nobody suspects a butterfly." "I'm not doing a butterfly." "Fine. Tiger." "The presents are in the corner by the bouncy castle. We grab the haul during musical chairs." "Musical chairs is genius. Total chaos." "I've been casing this joint for three weeks, Molly." "Wait. Three weeks? This is a kid's birthday party, not Fort Knox." "The kid's parents are the Hendersons." "The Hendersons? The ones who—" "Exactly. Last year they gave out full-sized iPads in the goodie bags." "What's in the actual presents?" "Rumor is a limited edition Lego set. The Medieval Castle. Unopened." "That's four grand on eBay." "Minimum." "Okay. I'm in. But I'm the butterfly." "Fine." "When do we move?" "Pin the tail on the donkey. Blindfolds everywhere."
created_at"2026-02-08T15:38:39.050402+00:00" Sunday, February 8th, 2026 at 3:38:39 PM GMT+00:00
model"anthropic/claude-4.5-sonnet-20250929" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1399
moderation_latency230
generation_time8006
tokens_prompt61
tokens_completion277
native_tokens_prompt75
native_tokens_completion333
native_tokens_completion_images(null)
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"stop" 
usage0.00522
router(null)
provider_responses
0
id"b93fea6b-b950-4b31-b5aa-87626bf02a13" 
status200
is_byokfalse
latency1399
endpoint_id"d8bee848-f020-454a-b2be-4a7ce868f99f" 
provider_name"Amazon Bedrock" 
model_permaslug"anthropic/claude-4.5-sonnet-20250929" 
api_type"completions" 
id"gen-1770565110-hogxFEtpeakmmcMZuLnF" 
upstream_id"b93fea6b-b950-4b31-b5aa-87626bf02a13" 
total_cost0.00522
cache_discount(null)
upstream_inference_cost0
provider_name"Amazon Bedrock" 

Evaluation details

Result Evaluator Details Meta Data
35.0022% Matches word count n/a
neededCleanfalse
words182
86.6397% Dialogue to Total Word Ratio
Ratio: 96.15%, Deviation: 6.15%
neededCleanfalse
wordsTotal182
wordsDialogue175
60.8209%