NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2026-02-08-01KGYYY2ACC3NXEQMFV2036MA1
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly cracked her knuckles. "So we're really doing this?" "Absolutely. Kids' parties are goldmines. Nobody expects it." "Bill, they're five-year-olds." "Exactly! Their parents buy expensive stuff. iPads, designer clothes, those creepy robot dogs." "What if we get caught?" "By who? A clown?" "There's always parents supervising." "Not during cake time. Everyone crowds around the birthday boy. That's our window." "How long do we have?" "Three minutes, maybe four if the candles don't light." "And we just grab everything?" "Everything. Wrapping paper and all." "What about thank-you cards? How will little Timmy know who gave him what?" "Little Joshua, actually. And that's his problem." "You're terrible." "I'm practical. You in or out?" "What's my cut?" "Sixty-forty." "Fifty-fifty or I walk." "Fine. Fifty-fifty." "When do we move?" "Saturday. Two o'clock. Jungle theme." "Should we wear camouflage?" "Molly, it's a metaphorical jungle." "Right. Obviously." "Bring the big duffel bags." "Got it. This is insane." "This is genius."
created_at
"2026-02-08T15:46:50.189716+00:00"
Sunday, February 8th, 2026 at 3:46:50 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1695
moderation_latency
199
generation_time
7842
tokens_prompt
61
tokens_completion
258
native_tokens_prompt
75
native_tokens_completion
324
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.005085
router
(null)
provider_responses
0
id
"15ccedcd-752e-469a-a6f9-32206a99c7c8"
status
200
is_byok
false
latency
1694
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770565601-9jqB2Z9WGeylti66MfBc"
upstream_id
"15ccedcd-752e-469a-a6f9-32206a99c7c8"
total_cost
0.005085
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
160
72.8763%
Dialogue to Total Word Ratio
Ratio: 97.50%, Deviation: 7.50%
neededClean
false
wordsTotal
160
wordsDialogue
156
36.4382%