NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE8WBSJVRRJ8JCXH5XWM0P0
Run Details
Model:
Claude Sonnet 5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
17.2s
Cost:
$0.010306
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
21.10%
Dialogue to Total Word Ratio
(details)
28.0501%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the van, and Bill's mustache twitched with the particular anxiety of a man about to do something deeply stupid. "Say it again," Bill said. "Slower this time." "We're stealing presents," Molly said. "From a six-year-old's birthday party." "That's what I thought you said." "Bill, we've robbed banks. We've robbed a *casino* once. You cried during that one, remember? Big tough guy, crying about—" "That was pepper spray, Molly, and you know it." "Sure it was." Molly checked her watch, a habit she'd picked up from every heist movie ever made, despite the fact that they were currently sitting in a Honda Odyssey with a cracked windshield and absolutely no schedule to keep. "Look, the intel is solid. Cousin works catering. Says there's at least forty gifts under that table. Nintendo Switches. Legos. A bike, Bill. A whole bike." "A bike is not worth going to prison over." "It's not just the bike. It's the *principle*." "What principle? The principle of stealing candy from babies?" "They're not babies, they're six. And it's not candy, it's merchandise. There's a difference." Bill stared out at the house across the street, where pink and purple balloons bobbed against the porch railing like they were mocking him personally. A banner reading HAPPY BIRTHDAY EMMA hung slightly crooked above the door. "What's our cover?" he asked, already regretting the question. "We're clowns." "We are not clowns." "We're clowns, Bill. I bought the noses. I bought the wigs. There's a whole bag in the back with squeaky shoes and everything." "I am not wearing squeaky shoes into a felony." "You wore a tutu into the Kaminski job." "That was different. That was for the diamonds." "This is for a bike, Bill. Priorities." Bill pinched the bridge of his nose, which was somehow already starting to hurt despite the actual rubber nose not yet being anywhere near it. "And when the birthday girl notices forty gifts have vanished into thin air? What then?" "We tell her the real Santa needed them for a more deserving child." "It's not Christmas." "Details, Bill. Details are for cowards." "Details are for people who don't want to go to prison." "Nobody's going to prison over a Paw Patrol backpack." Bill turned to look at her fully now, taking in the seriousness etched across her face, a seriousness entirely unbefitting the conversation they were having. "You've really thought this through." "I think about everything through," Molly said, reaching into the back seat and producing, with unsettling ceremony, two red rubber noses. "Now put this on. We've got a party to crash." Bill took the nose, stared at it like it might bite him, and sighed the sigh of a man who'd long since stopped questioning where his life had gone wrong. "If we get caught," he said, "I'm blaming you." "Obviously," Molly said, already halfway out the door.
created_at
"2026-07-01T07:21:48.093Z"
7/1/2026, 7:21:48 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1831
moderation_latency
(null)
generation_time
17194
tokens_prompt
77
tokens_completion
771
native_tokens_prompt
93
native_tokens_completion
1012
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010306
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01R8NvzNxU5NRWPhK65NYBGS"
is_byok
false
latency
1831
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890508-usrSC5RwODb0P6YjrYzC"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890508-tUdBNfWxd8pO1QzDflbg"
upstream_id
"msg_01R8NvzNxU5NRWPhK65NYBGS"
total_cost
0.010306
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
482
21.10%
Dialogue to Total Word Ratio
Ratio: 55.58%, Deviation: 5.58%
neededClean
false
wordsTotal
484
wordsDialogue
269
28.0501%