NC Bench
Run: 2026-07-01-01KWE8VYVN0GGNBGCJBETGEQXC
Run Details
Model: Claude Sonnet 5 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 21.4s
Cost: $0.010486
Provider: Amazon Bedrock
Evaluation summary
Score      Evaluator
99.98%     Matches word count
24.56%     Dialogue to Total Word Ratio
62.2743%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The back room of Bill's apartment smelled like microwave burritos and desperation, which Molly figured was appropriate given what they were about to discuss. "A children's birthday party," she said, not quite a question. "A children's birthday party," Bill confirmed, spreading a hand-drawn map across the table like he was planning D-Day. "We're stealing presents from children." "We're liberating presents from children who have too many presents already. Look at this house, Molly. Look at it." He jabbed a finger at a photo of a McMansion with a bounce castle in the yard. "That's a bounce castle. For one child. One." "So we're Robin Hood now." "We're Robin Hood if Robin Hood needed rent money by Friday." Molly picked up the map, squinting at Bill's terrible handwriting. "What's this say? 'Clown entry, 3pm'?" "That's our way in." "You want to dress as clowns." "I want *you* to dress as a clown. I'm going to be the guy who delivers the bounce castle's replacement generator, because apparently rich people's bounce castles need generators, and generators need delivery guys, and delivery guys get to walk right past security like they own the place." "There's security?" "One guy. Retired cop. Currently very interested in the taco truck parked out front, which I may have hired to park out front." Molly set the map down. "So the plan is: I distract twenty sugar-crazed six-year-olds while dressed as a clown, and you steal a generator that doesn't need replacing, and somewhere in there we grab every present at a birthday party and just... leave." "Correct." "That's not a plan, Bill, that's a fever dream." "It's a fever dream with a getaway van." "Does the van have a name yet?" "Bertha." "Of course it's named Bertha." "She's earned it." Molly rubbed her temples. "What happens when the birthday kid notices all his presents are gone? He's going to cry. Loudly. In front of forty parents holding phones." "By then we'll be three streets over, changing plates." 
"You've thought about the plates but not about the part where we're stealing a child's birthday presents." "I've thought about that part plenty. That part is the *point*, Molly. Presents. Plural. Wrapped. Portable. Nobody counts them at a party, they just pile them on a table like an altar to capitalism, and we walk up and take the whole altar." "You make it sound noble." "I make it sound *efficient*." She looked at the clown costume draped over the chair — red nose already attached, big shoes waiting like a threat. "If I do this, and one single kid cries because of me specifically, I'm blaming you in the getaway van." "Bertha doesn't hold grudges." "Bertha's a van, Bill." "Bertha's family." Molly picked up the nose, turned it over in her hand, sighed the sigh of someone who'd made worse decisions for worse reasons. "Fine. Clown recon at three. But if there's a piñata, I'm taking a swing at your head first, just to get in the spirit." "Deal."
created_at: "2026-07-01T07:21:34.849Z" (7/1/2026, 7:21:34 AM)
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1850
moderation_latency: (null)
generation_time: 21374
tokens_prompt: 77
tokens_completion: 791
native_tokens_prompt: 93
native_tokens_completion: 1030
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.010486
router: (null)
provider_responses: 0
endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id: "msg_01BNH1S1ssw4bguxxg9vDsUE"
is_byok: false
latency: 1840
model_permaslug: "anthropic/claude-sonnet-5-20260630"
provider_name: "Amazon Bedrock"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782890494-MfLl7z3HVLcMxOvl95Yc"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782890494-4flAPaCRhLpTxIR8dCXC"
upstream_id: "msg_01BNH1S1ssw4bguxxg9vDsUE"
total_cost: 0.010486
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result: 99.98%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 498

Result: 24.56%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 75.30%, Deviation: 5.30%
Meta Data: neededClean: false, wordsTotal: 502, wordsDialogue: 378

Overall: 62.2743%
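The reported ratio and deviation can be reproduced from the word counts in the evaluator metadata. The sketch below is a minimal check, not the benchmark's own code. The function names are hypothetical, the 70% target is taken from the scenario text, and treating the overall score as the plain mean of the two evaluator scores is an assumption that happens to agree with the displayed values to two decimals. How the 5.30% deviation maps to the 24.56% evaluator score is not shown on this page, so it is not modeled here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = 0.70) -> float:
    """Absolute distance between the measured ratio and the target (70% per the scenario)."""
    return abs(ratio - target)


def overall_score(scores: list[float]) -> float:
    """ASSUMPTION: the overall score is the unweighted mean of the evaluator scores."""
    return sum(scores) / len(scores)


# Values copied from the evaluation details above.
ratio = dialogue_ratio(words_dialogue=378, words_total=502)
deviation = deviation_from_target(ratio)

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
print(f"Overall (assumed mean): {overall_score([99.98, 24.56]):.2f}%")
```

With the counts above this prints a ratio of 75.30% and a deviation of 5.30%, matching the Details column. The assumed mean comes to 62.27%, which agrees with the reported 62.2743% up to the rounding of the displayed evaluator scores.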