NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE986TEP3G2WTWC81XX5JCD
Run Details
Model:
Claude Sonnet 5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.5s
Cost:
$0.011036
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1376%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the windshield of the van, and Bill was still wearing that ridiculous fake mustache he'd insisted on buying from the party store. "Take it off," Molly said, not looking up from the blueprints spread across her lap. "You look like a walrus had a mid-life crisis." "It's for the disguise, Molly. We're supposed to be caterers." "Caterers don't wear mustaches that make them look like they're compensating for something." Bill huffed but peeled it off anyway, sticking it to the dashboard where it curled at the edges like a dying caterpillar. "Fine. But when this whole thing goes sideways because nobody recognizes us as legitimate party staff, I'm blaming the mustache decision on you." "The mustache decision was never mine. I have never once in my criminal career suggested facial hair as a disguise strategy." Outside, the Wilkinson residence glowed with pastel balloons and a bounce house shaped like a castle, utterly oblivious to the fact that two professional thieves were parked across the street casing the joint for a seven-year-old's birthday party. "Okay, focus," Molly said, tapping the blueprint. "The presents table is set up in the dining room, right next to the cake. Twenty-three kids means twenty-three presents, minimum, probably more because rich parents overcompensate." "That's a lot of Legos to steal from children, Molly." "We've stolen bigger things from worse people." "We stole a Fabergé egg from a Russian oligarch. This is a Frozen-themed birthday party." "Money is money, Bill. A stolen PlayStation is still a stolen PlayStation whether it belongs to a mobster or a kid named Tyler." Bill rubbed his temples, the way he always did when he was about to lose an argument he knew he'd already lost three minutes ago. "There's going to be a bouncy castle attendant. And a magician. Possibly a clown." "I've factored in the clown." "You've factored in the clown." "Clowns are notoriously unobservant. It's a documented phenomenon. Something about the face paint restricting peripheral vision." "That's not a real thing." "It's real enough for our purposes." The van fell silent except for the rain and the distant sound of children shrieking with the particular joy that only comes from unlimited cake and inflatable structures. Bill stared at the house, at the pink and gold banner reading HAPPY 7TH BIRTHDAY EMMA, and felt something that might have been the faintest whisper of conscience. "They're gonna be so sad, Molly." "They'll get new presents. Insurance exists for a reason." "That's remarkably cynical, even for you." "I contain multitudes, Bill. Now put the mustache back on. We've got a party to crash and roughly forty minutes before the sugar high wears off and this becomes considerably harder." Bill sighed, peeling the mustache off the dashboard and pressing it back above his lip with the resigned dignity of a man who had made peace with his terrible life choices roughly a decade ago. "For the record," he said, "I still think this is a new low." "Noted and ignored," Molly said, already reaching for the door handle. "Let's go steal a seven-year-old's birthday."
created_at
"2026-07-01T07:28:16.217Z"
7/1/2026, 7:28:16 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1615
moderation_latency
(null)
generation_time
16508
tokens_prompt
77
tokens_completion
829
native_tokens_prompt
93
native_tokens_completion
1085
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011036
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_011qaGEmn2C1wYM18vUTkA5z"
is_byok
false
latency
1615
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890896-xideKTU2HM4Wr2dW4lLz"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890896-2H3qA1qU3ZgvOQ6CIrLW"
upstream_id
"msg_011qaGEmn2C1wYM18vUTkA5z"
total_cost
0.011036
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
515
0.00%
Dialogue to Total Word Ratio
Ratio: 55.83%, Deviation: 25.83%
neededClean
false
wordsTotal
523
wordsDialogue
292
30.1376%