NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9HZED1WKSKFJ5SY3NGAME
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.3s
Cost:
$0.010706
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
50.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of Bill's Pawn & Loan smelled like mildew and stale coffee, which felt appropriate given the caliber of the operation being planned there. Molly had the blueprint spread across the counter—if you could call a crayon drawing of a backyard a blueprint. It had been swiped from the mail, along with the invitation itself, which read "Tucker's Super Awesome 7th Birthday Bash" in bubble letters. "I'm just saying," Bill said, tapping the drawing with a greasy finger, "there's a bounce house here. A bounce house, Molly. That's a structural obstacle." "It's inflatable, Bill. You don't climb over it, you deflate it." "With what, my keen wit?" "With the pin I gave you yesterday. The one you probably lost." Bill patted his jacket pockets with the theatrical concern of a man who already knew the answer. Nothing. He shrugged, as if the universe had simply conspired against him again. "Found it," he said, producing it from behind his ear like a magician who'd forgotten his own trick. Molly closed her eyes and exhaled slowly, the way a woman does when she has chosen, against all reason, to continue a life of crime with this particular man. "The presents will be on a table near the garage. There's usually a banner. Cake's inside. Parents will be drunk on mimosas by two o'clock, which is our window." "Drunk parents at a kid's party. Truly we live in a golden age." "Focus, Bill." "I am focused. I'm extremely focused on the fact that we are about to rob seven-year-olds." "We're not robbing the kids. We're robbing the gifts. There's a difference." "Is there?" "Yes. The kids get cake. We get Legos. Everybody wins." Bill considered this with the seriousness of a man weighing a genuine moral quandary, then nodded, apparently satisfied. He rolled up the crayon map and slid it into his coat like a general pocketing battle plans. "What's our cover?" he asked. "Clowns." "Absolutely not." "It's foolproof. Nobody questions a clown at a birthday party. We walk in, we do a few balloon animals, we case the table, we load the car during the piñata." "I am not wearing the shoes." "You're wearing the shoes, Bill." "They're a safety hazard. I could break an ankle running from a swarm of eight-year-olds." "Then don't run. Walk briskly and menacingly, like a clown who means business." Bill stared at her for a long moment, then broke into the crooked grin that had gotten them out of exactly as many messes as it had gotten them into. "Fine," he said. "But if this goes sideways, I'm blaming the shoes." "It's not going sideways. We've done harder jobs than this." "We robbed a bank with a leaf blower once." "Exactly. Compared to that, this is a Tuesday." Outside, a car door slammed, and somewhere down the street an ice cream truck jingled its way past, oblivious to the two clowns-to-be quietly plotting the greatest birthday heist the neighborhood would never see coming.
created_at
"2026-07-01T07:33:36.342Z"
7/1/2026, 7:33:36 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1625
moderation_latency
(null)
generation_time
16231
tokens_prompt
77
tokens_completion
774
native_tokens_prompt
93
native_tokens_completion
1052
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010706
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01X4vcTXZdHTT7bfcCLfK8ux"
is_byok
false
latency
1625
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891216-hfIbaI8h04wkTrSxJ9na"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891216-7VdIARqWWLSpvEvZsqZ6"
upstream_id
"msg_01X4vcTXZdHTT7bfcCLfK8ux"
total_cost
0.010706
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
0.00%
Dialogue to Total Word Ratio
Ratio: 49.80%, Deviation: 19.80%
neededClean
false
wordsTotal
506
wordsDialogue
252
50.0000%