NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9W2YFVEGFWTJCNBQ6NK9E
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.6s
Cost:
$0.010356
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.6885%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse smelled like stale popcorn and regret, which Molly figured was about right given what they were about to do. She spread the blueprints across a card table that wobbled every time Bill leaned on it, which he did constantly, like a man testing whether gravity still worked. "A seven-year-old's birthday party," Bill said, not for the first time. "We're robbing a seven-year-old's birthday party." "We're robbing the presents at a seven-year-old's birthday party," Molly corrected. "There's a difference. The kid's not even the target. It's her uncle's watch collection, sitting in a box wrapped in dinosaur paper because he's an idiot who thinks that's clever camouflage." Bill crossed his arms. "You said this would be easy money." "It is easy money." "There will be a bouncy castle, Molly." "I'm aware there will be a bouncy castle." "There will be a clown." "I read the surveillance report same as you." Bill dragged a hand down his face like he was trying to wipe off the last twenty years of bad decisions. "Last job it was a bank vault with laser grids. This week it's balloon animals and a piñata. I don't know what happened to our careers." "Our careers happened to fall on hard times," Molly said, tapping the blueprint. "Focus. The present table is set up against the north wall of the backyard, under a tent. Fifty-plus gifts, and somewhere in that pile is a wristwatch worth more than this entire warehouse. We go in disguised as party staff, we grab the box, we're gone before the cake even comes out." "Disguised as what, exactly?" Molly reached under the table and produced two folded outfits with a flourish that suggested she was very proud of this part. Bill unfolded his and held it up. It was covered in sequins and had a rainbow embroidered across the chest. "I am not being a magician's assistant." "You're the magician. I'm the assistant. Read the room, Bill." "I don't do magic tricks." "You don't need to do magic tricks. You need to stand near the present table looking mysterious and occasionally pull a scarf out of somebody's ear. The real trick," she said, tapping her temple, "is getting close enough to swap the boxes without anyone noticing." Bill stared at the costume like it had personally wronged him. "If anyone from the old crew sees me in this—" "Nobody from the old crew is going to be at a seven-year-old's birthday party, Bill. That's sort of the beauty of it. Nobody's watching for us here. We're invisible. We're two adults in ridiculous outfits at a place literally designed to have adults in ridiculous outfits." He sighed, long and theatrical, the kind of sigh that had taken him thirty years of professional disappointment to perfect. "Fine. But I want it on record that if there is a game of pin-the-tail-on-the-donkey, I am not participating." "Noted," Molly said, already folding the blueprints away. "Wheels up at three. Wear comfortable shoes. It's a long walk to the getaway minivan."
created_at
"2026-07-01T07:39:07.611Z"
7/1/2026, 7:39:07 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1885
moderation_latency
(null)
generation_time
16531
tokens_prompt
77
tokens_completion
775
native_tokens_prompt
93
native_tokens_completion
1017
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010356
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01TACriocNQUh5TfSfT1gyig"
is_byok
false
latency
1885
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891547-ttugSKWZetOOCg2dcfj6"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891547-MR658zHjUAX6SRmJb4CV"
upstream_id
"msg_01TACriocNQUh5TfSfT1gyig"
total_cost
0.010356
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
99.38%
Matches word count
n/a
neededClean
false
words
505
0.00%
Dialogue to Total Word Ratio
Ratio: 66.67%, Deviation: 36.67%
neededClean
false
wordsTotal
519
wordsDialogue
346
49.6885%