NC Bench
Run: 2026-07-01-01KWE9K95BQF4M1H7PX876XMPV
Run Details
Model: Claude Sonnet 5
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 17.6s
Cost: $0.011006
Provider: Amazon Bedrock
Evaluation summary
Score     Evaluator
81.27%    Matches word count
0.00%     Dialogue to Total Word Ratio
40.6364%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the safehouse—which was really just Bill's mother's garage—as Molly spread the blueprints across the workbench. She'd stolen them from the birthday planner's office three days ago, and they'd been worth every risk. "Okay," she said, tapping the paper. "The party starts at two. Bouncy castle here, cake table here, and the presents—the presents, Bill—are stacked on a table right by the sliding glass door." Bill leaned over the blueprints, chewing on a piece of licorice like it was a cigar. "So we're stealing birthday presents. From a six-year-old." "From Timmy Reginald Pierce the Third," Molly corrected. "Whose father owns half the diamond mines in this state and apparently thinks a six-year-old needs a present table roped off like it's the Louvre." "That's insane." "That's opportunity," Molly said. "You in or not?" Bill sighed, dropping the licorice. "I'm in. But I'm not wearing a clown costume again. Last time the balloons kept popping and everyone thought I was setting off firecrackers." "No clown costume. We're going as caterers." "Caterers." "Caterers. Nobody questions the guy carrying trays of chicken fingers, Bill. Nobody looks twice." He crossed his arms. "And how exactly do two caterers waltz out with an entire table of presents without anyone noticing?" Molly grinned, the grin that always meant trouble, and pulled out a second sheet of paper—a schematic of a rolling catering cart, the kind with three shelves and a false bottom. "We load the cart like we're clearing dishes. Bottom shelf, presents. Top shelf, dirty plates. We roll it right out the side door to the van." Bill studied it, impressed despite himself. "That's actually not bad." "Thank you." "But what about the kids? Six-year-olds notice things. They're basically tiny detectives with cake on their faces." "That's why we hit the table during the magician's act," Molly said. "Everyone's eyes are on the guy pulling a rabbit out of a hat. 
Nobody's watching a couple of caterers." "And if someone does notice?" Molly shrugged, tucking the blueprint back into her bag. "Then we improvise. Like always." "Your improvising got us locked in a walk-in freezer for four hours last month." "That walk-in freezer had a very nice cheese selection." Bill laughed despite himself, shaking his head. "Fine. Caterers. False-bottom cart. Magician distraction. What time do we need to be there?" "Noon. Setup starts at noon." "And the getaway?" "Van's parked two blocks down, disguised as a florist delivery truck." Molly zipped up her bag and slung it over her shoulder, already halfway to the door. "By two-thirty, we'll be halfway across town with a truck full of toys, and Timmy Pierce the Third will be crying into a bouncy castle." Bill grabbed his jacket, still shaking his head. "You know this is insane, right? We're stealing from a kid's birthday party." "Bill," Molly said, pausing at the door, "we're thieves. Insane is Tuesday."
created_at: "2026-07-01T07:34:19.064Z" (7/1/2026, 7:34:19 AM)
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1511
moderation_latency: (null)
generation_time: 17559
tokens_prompt: 77
tokens_completion: 790
native_tokens_prompt: 93
native_tokens_completion: 1082
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.011006
router: (null)
provider_responses:
  0:
    endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
    id: "msg_014gxmFU4Qoi1ELkB8ZufvPe"
    is_byok: false
    latency: 1511
    model_permaslug: "anthropic/claude-sonnet-5-20260630"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782891259-XhXqBEeLvTtslnZ81pqY"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782891259-Qc6gv0cqaQiX7vXAjLl4"
upstream_id: "msg_014gxmFU4Qoi1ELkB8ZufvPe"
total_cost: 0.011006
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result    Evaluator                     Details                           Meta Data
81.27%    Matches word count            n/a                               neededClean: false, words: 488
0.00%     Dialogue to Total Word Ratio  Ratio: 67.67%, Deviation: 17.67%  neededClean: false, wordsTotal: 498, wordsDialogue: 337
40.6364%  Overall
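
The ratio and deviation reported in the evaluation details can be reproduced from the word counts shown there. The following is a minimal sketch, not the evaluator's actual code: the function names are hypothetical, the 50% target is taken from the scenario description, and treating the overall figure as the plain mean of the two evaluator scores is an assumption that happens to fit the numbers on this page. How each evaluator turns its measurement into a score (81.27% and 0.00%) is not stated on the page and is not modeled here.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Absolute distance from the target ratio (50% per the scenario)."""
    return abs(ratio_pct - target_pct)


# Counts reported under "Dialogue to Total Word Ratio".
ratio = dialogue_ratio_pct(words_dialogue=337, words_total=498)
deviation = deviation_pct(ratio)

# Assumption: the overall score is the unweighted mean of the evaluator scores.
scores = [81.27, 0.00]
overall = sum(scores) / len(scores)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Overall (mean of displayed scores): {overall}%")
```

The mean of the displayed scores is 40.635%, slightly below the reported 40.6364%. The gap is consistent with the word-count score being rounded to two decimals for display, though the page does not confirm this.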