NC Bench
Run: 2026-07-01-01KWEAASSA7M92AX8BBZT2622B
Run Details
Model: Claude Sonnet 5
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 17.0s
Cost: $0.011086
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator | Details Link
6.09% | Matches word count | (details)
0.79% | Dialogue to Total Word Ratio | (details)
3.4424% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the windshield as Molly checked her makeup in the visor mirror, adjusting the ridiculous clown wig for the third time. "I still don't understand why I'm the clown," Bill said, tugging at his rainbow suspenders like they were personally insulting him. "Because you have the face for it," Molly said. "Naturally punchable. Kids will trust you immediately." "That's not comforting." "It wasn't meant to be comforting, Bill. It was meant to be true." Bill turned the van around the corner, easing past a row of manicured hedges. Somewhere beyond them, balloons bobbed above a backyard, and the tinny sound of a bounce house generator hummed like a dying appliance. "Remind me why we're robbing an eight-year-old's birthday party," Bill said. "This feels like a new low, even for us." "It's not the eight-year-old, it's the guest list," Molly said, pulling a laminated card from her jacket. "Every kid at that party belongs to somebody with money. Tech money. Crypto money. The kind of parents who buy their children solid gold Nintendo Switches because plastic ones feel 'inauthentic.'" "So we're stealing presents." "We're stealing an *opportunity*, Bill. Those gifts are going to be loaded with jewelry, watches, first-edition comics worth more than your car." "My car is worth four hundred dollars." "Exactly my point." Bill parked the van behind a dumpster two houses down, cutting the engine. Rain continued its assault, blurring the porch lights into halos. "What's my motivation supposed to be?" he asked, checking his reflection in the side mirror, grimacing at the red nose Molly had glued to his face an hour earlier. "As the clown, I mean. Method acting requires motivation." "Your motivation is: balloon animals, then bathroom break, then you slip out to the garage where the gift table's been relocated for 'safekeeping.'" "That's not motivation, that's blocking." "Bill, we don't have time for your community theater trauma right now." 
He sighed, popping open the glove compartment to retrieve a small crowbar disguised inside a foam pool noodle. "And you're the caterer?" "I'm the caterer's assistant. Low profile. Nobody remembers the assistant." "They'll remember if you drop a tray of cupcakes." "I'm not going to drop the cupcakes, Bill." "You dropped the champagne flutes at the Hendricks wedding." "That was different. Those flutes were slippery and also, frankly, poorly designed." Bill opened the van door, letting the rain spill in sideways. "Fifteen minutes. In and out. No improvising." "When have I ever improvised?" "The Hendricks wedding." "That was a *response* to the flutes, Bill, not improvisation." He shook his head, tucking the crowbar-noodle under his arm like some absurd talisman. "Just—stick to the plan. Balloon animals, bathroom, garage, gifts, gone." Molly smiled, grabbing her tray of fake cupcakes from the back seat. "Try not to terrify any children, clown boy." "No promises," Bill muttered, stepping into the storm.
created_at
"2026-07-01T07:47:09.748Z"
7/1/2026, 7:47:09 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1572
moderation_latency
(null)
generation_time
16931
tokens_prompt
77
tokens_completion
798
native_tokens_prompt
93
native_tokens_completion
1090
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011086
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01H4fgU6Hthy7PSjLSJZHt4w"
is_byok
false
latency
1572
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782892029-uwTOe100CthfoeHlYaBz"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782892029-8UvLegyBgqEZ5eVN3ENP"
upstream_id
"msg_01H4fgU6Hthy7PSjLSJZHt4w"
total_cost
0.011086
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result | Evaluator | Details | Meta Data
6.09% | Matches word count | n/a | neededClean: false, words: 477
0.79% | Dialogue to Total Word Ratio | Ratio: 59.83%, Deviation: 9.83% | neededClean: false, wordsTotal: 483, wordsDialogue: 289
3.4424% | Overall | |
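The ratio and deviation reported for the dialogue evaluator can be reproduced from the counts in the evaluation details (wordsDialogue 289, wordsTotal 483, target 50%). The sketch below is an assumption about how such a check might work, not NC Bench's actual evaluator: the function names `dialogue_ratio` and `deviation_from_target` are hypothetical, dialogue is assumed to be any text between straight double quotes, and words are assumed to be whitespace-separated. The mapping from a 9.83% deviation to the 0.79% score is not shown on this page and is not modeled here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio_percent).

    Assumption: dialogue is text enclosed in straight double quotes,
    and a word is any whitespace-separated token.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = 100.0 * dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio_percent: float, target_percent: float = 50.0) -> float:
    """Absolute distance, in percentage points, from the requested ratio."""
    return abs(ratio_percent - target_percent)


# Reproduce the reported figures from the counts listed in the evaluation details.
reported_ratio = 100.0 * 289 / 483
reported_deviation = deviation_from_target(reported_ratio)
print(f"Ratio: {reported_ratio:.2f}%, Deviation: {reported_deviation:.2f}%")
# prints "Ratio: 59.83%, Deviation: 9.83%"
```

Running `dialogue_ratio` on the model response itself may not return exactly 289 and 483, since the evaluator's tokenization is not documented on this page (the word-count evaluator reports 477 words for the same text). As a side observation, the mean of the two displayed scores, (6.09 + 0.79) / 2 = 3.44, agrees with the reported overall 3.4424% to two decimals, which suggests the overall figure may be an average of the unrounded scores; that is an inference, not something the page states.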