NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWEA4D28NFPCFW0PPHM42G0H
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.3s
Cost:
$0.010326
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
28.65%
Dialogue to Total Word Ratio
(details)
64.3172%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the storage unit that Bill insisted on calling "the office," and Molly was starting to wonder why she'd agreed to any of this. "So let me get this straight," she said, arms crossed. "We're robbing a seven-year-old's birthday party." "Turning eight," Bill said, holding up a finger like this mattered. "And we're not robbing the kid. We're robbing the presents. There's a legal distinction." "There is no legal distinction, Bill. There's just a moral one, and it's bad." "Molly. Molly. Think about it. Forty kids, minimum forty presents. You know what rich Riverside parents buy their kids these days? Gaming consoles. Tablets. That kid's getting a drone, I guarantee it." "You don't know that." "I do know that. I did recon." "You did recon on a children's birthday party." "I stood outside a Toys R Us for two hours, yes, that counts as recon." Molly rubbed her temples. "And how exactly are two grown adults supposed to walk into a birthday party unnoticed?" Bill grinned, and that grin had gotten them into trouble more times than she could count. "That's the beautiful part. We don't sneak in. We get invited." "Nobody invited us, Bill." "Not yet." He pulled out a folded piece of paper and slapped it on the table between them. It was a flyer. Bounce house rental. Party entertainment. "I called the number on this. Told them our clown canceled. Boom. We're the entertainment." "You're going to be a clown." "We're going to be clowns. Plural. It's a package deal, Molly, one clown looks suspicious, two clowns is a business." "I am not wearing a clown costume." "You're wearing a clown costume." "Bill." "The suits have pockets, Molly. Big ones. Big enough for, say, a Nintendo Switch. Or three." She stared at him for a long moment, and hated that the math was starting to make sense in her head. "And the parents? You think they're just going to let two clowns wander around unsupervised near a present table?" "They're going to be thrilled. Because I already told them we do balloon animals and a magic act." "You don't know magic." "I know one trick." "What trick?" "The disappearing trick." He spread his hands wide, grinning like an idiot. "Presents go in, presents don't come out." Molly groaned and put her face in her hands, but she was already reaching for the costume bag in the corner, and Bill knew he'd won the second she didn't argue back. "Fine," she said. "But if we get caught, I'm telling them the whole thing was your idea." "It was my idea." "I know. That's why I'm telling them." "Fair." Bill tossed her a red rubber nose. "Showtime's at three. Try not to look like you hate children." "I don't hate children. I hate this plan." "Same thing, really." He put on his own nose and checked his reflection in the dusty mirror leaning against the wall. "Let's go steal a birthday."
created_at
"2026-07-01T07:43:40.12Z"
7/1/2026, 7:43:40 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1680
moderation_latency
(null)
generation_time
16251
tokens_prompt
77
tokens_completion
765
native_tokens_prompt
93
native_tokens_completion
1014
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010326
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01P5Uf7VYvHhS5coUdR5d7d5"
is_byok
false
latency
1680
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891820-K30bHgBlYIt8IeJMVRgO"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891820-TeC8vOU9tjNofY6b5TCI"
upstream_id
"msg_01P5Uf7VYvHhS5coUdR5d7d5"
total_cost
0.010326
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
498
28.65%
Dialogue to Total Word Ratio
Ratio: 65.00%, Deviation: 5.00%
neededClean
false
wordsTotal
500
wordsDialogue
325
64.3172%