NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE924K3DWBST9Y80QAJRW22
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
17.4s
Cost:
$0.010806
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled like stale coffee and regret, which Bill figured was appropriate given their line of work. He'd parked it three houses down from the Hendersons' place, where a pink banner reading "Emma Turns 7!" flapped cheerfully in the breeze like it had no idea what was coming. Molly was studying the house through binoculars she definitely didn't need, since they were parked close enough to smell the birthday cake. "Twelve presents," she said, not looking up. "Maybe fourteen. I counted twice." "Fourteen presents for one seven-year-old? What's she getting, a pony?" "Ponies don't come wrapped, Bill." "Some do. If you're rich enough." Molly lowered the binoculars and gave him a look that suggested she questioned every decision that had led her to this particular partnership. "The plan is simple. I go in dressed as the entertainment. You wait by the side gate with the bag. We grab the presents during the piñata distraction, we're gone before anyone notices." "I still don't understand why I can't be the entertainment." "Because you can't do a balloon animal to save your life, and last time you tried to be a clown you made a child cry." "That child was going through some stuff." "You called her a 'gremlin' and told her Santa was watching her specifically." "He is, though. That's the whole point of Santa." Molly rubbed her temples, a gesture Bill recognized as the physical manifestation of her considering whether the fourteen presents were worth the headache of working with him. "Just stick to the plan. Side gate. Bag. Presents. Simple." "And if someone sees me?" "You tell them you're the pizza delivery guy." "We don't have a pizza." "Then you improvise, Bill. It's called improvise. It's a skill some people have." Bill considered this insult beneath a response and instead focused on smoothing down his ridiculous rented clown wig, the one Molly insisted made him "blend in" despite the fact that it was neon green and looked like it had personally offended several small children already that morning just by existing in their vicinity. "What if the piñata distraction doesn't work?" "It'll work. Nothing distracts children faster than the possibility of candy raining from the sky." "Unless the candy's the sugar-free kind. Then they riot." "We're stealing candy-adjacent presents, Bill, not orchestrating a revolution. Focus." He adjusted the wig one final time, feeling less like a criminal mastermind and more like a man about to be arrested at a child's birthday party, which, considering the circumstances, was an entirely reasonable fear to have. "Fine," he said. "Side gate, bag, presents. But if this goes wrong—" "It won't." "But if it does—" "Then we run, Bill. Like always." She was already climbing out of the van, straightening her ridiculous magician's cape with the confidence of someone who had never once considered that stealing from seven-year-olds might constitute a new low, even for them. "Now come on. Fourteen presents. And possibly a pony."
created_at
"2026-07-01T07:24:57.326Z"
7/1/2026, 7:24:57 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2920
moderation_latency
(null)
generation_time
17339
tokens_prompt
77
tokens_completion
859
native_tokens_prompt
93
native_tokens_completion
1062
native_tokens_completion_images
(null)
native_tokens_reasoning
62
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010806
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01NbUh45FcCx4dN4ArmPPSAP"
is_byok
false
latency
1650
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890697-7ht0c6B6Zm4TPdaxIJCs"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890697-aodetKMqSIZ3U6E6XMLm"
upstream_id
"msg_01NbUh45FcCx4dN4ArmPPSAP"
total_cost
0.010806
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
492
0.00%
Dialogue to Total Word Ratio
Ratio: 53.41%, Deviation: 23.41%
neededClean
false
wordsTotal
498
wordsDialogue
266
47.9934%