NC Bench
Run: 2026-07-01-01KWE9SBKHXYGN1CSKEZ8CB2AJ
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
18.1s
Cost:
$0.011866
Provider:
Amazon Bedrock
Evaluation summary
Score    Evaluator
0.03%    Matches word count
0.00%    Dialogue to Total Word Ratio
0.0152%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse smelled like stale popcorn and desperation, which suited Molly and Bill just fine, since that was more or less their brand. Molly had the blueprints spread across an overturned crate—if you could call a crayon drawing of a house with "Timmy's Party!!!" scrawled in glitter glue a blueprint. "This is it?" Bill said, squinting. "This is our intel?" "Timmy's mom posted it on Facebook. Public profile. Rookie mistake on her part." "Molly, it's a six-year-old's birthday party." "With presents, Bill. A mountain of them. Xboxes, Legos, probably a pony if the rich aunt came through." Bill rubbed his temples. "We're professional thieves. We rob banks. Casinos. That one Ferris wheel job—" "Which failed." "Only because you can't reload a crossbow while spinning." "The point," Molly said, "is that nobody guards a six-year-old's presents. There's no vault. No lasers. Just a bouncy castle and maybe a magician named Gary." "There's always a Gary." Bill sighed, sitting on an overturned bucket. "Fine. Walk me through it." Molly grinned and tapped the crayon drawing. "We go in disguised as clowns." "Absolutely not." "Bill." "I said no clowns. Last time I wore a clown suit, I got chased out of a Chuck E. Cheese by a woman named Denise wielding a fire extinguisher." "That's because you improvised a balloon animal that looked like a weapon." "It was a giraffe!" "It looked like a musket, Bill." He crossed his arms. "Fine. No clowns. What's plan B?" "We pose as the entertainment. I'll be the face painter. You be the magician." "I don't know any magic." "You don't need to. Kids believe anything. Just say 'abracadabra' and shove a present up your sleeve." "That's not magic, that's just theft with flair." "Exactly! That's our whole brand, Bill, keep up." He considered this, chewing on the idea like stale gum. "Okay, say we get in. Face paint, fake magic, the whole bit. How do we get a mountain of presents out without twenty parents tackling us?" 
Molly's eyes sparkled, which was never a good sign. "Piñata." "…Come again?" "We build a piñata. Huge. Shaped like a present. We hide inside it, and when it's time to leave, the 'piñata' just… rolls out to the truck." "You want us to hide inside a piñata." "Yes." "That children will hit with a stick." "That's the risky part, sure." Bill stared at her for a long moment. "One time. One time I said 'let's just get real jobs,' and you laughed for eleven minutes." "Because it was funny, Bill." "This is worse than the Ferris wheel." "Nothing is worse than the Ferris wheel." He grabbed his jacket, muttering, already defeated. "If I get hit in the face with a Nerf bat, I'm retiring." "Deal," Molly said, rolling up the crayon blueprint. "Now, let's go steal Timmy's childhood."
created_at
"2026-07-01T07:37:38.174Z"
7/1/2026, 7:37:38 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4139
moderation_latency
(null)
generation_time
18086
tokens_prompt
77
tokens_completion
814
native_tokens_prompt
93
native_tokens_completion
1168
native_tokens_completion_images
(null)
native_tokens_reasoning
59
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011866
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_013HheWZLtFsaJk5pHi6syGF"
is_byok
false
latency
1906
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891458-rcIdHUoqEI4HjYIqethQ"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891458-fTvLKszDeyfBE03SmbvJ"
upstream_id
"msg_013HheWZLtFsaJk5pHi6syGF"
total_cost
0.011866
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result: 0.03%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean = false, words = 470

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 75.32%, Deviation: 25.32%
Meta Data: neededClean = false, wordsTotal = 474, wordsDialogue = 357

Overall: 0.0152%
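The ratio and deviation reported for the "Dialogue to Total Word Ratio" evaluator can be reproduced from the two word counts in its metadata. The sketch below is illustrative only. The function names are hypothetical. The 50% target is taken from the scenario text. The quote-based counter is an assumption about how dialogue words might be counted, not NC Bench's actual evaluator, so it is not expected to reproduce the 357 and 474 counts exactly. How the 0.00% score is derived from the deviation is not stated on this page and is not modeled here.

```python
import re


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return the share of words that are dialogue, as a percentage."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Return the absolute distance from the target ratio, in percentage points."""
    return abs(ratio_pct - target_pct)


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Hypothetical counter (assumption, not the benchmark's method).

    Treats everything between straight double quotes as dialogue and
    splits on whitespace to count words.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    words_total = len(text.split())
    return words_dialogue, words_total


# Counts taken from the evaluator metadata on this page.
ratio = dialogue_ratio(words_dialogue=357, words_total=474)
deviation = deviation_from_target(ratio)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 75.32%, Deviation: 25.32%"
```

With these counts the response is about 25 percentage points over the requested 50% dialogue share, which is consistent with the "Details" line for that evaluator.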