NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE8X544YAYD7733TTXJNEWG
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
17.2s
Cost:
$0.010396
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.8722%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly stood at the corner of Maple Street, peering through binoculars at the modest suburban house draped in balloons and streamers. A banner reading "Happy 8th Birthday, Timmy!" flapped in the breeze above the front porch. "You're kidding me," Bill said, leaning against their beat-up van, arms crossed. "This is the job? A kid's birthday party?" "Not just any kid's birthday party," Molly said, lowering the binoculars with the flair of someone unveiling a masterpiece. "Timmy Hargrove's birthday party. His grandmother is Eleanor Hargrove." Bill blinked. "The jewelry lady? The one who does those insane custom pieces?" "The very same. And according to my source, she got her only grandson a little something extra special this year. A necklace. Diamonds. The whole nine yards. Wrapped up nice and pretty, sitting under a pile of Tonka trucks and Lego sets." "So we're stealing presents from an eight-year-old." "We're stealing one present from an eight-year-old. The rest is just cover." Bill rubbed his temples. "This feels wrong on a spiritual level, Molly." "You said that about the church bake sale job too, and we walked away with four grand." "That was different. Nobody was wearing a cone hat and eating cake." Molly ignored him, pulling a clipboard from the van's console. "Here's the plan. I go in as the entertainment—magician, specifically. I've got the whole getup in the back. While I'm doing card tricks and pulling rabbits out of hats, you come in through the side gate dressed as catering staff. Nobody questions catering staff." "And then what? I just waltz up to the present table and grab a diamond necklace in a room full of parents?" "You grab the entire present table. Load it into a laundry cart, wheel it out back like you're taking dirty tablecloths to the van. By the time anyone notices the gifts are gone, we're three streets away." Bill considered this, chewing his lip. "What if a kid sees me?" "Kids don't see catering staff. You're furniture to them. Trust me." "And the magician thing—you actually know magic tricks?" Molly grinned and produced a playing card from thin air, flicking it between her fingers before making it vanish again. Bill's eyebrows rose despite himself. "Okay, fine, that's impressive. But if we get caught stealing a birthday present from a child, we're going to be the most hated criminals in state history." "We won't get caught," Molly said, already climbing into the back of the van to retrieve her sequined cape. "And besides, think of it this way—we're not stealing his birthday. We're just relocating one very shiny problem before it causes his grandmother a heart attack when she realizes what she actually spent." "That's not comforting." "It wasn't meant to be. Now put on the little vest. You look like a waiter who lost a bet, which is exactly what we need." Bill groaned, grabbing the folded catering uniform from the passenger seat. "The things I do for diamonds."
created_at
"2026-07-01T07:22:14.03Z"
7/1/2026, 7:22:14 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1682
moderation_latency
(null)
generation_time
17195
tokens_prompt
77
tokens_completion
774
native_tokens_prompt
93
native_tokens_completion
1021
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010396
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_0164DsPy3M4dB9NTRjmojVV5"
is_byok
false
latency
1681
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890534-TtKOtWIrFN01p21gR7Ip"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890534-Ai6BJTJuE8bjSfL835XD"
upstream_id
"msg_0164DsPy3M4dB9NTRjmojVV5"
total_cost
0.010396
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
496
0.00%
Dialogue to Total Word Ratio
Ratio: 74.45%, Deviation: 44.45%
neededClean
false
wordsTotal
501
wordsDialogue
373
49.8722%