NC Bench
Run: 2026-07-01-01KWE9WWS8DY38D0R6747A2VGS
Run Details
Model: Claude Sonnet 5 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 17.8s
Cost: $0.011286
Provider: Amazon Bedrock
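The reported cost can be cross-checked against the native token counts in the run metadata (native_tokens_prompt 93, native_tokens_completion 1110). The sketch below is only an illustration: the per-token rates of $2 and $10 per million tokens are assumptions that happen to reproduce the reported figure, not prices stated on this page, and `estimate_cost_usd` is a hypothetical helper.

```python
def estimate_cost_usd(prompt_tokens: int, completion_tokens: int,
                      usd_per_m_prompt: int, usd_per_m_completion: int) -> float:
    """Estimate request cost from token counts and per-million-token rates."""
    # tokens * (USD per million tokens) yields micro-dollars; keep it in
    # integers until the final division to avoid floating-point drift.
    micro_usd = (prompt_tokens * usd_per_m_prompt
                 + completion_tokens * usd_per_m_completion)
    return micro_usd / 1_000_000


# Token counts come from the run metadata; the rates (2 and 10) are ASSUMED.
cost = estimate_cost_usd(93, 1110, usd_per_m_prompt=2, usd_per_m_completion=10)
print(f"${cost:.6f}")  # $0.011286
```

Under those assumed rates, nearly all of the cost comes from the 1110 completion tokens.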
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0001% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of Big Sal's Pawn Shop smelled like mothballs and bad decisions, which suited Molly just fine. She spread the blueprints across the table—crayon drawings, actually, stolen from the birthday invitation itself. "Bill, look at this," she said, tapping the paper. "Bouncy castle here. Pin-the-tail station here. And the present table, right by the cake." Bill squinted at the crayon rendering of what appeared to be a purple dinosaur wearing a party hat. "Are we robbing a six-year-old's birthday party or trying to decode a ransom note written by a toddler?" "Both, if we're lucky." "Molly, I want to point out that we are grown adults planning to steal presents from children." "I want to point out that those presents are worth four hundred dollars, easy. Little Timmy's got rich grandparents." "His name is Timmy?" "His name is Marcus, but I like Timmy better. Focus." Bill rubbed his temples. "How do we even get in? It's not like we can just walk through the front door." "That's exactly how we get in. We dress up." "Dress up as what?" "Clowns." Bill stared at her. "Absolutely not." "Bill—" "I did clown time once, Molly. Nineteen ninety-four. Bad birthday, worse balloon animals. I am not doing clown again." "Fine, fine. We go as caterers, then. Nobody questions the guy with the cheese platter." "Now that," Bill said, brightening, "I can work with. I make a very convincing cheese platter guy." "You made a convincing everything guy. That's your whole thing." "It's a gift." He paused. "So we walk in with cheese, walk out with presents. What's the catch?" "The catch," Molly said, "is Grandma Rose." "Who?" "Timmy's grandmother. She's ex-military. Ran a tight ship at three different birthday parties last year. Foiled a magician mid-trick because she thought his rabbit was 'suspicious.'" "The rabbit was probably fine." "The rabbit had a knife taped to its foot, Bill. She was right." Bill sat back, impressed despite himself. "Okay. So Grandma Rose is the wild card." 
"Grandma Rose is the whole deck." "What's the plan for her?" "Cake diversion. You distract her with cake, I grab the presents, we're out before she finishes her second slice." "And if she doesn't want cake?" "Everybody wants cake, Bill. It's cake." "What if she's diabetic?" Molly paused. "...We bring a backup pie." "A backup pie." "For contingencies." Bill looked down at the crayon map again, at the dinosaur, at the scribbled arrow pointing to "PRESENTS!!!" in glitter pen. "We're really doing this." "We're really doing this." "Robbing children." "Redistributing wealth from children who have too many Legos to us, who have none." Bill considered this. "When you put it that way, it almost sounds noble." "It's very noble. Now go find a cheese platter. We've got a party to crash."
created_at: "2026-07-01T07:39:34.067Z" (7/1/2026, 7:39:34 AM)
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2695
moderation_latency: (null)
generation_time: 17781
tokens_prompt: 77
tokens_completion: 812
native_tokens_prompt: 93
native_tokens_completion: 1110
native_tokens_completion_images: (null)
native_tokens_reasoning: 47
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.011286
router: (null)
provider_responses:
  0:
    endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
    id: "msg_01GT7tbZ2SyYZFQZBqiHrQoL"
    is_byok: false
    latency: 1862
    model_permaslug: "anthropic/claude-sonnet-5-20260630"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782891574-dELkrqtmEjXE9qChIdtF"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782891574-dS9V1LoPzmLD9OikOVK1"
upstream_id: "msg_01GT7tbZ2SyYZFQZBqiHrQoL"
total_cost: 0.011286
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 466
0.00% | Dialogue to Total Word Ratio | Ratio: 77.97%, Deviation: 27.97% | neededClean: false, wordsTotal: 472, wordsDialogue: 368
0.0001% | Overall
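The reported ratio and deviation follow directly from wordsDialogue (368) and wordsTotal (472) against the 50% target in the scenario. The sketch below reproduces that arithmetic. The `count_dialogue_words` helper is a hypothetical approximation (whitespace-split words inside straight double quotes) and is not necessarily how the benchmark's evaluator counts words. How the deviation maps to the 0.00% score is not shown on this page, so it is left out.

```python
import re


def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def count_dialogue_words(text: str) -> tuple[int, int]:
    """ASSUMED counting rule: words inside straight double quotes vs. all words."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    total = len(text.split())
    return dialogue, total


# Figures taken from the evaluation meta data.
ratio, deviation = dialogue_ratio(368, 472)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 77.97%, Deviation: 27.97%
```

At 472 total words, a 50% split would need about 236 words of dialogue; the response has 368.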