NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9YG2VX2N218PCQ41TJQP4
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.8s
Cost:
$0.011046
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1376%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the Rusty Spoon Diner as Molly spread the blueprints across the table, knocking over a ketchup bottle in the process. Bill caught it just before it rolled off the edge. "A five-year-old's birthday party," Bill said, staring at the crayon drawing Molly had somehow acquired. "You want us to rob a five-year-old's birthday party." "Not just any birthday party, Bill. The Hendersons throw the biggest bash in three counties. We're talking bounce houses, a professional clown, and presents. So many presents." Molly's eyes sparkled with the kind of greed usually reserved for bank vaults. "I heard little Tommy Henderson got a PlayStation 5 last year and never even opened the box." "That's just sad." "That's just opportunity." Bill rubbed his temples. He'd been doing this long enough to know when Molly had that look, the one that meant they were about to do something either brilliant or catastrophically stupid. Usually both. "So what's the plan? We just walk in wearing ski masks and start grabbing gift bags?" "Give me some credit." Molly pulled two costumes from a duffel bag. One was a slightly moth-eaten clown suit. The other was, inexplicably, a giant inflatable dinosaur. "Absolutely not." "Bill, nobody questions clowns at a birthday party. It's literally their job to be there." "And the dinosaur?" "Kids love dinosaurs." "I am not wearing an inflatable T-Rex costume to rob toddlers." "It's not robbing, it's *redistributing*. Rich kid gets fifty presents, opens three, forgets the rest exist by Tuesday. We're basically doing charity work." Bill stared at the dinosaur costume, then back at Molly. Somehow her logic always sounded reasonable at two in the morning after four cups of coffee. That should have worried him more than it did. "Fine. But I'm not doing balloon animals. Last time I gave a kid a balloon dog, it looked like a balloon crime scene." "Deal. You just guard the present table and look intimidating. Or, well, dinosaur-intimidating." "How exactly does one look intimidating in an inflatable dinosaur suit?" "You'll figure it out. Meanwhile, I'll be doing balloon animals, making small talk with parents, and slowly, subtly, moving presents to the van parked out back." "And when they notice half the presents are missing?" Molly grinned, tapping the blueprint where she'd drawn a small circle around the driveway. "By then, we'll be three towns over, selling a PS5 to some very confused pawn shop owner." Bill sighed, already picturing himself sweating inside rubber dinosaur skin while eight-year-olds screamed with joy around him. It wasn't the heist he'd imagined when he got into this life. Diamonds. Bank vaults. Maybe an art gallery. But Molly was already zipping up her clown wig case, humming "Happy Birthday" under her breath like it was a battle hymn. "For the record," Bill muttered, "if this goes sideways, I'm blaming you." "It always goes sideways, Bill. That's half the fun."
created_at
"2026-07-01T07:40:26.599Z"
7/1/2026, 7:40:26 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2080
moderation_latency
(null)
generation_time
16733
tokens_prompt
77
tokens_completion
783
native_tokens_prompt
93
native_tokens_completion
1086
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011046
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01JPJUjK5mVGJeNTpKbQ7QHR"
is_byok
false
latency
1680
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891626-uxbMqzfWIRbrZfyayYfe"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891626-k5bwY8EZSkBytA4ywzqF"
upstream_id
"msg_01JPJUjK5mVGJeNTpKbQ7QHR"
total_cost
0.011046
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
485
0.00%
Dialogue to Total Word Ratio
Ratio: 53.75%, Deviation: 23.75%
neededClean
false
wordsTotal
493
wordsDialogue
265
30.1376%