NC Bench
Run: 2026-07-01-01KWE9J0DW2PF5M2PJKXJB4AHT
Run Details
Model: Claude Sonnet 5
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 14.8s
Cost: $0.009766
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
9.61% | Matches word count
0.01% | Dialogue to Total Word Ratio
4.8091% | Overall
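The Overall figure is consistent with an unweighted mean of the two evaluator scores, although the page does not say how scores are aggregated. The sketch below checks that reading; the function name `overall_score` and the averaging rule itself are assumptions, not something the benchmark documents here.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores.

    ASSUMPTION: the page does not state the aggregation rule;
    a plain average is simply the rule that fits the numbers shown.
    """
    return sum(scores) / len(scores)


# Evaluator scores as displayed (already rounded), in percent.
displayed_scores = [9.61, 0.01]

print(f"{overall_score(displayed_scores):.2f}%")  # prints 4.81%
```

The mean of the displayed values is 4.81%, which agrees with the reported 4.8091% to within the rounding of the two inputs.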
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the van, and Bill checked his watch for the fourth time in as many minutes. Molly sat beside him, tapping a pen against a crumpled piece of paper covered in scribbled diagrams that looked more like a toddler's drawing than an actual heist plan. "So let me get this straight," Bill said, rubbing his temples. "The plan is to break into a seven-year-old's birthday party and steal the presents." "That's the plan," Molly said, grinning. "That's not a plan, Molly. That's a crime against decency." "It's a crime against decency with excellent payout potential. Do you know how much stuff rich kids get for their birthdays these days? I saw the invitation list. There's a senator's kid on there. A senator, Bill." "So we're stealing from senators now." "We're stealing from senators' kids. Slight difference." Bill groaned and slumped back into his seat. "There's going to be, what, thirty kids there? Parents? Clowns? Face painters? How exactly do you propose we walk in and walk out with a truckload of presents without anyone noticing?" Molly held up the paper triumphantly, jabbing a finger at a badly drawn rectangle labeled "TABLE." "We dress as the entertainment. I already called and canceled the real magician. Told him there was a scheduling conflict. Little birthday boy gets us instead." "You're the magician?" "I'm the magician's assistant. You're the magician." "I can't do magic, Molly." "You don't need to do magic, Bill. You need to distract thirty children while I load the presents into the bounce house." "The bounce house." "It deflates. It's basically a duffel bag with balloons on it." Bill stared at her, blinking slowly, as though trying to process whether this was actually happening or some fever dream brought on by too much bad diner coffee. "And if someone notices the presents are missing before the bounce house deflates?" "They won't. Kids don't count presents until the parents make them write thank-you cards, and by then we'll be three states away." "This is insane." "This is genius." "It's insane genius." "See, now you're catching on." Molly reached into the back seat and pulled out a top hat, slightly dented, and a cape that had clearly seen better decades. She tossed them onto Bill's lap. "Chin up, big guy. You're about to become the greatest magician a seven-year-old's birthday has ever seen." Bill turned the hat over in his hands, sighing. "If we get caught, I want it on record that this was your idea." "Noted. Now put on the cape. We've got a driveway full of wrapped boxes waiting for us, and a very sad clown who thinks he still has a gig today." Bill pulled the cape over his shoulders, muttering under his breath the entire time, while Molly checked her watch, smiled, and pulled the van out into the rain.
created_at: "2026-07-01T07:33:37.35Z" (7/1/2026, 7:33:37 AM)
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1643
moderation_latency: (null)
generation_time: 14709
tokens_prompt: 77
tokens_completion: 740
native_tokens_prompt: 93
native_tokens_completion: 958
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.009766
router: (null)
provider_responses:
  0:
    endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
    id: "msg_01DALhXoVdj6TUTvjiau3dwi"
    is_byok: false
    latency: 1629
    model_permaslug: "anthropic/claude-sonnet-5-20260630"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782891217-drwA25YoEjkspMyLKfbX"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782891217-9L1V2aooPYbxqrnFNEpH"
upstream_id: "msg_01DALhXoVdj6TUTvjiau3dwi"
total_cost: 0.009766
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
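The token counts, generation time, and cost in the metadata above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the per-million-token rates are assumptions chosen because they reproduce the reported total, not prices stated on this page, and the helper names are hypothetical.

```python
# Values copied from the run metadata.
native_tokens_prompt = 93
native_tokens_completion = 958
generation_time_ms = 14709
total_cost = 0.009766

# ASSUMED rates in USD per million native tokens. They are not given on
# the page; they are simply values that reproduce the reported cost.
ASSUMED_INPUT_RATE = 2.00
ASSUMED_OUTPUT_RATE = 10.00


def estimate_cost(prompt_tokens, completion_tokens, input_rate, output_rate):
    """Cost in USD for per-million-token input and output rates."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


def tokens_per_second(completion_tokens, generation_ms):
    """Completion tokens generated per second of generation time."""
    return completion_tokens / (generation_ms / 1000)


estimated = estimate_cost(
    native_tokens_prompt, native_tokens_completion,
    ASSUMED_INPUT_RATE, ASSUMED_OUTPUT_RATE,
)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)

print(f"estimated cost: ${estimated:.6f} (reported: ${total_cost:.6f})")
print(f"throughput: {throughput:.1f} native completion tokens/s")
```

Under these assumed rates the estimate matches the reported `total_cost`, and the run works out to roughly 65 native completion tokens per second.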
Evaluation details
Result | Evaluator | Details | Meta Data
9.61% | Matches word count | n/a | neededClean: false, words: 478
0.01% | Dialogue to Total Word Ratio | Ratio: 63.56%, Deviation: 13.56% | neededClean: false, wordsTotal: 483, wordsDialogue: 307
4.8091% | Overall
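The reported ratio and deviation follow directly from the two word counts and the 50% target in the scenario. The sketch below reproduces them and adds a naive dialogue counter for illustration. The counter (words between straight double quotes) is an assumption, not the evaluator's actual method, and it would misread quoted non-dialogue such as the "TABLE." label in the response.

```python
import re

TARGET_RATIO = 0.50  # the scenario asks for 50% dialogue


def dialogue_ratio(words_dialogue, words_total):
    """Share of all words that are dialogue."""
    return words_dialogue / words_total


def count_dialogue_words(text):
    """Count whitespace-separated words inside straight double quotes.

    ASSUMPTION: a simplified stand-in for the evaluator's counting rule,
    which the page does not describe.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in quoted_spans)


# Reproduce the reported figures from the evaluator's own counts.
ratio = dialogue_ratio(307, 483)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints: Ratio: 63.56%, Deviation: 13.56%

# Usage of the naive counter on a short sample.
sample = 'Bill sighed. "This is insane." Molly grinned. "This is genius."'
print(count_dialogue_words(sample), len(sample.split()))  # prints: 6 10
```

How the 13.56-point deviation maps to the 0.01% score is not shown on the page, so the sketch stops at the deviation. The two evaluators also report different totals (478 and 483 words), which suggests they tokenize the text differently.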