NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWEA9GVDHK581YFNYAJJTH2Y
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
18.1s
Cost:
$0.011156
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
98.17%
Dialogue to Total Word Ratio
(details)
70.7724%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of the Rusty Spoon smelled like old beer and desperation, which suited Molly just fine. She spread the blueprint across the table—a crayon drawing of a house that she'd stolen from the birthday kid's own refrigerator. "Bill, look at this. Bounce house here. Present table there. Cake in the middle like some kind of sugary fortress." "You stole a six-year-old's artwork." "I stole *intel*, Bill. There's a difference." Bill leaned back in his chair, arms crossed. "We're robbing a birthday party. Timmy Fitzgerald's seventh birthday party. I want that on record. When they write the true crime podcast about us, I want it noted that I said this was a bad idea." "They're not gonna write a podcast about a bounce house heist." "They will if we pull it off." "When did you get so philosophical about grand larceny?" "Since we started stealing from children, Molly!" Molly tapped the drawing where a stick figure clown stood next to what was either a dog or a very unfortunate horse. "The presents alone are worth four grand. Timmy's family is loaded. Grandma sends checks in birthday cards, Bill. Actual checks." "You're planning to rob a seven-year-old's grandmother." "I'm planning to rob a seven-year-old's grandmother's *money*, which is sitting in an envelope, unguarded, next to a pin the tail on the donkey game." Bill rubbed his temples. "How do we even get in? It's a birthday party. There's gonna be, what, twenty kids? Parents? A clown?" "That's where you come in." Molly slid a garment bag across the table. He unzipped it slowly, like it might bite him. Inside was a rainbow wig and a red foam nose. "No." "Bill." "Absolutely not." "You're the clown, Bill. You're Chuckles the Magnificent. I already told the mom you'd be here at two." "You booked me as entertainment?" "I booked *us* as entertainment. I'm your lovely assistant, Sparkles. While you're doing balloon animals and making children cry with your card tricks—" "My card tricks are excellent—" "—I'm gonna be in the present room, loading gifts into the trunk of the car through the side window." Bill stared at the wig like it owed him money. "This is insane. We rob banks, Molly. We rob jewelry stores. We do not rob a party where there is a pin the tail on the donkey game." "The banks have security guards. The jewelry stores have cameras. You know what Timmy's party has? A bounce house and a mother who's one mimosa deep by noon." "That's genuinely more depressing than reassuring." "Think of it as reconnaissance for a softer world, Bill. A gentler heist." "There is nothing gentle about stealing a seven-year-old's Nintendo Switch." Molly grinned and pushed the wig toward him. "Then don't think about it. Just put on the nose." Bill picked up the foam nose, turned it over in his hands like a man contemplating his own descent into madness, and sighed. "If I end up on a wanted poster in clown makeup, I will haunt you." "Deal. Now go practice your balloon dog. We roll at one forty-five."
created_at
"2026-07-01T07:46:27.835Z"
7/1/2026, 7:46:27 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1890
moderation_latency
(null)
generation_time
18094
tokens_prompt
77
tokens_completion
803
native_tokens_prompt
93
native_tokens_completion
1097
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011156
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01AB7xSharUg6qS5vqgyWdBj"
is_byok
false
latency
1890
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891987-D4cqYvjmjDuny29UMOpR"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891987-XudZ5o3VZWpQNjJBqOmW"
upstream_id
"msg_01AB7xSharUg6qS5vqgyWdBj"
total_cost
0.011156
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
517
98.17%
Dialogue to Total Word Ratio
Ratio: 69.39%, Deviation: 0.61%
neededClean
false
wordsTotal
526
wordsDialogue
365
70.7724%