NC Bench
Run: 2026-07-01-01KWEAGQV7B5HPDW6N93D24WQW
Run Details
Model: Claude Sonnet 5
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 18.6s
Cost: $0.011136
Provider: Amazon Bedrock
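For a rough sense of generation speed, the duration can be combined with the completion token count reported in the response metadata (generation_time 18590 and native_tokens_completion 1095). This is a minimal sketch: it assumes generation_time is in milliseconds, which is consistent with the 18.6s duration shown here, and the helper name is hypothetical. Whether that figure includes the initial latency is not stated on the page.

```python
# Values copied from this run's metadata; the unit (ms) is an assumption.
generation_time_ms = 18590
native_tokens_completion = 1095


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average completion tokens generated per second of wall-clock time."""
    if elapsed_ms <= 0:
        raise ValueError("elapsed_ms must be positive")
    return tokens / (elapsed_ms / 1000)


throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"{throughput:.1f} tokens/s")
```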
Evaluation summary
Score | Evaluator
68.10% | Matches word count
80.19% | Dialogue to Total Word Ratio
74.1472% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse smelled like motor oil and stale pizza, which is where Molly and Bill always met when they were planning something truly stupid. "A children's party," Bill said, not looking up from his sandwich. "You want us to rob a children's party." "Not rob," Molly said. "Liberate. There's a difference." "Enlighten me." "Robbing implies violence. This is going to be smooth. In and out. Nobody even notices we were there until the kid opens his presents and finds a note that says 'thanks, buddy.'" Bill set the sandwich down like it had personally offended him. "You're leaving a note." "I'm leaving a note." "Molly. We are stealing from children. There is no note that fixes that." "It's not the kids' presents, it's the parents' money, and half these people bought garbage the kid's gonna forget about in a week anyway. We're doing him a favor. Simplifying his life." "By taking everything he owns." "By taking everything he was *given.* Ownership's a construct." Bill rubbed his temples like he was trying to physically push the headache back into his skull. "How many presents are we talking?" "Kid's turning seven. Rich family. Country club birthday party, bounce house, the whole circus. I'm talking forty, fifty gifts, easy. Bikes. Electronics. Somebody's getting a drone, Bill. A *drone.*" "And how exactly do two adults who are not on any guest list waltz into a country club party and walk out with a truckload of children's gifts?" Molly grinned, which was never a good sign. "Clowns." "No." "We rent clown costumes—" "Absolutely not." "—and we tell the staff we're the entertainment, the real clown canceled, tragic story, food poisoning, very sad—" "Molly." "—and once we're in, there's a gift table right by the entrance, unattended, because who's watching presents at a *seven-year-old's birthday party*, Bill? Nobody. That's the beauty of it." Bill stared at her for a long moment. 
"You want us to dress as clowns, infiltrate a private party, and steal a mountain of gifts in broad daylight, in front of forty children, while balloon animals are happening." "I didn't say balloon animals were mandatory." "They will absolutely ask us to make balloon animals." "Then we make bad balloon animals. Nobody questions a bad clown. They just feel sorry for him." Bill picked his sandwich back up, considered it, then set it down again. "This is the dumbest plan you've ever had." "You said that about the yacht." "The yacht plan *worked.*" "Exactly." Molly leaned back, satisfied, like she'd already won. "Look, we get the gifts, we fence the electronics, the bikes go to that guy in Queens who doesn't ask questions, and we're three towns over before anyone thinks to check the clowns." Bill was quiet for a second too long, which Molly knew meant he was already in. "I'm not wearing the shoes," he said finally. "The big floppy ones. I trip in those, and it's over." "Fine. No shoes. But you're doing the wig." "I'm doing the wig." "And the nose." "I draw the line at the nose being *red.* Orange." "Deal."
created_at: "2026-07-01T07:50:24.371Z" (7/1/2026, 7:50:24 AM)
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2151
moderation_latency: (null)
generation_time: 18590
tokens_prompt: 77
tokens_completion: 820
native_tokens_prompt: 93
native_tokens_completion: 1095
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.011136
router: (null)
provider_responses:
  0:
    endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
    id: "msg_016EsaWiVhgDwxgYYxsmguAP"
    is_byok: false
    latency: 2149
    model_permaslug: "anthropic/claude-sonnet-5-20260630"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782892224-gXSj1BjN41gnt1LFyOu4"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782892224-MsBdz3bBo1iWBLT4OtOf"
upstream_id: "msg_016EsaWiVhgDwxgYYxsmguAP"
total_cost: 0.011136
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result | Evaluator | Details | Meta Data
68.10% | Matches word count | n/a | neededClean: false, words: 514
80.19% | Dialogue to Total Word Ratio | Ratio: 67.90%, Deviation: 2.10% | neededClean: false, wordsTotal: 514, wordsDialogue: 349
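The reported ratio and deviation can be reproduced from the word counts: 349 dialogue words out of 514 total, measured against the 70% target from the scenario. The sketch below also shows one plausible way to count dialogue words, by treating text inside straight double quotes as dialogue and splitting on whitespace. That counting rule and the function name are assumptions for illustration; the page does not say how the evaluator tokenizes, or how the 2.10% deviation is mapped to the 80.19% score.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumption: dialogue is whatever sits between straight double quotes,
    and a "word" is any whitespace-separated token.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = (dialogue / total) if total else 0.0
    return dialogue, total, ratio


# Reproduce the reported figures from the counts in the table.
words_dialogue, words_total, target = 349, 514, 0.70
reported_ratio = words_dialogue / words_total
deviation = abs(target - reported_ratio)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation:.2%}")

# Small usage example on a made-up sentence (not taken from the run).
sample = 'Bill sighed. "No clowns," he said. "Absolutely not."'
print(dialogue_ratio(sample))
```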
Overall: 74.1472%
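The overall figure is consistent with an unweighted mean of the two evaluator scores, allowing for the fact that those scores are displayed rounded to two decimals. The check below is a sketch under that assumption; the page does not state how the overall score is aggregated.

```python
# Evaluator scores as displayed (rounded to two decimals on the page).
scores = {
    "Matches word count": 68.10,
    "Dialogue to Total Word Ratio": 80.19,
}
reported_overall = 74.1472

# Assumed aggregation: a plain arithmetic mean of the evaluator scores.
mean_score = sum(scores.values()) / len(scores)

# Each displayed score can be off by up to 0.005, so the mean can be too.
rounding_tolerance = 0.005
is_consistent = abs(mean_score - reported_overall) <= rounding_tolerance
print(f"mean={mean_score:.4f}, reported={reported_overall}, consistent={is_consistent}")
```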