NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE8W1AQVHQB2N6VYMN9H85T
Run Details
Model:
Claude Sonnet 5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
17.3s
Cost:
$0.011026
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.9995%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the van, which Bill had painted to look like a florist's delivery vehicle, except he'd spelled it "FLOREST" and refused to fix it because "nobody reads that carefully anyway." Molly sat in the passenger seat with a clipboard, studying a hand-drawn map of the Hendricks' backyard like it was a bank vault schematic. "Okay," she said. "The party starts at two. Bouncy castle goes up at one-thirty. That's our window." Bill cracked his knuckles. "I still think we should go in as clowns." "We are not going in as clowns." "Kids trust clowns." "Kids are *terrified* of clowns, Bill, that's literally the whole thing about clowns." "Not my clown. My clown's got a gentle energy." Molly closed her eyes and counted to five. This was a tactic her therapist had suggested for anger management, though she doubted the therapist had ever pictured it being used mid-heist with a man in a rubber wig sitting shotgun. "We're going in as the balloon company. Simple. Elegant. Nobody questions a guy delivering forty balloons to a seven-year-old's birthday party." "And the presents?" "We load them into the balloon truck under the guise of, quote, 'inflating them properly at the warehouse.'" Bill turned to look at her. "You're going to tell a birthday party full of parents that presents need to be inflated." "I'm going to tell them whatever gets us out of there with a truck full of Nintendo Switches and unopened Lego sets, Bill." He considered this. "There's a piñata." "I saw the piñata." "It's shaped like a dinosaur." "I know what shape it is." "I want the dinosaur." Molly turned to him fully now, clipboard trembling in her grip. "We are not stealing the piñata. The piñata is not the mission. The mission is presents. High-value, resellable, does-not-require-a-baseball-bat-to-retrieve presents." "But it's shaped like a dinosaur, Molly." "Everything is shaped like a dinosaur to a man who peaked emotionally at age nine." Bill sat back, wounded, and stared out at the rain like it had personally betrayed him. After a moment he said, quieter, "You really think the balloon thing'll work?" "It worked in Ohio." "That kid saw through it in four minutes." "That kid was gifted. This is Ridgewood. Average kid, average parents, one of whom is already on his third mimosa according to the intel." She tapped the clipboard. "We walk in, we look natural, we grab everything with a bow on it, we walk out. Presents into truck, truck onto highway, gone before the candles are even lit." Bill pulled the rubber wig from the glovebox and turned it over in his hands like a man saying goodbye to an old friend. "Fine. No dinosaur." "No dinosaur." "But if there's a piñata *later*, at a *different* party—" "We'll discuss it as a team." He smiled, satisfied, and shoved the wig back into the glovebox. Outside, the rain kept falling, and somewhere two blocks away, a bouncy castle began, unknowingly, to inflate.
created_at
"2026-07-01T07:21:37.379Z"
7/1/2026, 7:21:37 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1721
moderation_latency
(null)
generation_time
17248
tokens_prompt
77
tokens_completion
788
native_tokens_prompt
93
native_tokens_completion
1084
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011026
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01MDEwRdrcuwL7SBy63D2Z9y"
is_byok
false
latency
1721
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890497-BxvsSU20HyTVbhh0wc8i"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890497-lLA0XlWE1LikUhN4ky7d"
upstream_id
"msg_01MDEwRdrcuwL7SBy63D2Z9y"
total_cost
0.011026
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
499
0.00%
Dialogue to Total Word Ratio
Ratio: 62.70%, Deviation: 32.70%
neededClean
false
wordsTotal
512
wordsDialogue
321
49.9995%