NC Bench
Run: 2026-07-01-01KWE9281Z6JMKRQXPMA0JVEMY
Run Details
Model: Claude Sonnet 5
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 15.4s
Cost: $0.010366
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator | Details Link
86.38% | Matches word count | (details)
87.18% | Dialogue to Total Word Ratio | (details)
86.7786% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse smelled like motor oil and stale pizza, which Bill found comforting in a way he couldn't quite explain. He spread the blueprint across the folding table, smoothing out the wrinkles with the flat of his hand. "So it's a six-year-old's birthday party," Molly said, peering over his shoulder. "You want us to rob a six-year-old's birthday party." "Not just any six-year-old. Tommy Winthrop. His dad owns half the shipping containers in this city. You know what that means?" "That we're going to prison for a very long time?" "It means top-shelf presents, Molly. We're talking drones, gaming consoles, probably a pony made of solid gold knowing these people." "A pony made of solid gold." "Figure of speech." Molly crossed her arms and stared at the blueprint like it had personally insulted her mother. "There's going to be like thirty kids there. Thirty screaming, sugar-crazed children who notice everything. You ever try to sneak past a six-year-old? They've got radar. Better than the Coast Guard." "That's why we're not sneaking. We're going in as the entertainment." "The what?" "The magician cancelled. I made sure of that." Bill grinned, pulling two cheap top hats from a duffel bag like he was revealing the crown jewels. "You're going to be the Amazing Molissa. I'm your assistant, Bartholomew the Magnificent." "Absolutely not." "Come on, it's perfect. We walk in, do a couple card tricks, pull a rabbit out of a hat—" "We don't have a rabbit." "We'll get a rabbit." "Bill." "Fine, no rabbit. Point is, everyone's watching the show, nobody's watching the present table, and you slip out back with the loot while I'm sawing myself in half or whatever." Molly pinched the bridge of her nose. "And how exactly do we get thirty wrapped presents out of a house full of parents without anyone noticing a giant hole where the gifts used to be?" "We don't take them all at once. We take them during the piñata." "The piñata." 
"Every kid in that yard is going to be beating a paper donkey to death with a stick. Parents are filming it on their phones. Nobody's looking at the present table. That's our window." "Our window is a piñata." "Our window is a piñata, Molly, yes." She stared at him for a long moment, then picked up the smaller of the two top hats and turned it over in her hands, inspecting the frayed brim like it might explain how she'd ended up here. "If this goes wrong," she said, "if we get caught by a six-year-old's birthday party—" "We won't." "If we do, I'm telling everyone this was your idea." "It was my idea." "I'm telling everyone it was worse than your idea. I'm telling everyone you also brought the rabbit." Bill smiled, already reaching for the hats. "Amazing Molissa. It's got a ring to it." "I hate you," Molly said, but she was already trying on the hat.
created_at
"2026-07-01T07:25:00.876Z"
7/1/2026, 7:25:00 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1966
moderation_latency
(null)
generation_time
15332
tokens_prompt
77
tokens_completion
758
native_tokens_prompt
93
native_tokens_completion
1018
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010366
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01GUzUtc1FsLTjcumN2TsArZ"
is_byok
false
latency
1966
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890700-mStFVixsaB7cdNxfWfJb"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890700-k2z64TnyCVq0XP9u6kCT"
upstream_id
"msg_01GUzUtc1FsLTjcumN2TsArZ"
total_cost
0.010366
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
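The meta fields above are enough to estimate generation throughput. The sketch below is only illustrative: it assumes generation_time is reported in milliseconds (which is roughly consistent with the 15.4s duration shown in Run Details), and the function name tokens_per_second is hypothetical, not part of NC Bench or any provider API.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Return completion tokens generated per second.

    Assumes generation_time_ms is a duration in milliseconds.
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the meta fields of this run.
native_rate = tokens_per_second(1018, 15332)      # native_tokens_completion
normalized_rate = tokens_per_second(758, 15332)   # tokens_completion

print(f"native: {native_rate:.1f} tok/s, normalized: {normalized_rate:.1f} tok/s")
```

The two rates differ because the run reports both a native token count and a normalized one; which of them is the better basis for comparison across models is a judgment call the page does not settle.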
Evaluation details
Result | Evaluator | Details | Meta Data
86.38% | Matches word count | n/a | neededClean: false; words: 489
87.18% | Dialogue to Total Word Ratio | Ratio: 71.66%, Deviation: 1.66% | neededClean: false; wordsTotal: 501; wordsDialogue: 359
86.7786% | Overall | |
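The ratio and deviation reported for the dialogue evaluator can be reproduced from the wordsDialogue and wordsTotal counts. The following is a minimal sketch under stated assumptions: the 70% target comes from the scenario, the function names are hypothetical, and treating the overall score as the plain mean of the two evaluator scores is an inference from the displayed numbers, not a documented rule. The formulas that turn a deviation or word count into the 87.18% and 86.38% scores are not given on this page, so they are not reproduced here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Percentage of all words that fall inside dialogue."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Absolute distance, in percentage points, from the target ratio."""
    return abs(ratio_pct - target_pct)


def mean_score(scores: list[float]) -> float:
    """Plain average of evaluator scores (assumed aggregation)."""
    return sum(scores) / len(scores)


ratio = dialogue_ratio(359, 501)          # wordsDialogue, wordsTotal
deviation = deviation_from_target(ratio)  # target of 70% from the scenario
overall = mean_score([86.38, 87.18])      # displayed evaluator scores

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Mean of displayed scores: {overall:.2f}%")
```

The mean of the two displayed scores is 86.78%, which agrees with the reported 86.7786% up to the rounding of the displayed per-evaluator values.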