NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZV2K8QA8KAE5NW77F9MY7J
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
15.1s
Cost:
$0.028490
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
21.6892%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the hood of the Buick, anchoring its corners with a juice box, a stolen party hat, and two half-eaten cupcakes she refused to explain. "Okay," she said, tapping the paper. "Backyard entrance, here. Bouncy castle, here. Present table, here. Questions?" Bill squinted. "Yeah. Why is the bouncy castle labeled 'Point of No Return'?" "Because once you're in one, Bill, you're in one." "That's not a strategic concern, that's a personal one." "Everything is personal when you're forty-one and wearing socks with Velcro shoes." Bill adjusted his clip-on tie, which featured a cartoon dinosaur eating a birthday cake. "I look ridiculous." "You look like an uncle." "I look like the uncle parents warn kids about." "Exactly. Perfect cover." Molly handed him a gift bag the size of a small refrigerator. "This is empty. You fill it. Priority targets are the big boxes near the cake. That's where the Legos live." "Legos?" "Bill. Legos are currency. Legos are the gold standard of the under-ten black market. You bring me a sealed Millennium Falcon set and I will personally name my firstborn after your cat." "My cat's name is Gary." "Then Gary it is." Bill scratched his neck where the tie was digging in. "What about the parents?" "Distracted. There's a clown." "You hired a clown?" "I *am* the clown." "Oh no." "Oh yes." Molly lifted a duffel bag and produced a red nose, a rainbow wig, and what appeared to be a very tired squirrel on a leash. "Molly. Is that a squirrel." "His name is Mr. Peanuts and he works for cashews." "That's a felony in at least two states." "Then we won't do it in those states." She shoved the squirrel back in the bag, where he made a noise like a kazoo being strangled. "Focus. You've got fourteen minutes between 'Happy Birthday' and 'piñata time.' That's your window." "Why piñata time?" "Because when candy hits the ground, Bill, society collapses. Children become animals. We will be invisible in the carnage." "And if someone notices?" "You cry." "I cry?" "You cry and you say you're Timmy's dad." "Who's Timmy?" "Doesn't matter. There is always a Timmy. Statistically, at any children's party, forty percent of the boys are named Timmy, Tommy, or Brayden." "Brayden isn't—" "Brayden, Bill. Always Brayden." Bill exhaled slowly, staring at the blueprint like it had personally insulted his mother. "Remind me why we're stealing from children again?" "Because adults lock their doors. Children have streamers." "That's not an ethical justification." "It's not supposed to be. It's a logistical one." He picked up the empty gift bag. It had a picture of a bear on it saying *You're BEARY Special!* "Molly." "Yes, Bill." "If I get tackled by a six-year-old, I want you to know I will fold immediately." "I'd expect nothing less." She slapped the red nose onto her face. "Let's go ruin a childhood."
created_at
"2026-04-24T13:32:41.124Z"
4/24/2026, 1:32:41 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
895
moderation_latency
(null)
generation_time
14824
tokens_prompt
77
tokens_completion
791
native_tokens_prompt
98
native_tokens_completion
1120
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02849
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"11fe8861-1697-4d76-a102-02ff23cd3435"
is_byok
false
latency
895
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037561-gRBnuiFCAL6uylCLm0Hm"
session_id
(null)
api_type
"completions"
id
"gen-1777037561-fCAM9JQOuoIglX1wvRG1"
upstream_id
"11fe8861-1697-4d76-a102-02ff23cd3435"
total_cost
0.02849
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
483
0.00%
Dialogue to Total Word Ratio
Ratio: 67.48%, Deviation: 17.48%
neededClean
false
wordsTotal
489
wordsDialogue
330
21.6892%