NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZV382CKMG8G9E878TEKEZJ
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
15.6s
Cost:
$0.027365
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
81.27%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
40.6364%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the blueprint across the kitchen table, smoothing it flat with the reverence of a surgeon preparing to operate. Bill leaned in, squinting at the crayon-drawn floor plan. "This is the target?" Bill asked. "Four-year-old's birthday party. Theme is dinosaurs." "Molly, this blueprint is drawn in Crayola." "I stole it off the fridge last week when I was posing as the cable guy. It's accurate." Bill traced a shaky purple line with his finger. "What's this supposed to be?" "The bouncy castle." "And this?" "Also the bouncy castle. The kid drew it twice. He's got range." Bill sat back and rubbed his temples, the way a man does when his life choices arrive all at once. Molly was already laying out surveillance photos beside the blueprint, each one showing a pile of gift-wrapped boxes stacked like a cardboard Everest. "There's gotta be eighty presents in that pile," Bill said. "Ninety-three. I counted through the window." "Ninety-three presents for one kid?" "His dad's a dentist. Guilt gifts." Bill whistled low. "Alright. What's our play?" "You're going in as the clown." "Absolutely not." "Bill." "Molly, I have a documented, clinically diagnosed fear of my own reflection in greasepaint. We've been over this." "Then you're the magician." "Better." Molly slid a red nose across the table anyway, just in case. Bill slapped it away like it was a cockroach. "I'll handle the pony," she said. "There's a pony?" "There will be when I show up with one. Kids chase ponies. Ponies create diversions. Diversions create opportunity." "Where are you getting a pony on a Saturday?" "Don't worry about the pony, Bill. The pony is handled." "That's what you said about the ferret at the bar mitzvah." "The ferret was handled. The ferret was, in fact, overhandled." Bill pinched the bridge of his nose. "Okay. Clown—magician, sorry—distracts the parents with card tricks. Pony handles the children. Then what?" "Then I back the minivan up to the patio door, and we load the presents into giant garbage bags while singing happy birthday loud enough to cover the rustling." "We sing happy birthday." "In harmony. You take the low part." "I always take the low part." "Because you've got a voice like a foghorn with emotional problems. It's a gift, Bill. Use it." He stared at the ceiling, where a water stain shaped vaguely like regret was slowly spreading. "And if a parent catches us?" "We say we're from the Gift Inspection Service. Due to recent regulations, all presents must be removed for lead testing." "That's not a real thing." "Parents don't know that. Parents will believe anything that sounds like it might save their child's life or their property values." Bill exhaled, long and slow, and picked up the red clown nose. He turned it over in his hand with the solemnity of a man accepting a grenade. "Fine," he said. "But if there's a piñata, I'm taking it personally."
created_at
"2026-04-24T13:33:02.423Z"
4/24/2026, 1:33:02 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
862
moderation_latency
(null)
generation_time
15550
tokens_prompt
77
tokens_completion
795
native_tokens_prompt
98
native_tokens_completion
1075
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.027365
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"69b965b2-2ead-4569-a3e2-2b8f42414edf"
is_byok
false
latency
862
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037582-CGVwpGLljRmCdRSiQS8M"
session_id
(null)
api_type
"completions"
id
"gen-1777037582-lEuYRFaD2l6xgvN40iU4"
upstream_id
"69b965b2-2ead-4569-a3e2-2b8f42414edf"
total_cost
0.027365
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
81.27%
Matches word count
n/a
neededClean
false
words
488
0.00%
Dialogue to Total Word Ratio
Ratio: 66.40%, Deviation: 16.40%
neededClean
false
wordsTotal
494
wordsDialogue
328
40.6364%