NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTY7PGG8A3PQTCXVXH3EYS
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
18.0s
Cost:
$0.028190
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
10.0953%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the folding table, smoothing it with the reverence of a woman unveiling the Declaration of Independence. Bill leaned over, squinting through reading glasses held together with a paperclip and hope. "Alright," Molly said. "The target is 142 Maple Drive. Saturday, two p.m. sharp." "And the objective?" "Every single present at Timothy Hargrove's eighth birthday party." Bill whistled. "Eighth birthday. That's a big one. That's a bouncy castle birthday." "Confirmed bouncy castle. Also confirmed: piñata, face painter, and one of those guys who makes balloon animals." "A Twisty Gary?" "We don't know his name, Bill." "They're all named Gary." Molly pinched the bridge of her nose. "Focus. The Hargrove kid is connected. Dad's a dentist, mom's on the PTA. This party is going to be loaded. We're talking LEGO sets, Nintendo Switches, possibly a drone." "A drone?" Bill's eyes lit up. "For an eight-year-old?" "His grandfather's rich and guilty about something. It's in the intel." "What intel?" "Facebook." Bill nodded approvingly. "Solid intel. Okay, so what's the play?" "You go in as the clown." "Absolutely not." "Bill." "I told you, Molly, never again. Not after Akron." "Akron was six years ago." "The children remember." Molly sighed and pointed at the blueprint. "Fine. You go in as Twisty Gary." "Better. I can do balloons. I've got range." "While you're distracting the kids in the backyard, I slip in through the side gate, enter via the sliding glass door—which will be unlocked because nobody locks a door during a child's birthday party—and I locate the present table." "Where's the present table?" "Dining room. Always the dining room. It's the law." "And how do we get the presents out? There's gotta be, what, thirty kids? Thirty presents?" "I've got a collapsible hamper disguised as a gift. I walk in with it, I walk out with it full." Bill frowned. "That's it? That's the plan? You just walk out with a hamper?" "People don't question a woman with a hamper, Bill. It's the perfect cover. I could walk out of the Louvre with the Mona Lisa in a hamper." "What about the parents?" "Drinking rosé in the kitchen. They won't notice a fire, let alone us." "And the birthday boy?" "Timothy will be on the bouncy castle experiencing what doctors call a 'sugar event.' He won't know presents exist for another forty minutes." Bill scratched his chin. "I gotta say, Molly, this feels low. Even for us." "Bill. We stole a kidney from a hospital last month." "That kidney was going to a bad man." "It was going to a retired librarian." "Allegedly." Molly rolled up the blueprint. "Saturday. Two p.m. Bring your balloon pump and wear the rainbow suspenders." "Fine. But if one of those kids cries, I'm out." "Bill, every kid at a birthday party cries." "Then this is going to be a very short job."
created_at
"2026-04-24T13:30:18.207Z"
4/24/2026, 1:30:18 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
720
moderation_latency
(null)
generation_time
17948
tokens_prompt
77
tokens_completion
786
native_tokens_prompt
98
native_tokens_completion
1108
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02819
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"4867e52d-8e20-4267-b9d1-b763dd094a64"
is_byok
false
latency
720
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037418-sF7SX9aAkc2wG5HWHmZv"
session_id
(null)
api_type
"completions"
id
"gen-1777037418-Z2nUanCTHB7LLcQB1fOa"
upstream_id
"4867e52d-8e20-4267-b9d1-b763dd094a64"
total_cost
0.02819
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
480
0.00%
Dialogue to Total Word Ratio
Ratio: 85.27%, Deviation: 15.27%
neededClean
false
wordsTotal
482
wordsDialogue
411
10.0953%