NC Bench
Run: 2026-04-24-01KPZTXW9KYR949J0SM7M63KRB
Run Details
Model: Claude Opus 4.7
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 15.5s
Cost: $0.027890
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
20.19% | Matches word count
0.00% | Dialogue to Total Word Ratio
10.0949% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky diner table, smoothing them with the reverence of a general before battle. Bill slurped his milkshake. "Stop that," she said. "Stop what?" "That sound. It's distracting." "It's a milkshake, Molly. It's supposed to make sounds." She tapped the paper. "Focus. The Pemberton residence. Seven-year-old's birthday party. Theme is, and I quote, 'Unicorn Pirates of the Caribbean.'" "That's not a theme. That's a cry for help." "The point is, there will be approximately thirty children, fifteen parents, one magician, and a bouncy castle shaped like a dragon." Bill leaned forward. "How much loot are we talking?" "Conservative estimate? Two grand in presents. The Pembertons invited the entire second grade. These are hedge fund children, Bill. Their stocking stuffers cost more than your car." "My car is a classic." "Your car is a 1997 Dodge Neon with a bumper held on by prayer." "It has character." "It has tetanus." She jabbed the blueprint. "Entry point is the sunroom. The presents will be stacked here, on the dining room table, until cake. That's our window." Bill squinted. "And how exactly do two grown adults slip past thirty children? Kids are like motion sensors with juice boxes. They see everything." "Which is why you're going in as the clown." "Absolutely not." "Bill." "Molly, I told you. After last Christmas. After the mall. After what happened with that elf costume—" "That was a misunderstanding." "A child bit me, Molly. On the face." "And you bit him back, which is why we can't go to that mall anymore." Bill sank into the vinyl booth. "I'm not wearing the nose." "You'll wear the nose, and the wig, and the oversized shoes, and you'll juggle, and you'll like it." "I can't juggle." "Then learn. You have forty-eight hours." "What are you going to be doing while I'm getting assaulted by second-graders?" "I'll be the caterer. Black polo, clipboard, resting professional face. Nobody questions a woman with a clipboard. 
I'll slip into the dining room, load the presents into catering trays, and walk out the front door like I'm delivering crudités." Bill stirred his milkshake gloomily. "This is undignified." "This is a payday." "What if someone wants a balloon animal?" "Make one." "I can only make the snake." "A snake is an animal, Bill." "A snake is a straight balloon, Molly. It's a balloon I didn't twist. It's a failure balloon." She reached across the table and squeezed his hand. "Listen to me. We get in, we smile, we grab the presents, we're gone before they cut the cake. In and out. Clean. Easy. Nobody gets hurt, nobody gets bit." "You can't promise that last part." "I can promise I'll try." Bill sighed, long and defeated. "Fine. But I'm keeping whatever's in the biggest box." "Deal." "And Molly?" "Yeah?" "If a kid bites me, I'm biting back." "I would expect nothing less."
created_at
"2026-04-24T13:30:06.528Z"
4/24/2026, 1:30:06 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1365
moderation_latency
(null)
generation_time
15365
tokens_prompt
77
tokens_completion
792
native_tokens_prompt
98
native_tokens_completion
1096
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02789
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"47249e86-db7e-4356-ae11-12896423136a"
is_byok
false
latency
1365
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037406-j4yb0x4VKHEBMreEPZ6u"
session_id
(null)
api_type
"completions"
id
"gen-1777037406-krkX4gbpmZfWD7xRQL4O"
upstream_id
"47249e86-db7e-4356-ae11-12896423136a"
total_cost
0.02789
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
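The cost figure can be cross-checked against the native token counts listed in the metadata. The sketch below is a sanity check only: the per-million-token prices are assumptions chosen because they reproduce the reported total, and they do not appear anywhere on this page. The function name `estimate_cost` is hypothetical.

```python
# Assumed prices in USD per million tokens. These are NOT stated on the
# page; they are illustrative values that happen to match total_cost.
ASSUMED_INPUT_PRICE_PER_MTOK = 5.00
ASSUMED_OUTPUT_PRICE_PER_MTOK = 25.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    input_cost = prompt_tokens * ASSUMED_INPUT_PRICE_PER_MTOK
    output_cost = completion_tokens * ASSUMED_OUTPUT_PRICE_PER_MTOK
    return (input_cost + output_cost) / 1_000_000


# native_tokens_prompt = 98, native_tokens_completion = 1096 (from the metadata)
cost = estimate_cost(98, 1096)
print(f"${cost:.6f}")
```

Under these assumed prices, the estimate agrees with the `total_cost` value of 0.02789. Note that the check uses the native token counts (98 and 1096), not the normalized `tokens_prompt` and `tokens_completion` fields.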
Evaluation details
Result | Evaluator | Details | Meta Data
20.19% | Matches word count | n/a | neededClean: false, words: 480
0.00% | Dialogue to Total Word Ratio | Ratio: 86.98%, Deviation: 16.98% | neededClean: false, wordsTotal: 484, wordsDialogue: 421
10.0949% | Overall
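The ratio and deviation reported for the dialogue evaluator follow directly from the word counts in its metadata. The sketch below reproduces that arithmetic. The 70% target comes from the scenario text. The quote-based counter is only an illustration of how dialogue words might be counted; the page does not describe the evaluator's actual method, so `count_dialogue_words` and the other function names are hypothetical.

```python
import re

TARGET_DIALOGUE_RATIO = 0.70  # from the scenario: "70% dialogue"


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed counting rule)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in quoted_spans)


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that are dialogue."""
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = TARGET_DIALOGUE_RATIO) -> float:
    """Absolute distance between the measured ratio and the target."""
    return abs(ratio - target)


# wordsDialogue = 421 and wordsTotal = 484, as reported by the evaluator.
ratio = dialogue_ratio(421, 484)
deviation = deviation_from_target(ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# The overall score looks like an unweighted mean of the two evaluator
# scores (an inference from the numbers, not something the page states).
overall = (20.19 + 0.00) / 2
print(f"Overall (approx.): {overall:.2f}%")
```

The computed ratio and deviation match the reported 86.98% and 16.98%. The mean of the two displayed scores is 10.095%, close to the reported 10.0949%, which suggests the word count score is rounded for display.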