NC Bench
Run: 2026-04-24-01KQ0G69ZT7ES79ZZQ9V3EQPZD
Run Details
Model: Claude Opus 4.7 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 15.0s
Cost: $0.027240
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
90.48% | Matches word count
0.02% | Dialogue to Total Word Ratio
45.2512% | Overall
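The page does not say how the overall score is derived. The sketch below checks one plausible reading, an unweighted mean of the two evaluator scores; that aggregation rule is an assumption, not something the report states. The small gap to the reported 45.2512% would be consistent with the displayed scores being rounded to two decimals.

```python
# Evaluator scores exactly as displayed in the summary (percent, rounded by the page).
scores = {
    "Matches word count": 90.48,
    "Dialogue to Total Word Ratio": 0.02,
}

# ASSUMPTION: the overall score is the unweighted mean of the evaluator scores.
overall = sum(scores.values()) / len(scores)

reported_overall = 45.2512  # value shown on the page
difference = abs(overall - reported_overall)

print(f"mean of displayed scores: {overall:.2f}%")
print(f"difference from reported overall: {difference:.4f} percentage points")
```

If the benchmark weights evaluators differently, only the line computing `overall` needs to change.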
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the sticky diner table, weighing down the corners with ketchup bottles and a half-eaten plate of hash browns. Bill squinted at it, then at her, then back at it. "This is a Denny's menu," he said. "It's the back of a Denny's menu. The blueprint's on the front. Flip it over, Bill." He flipped it over. The drawing was rendered in purple crayon, complete with a smiling sun in the upper corner. "Your nephew drew this?" "My nephew went to the party last year. This is reconnaissance." Bill rubbed his temples. "Molly. I've knocked over three armored cars. I've cracked a vault in Zurich. And now we're stealing a Paw Patrol backpack from a seven-year-old?" "Forty-seven presents," Molly corrected, stabbing the menu with her fork. "Rich kid. Hedge fund dad. Last year the birthday boy got a drone, an iPad, and a pony named Reginald." "They gave a kid a pony." "They gave a kid a pony, Bill." He leaned back, considering. A waitress refilled his coffee without making eye contact, which was ideal, operationally speaking. "Alright," he sighed. "Walk me through it." "Party starts at two. Bounce house goes up at one-thirty. That's our window." She tapped a crayon square labeled CASTEL. "I go in as the face painter. You're the magician." "I'm not a magician." "You pulled a quarter out of my ear on our third date." "That was a trick my grandfather taught me to flirt with waitresses." "And it worked, didn't it? You're a magician, Bill. Embrace it." He pinched the bridge of his nose. "Fine. I do balloon animals until cake time, then what?" "Cake time is chaos. Sugar-fueled stampede. Parents filming. Nobody watching the gift table." She drew a swooping arrow with an imaginary pen. "I distract the mom — she'll want a butterfly on her cheek, trust me, she's that kind of mom — and you roll the gift table out the patio doors to the van." "The whole table." "The whole table." "Molly, that's a table. Of presents. At a party." "Which is why nobody will expect it. 
Who steals a table, Bill? A lunatic. We are lunatics. That is our edge." Bill stared at the crayon sun, which was, he now realized, winking at him. "What about the clown?" "There's no clown." "There's always a clown." "Hedge fund dad hates clowns. Ever since the incident in Aspen." "What incident in Aspen?" "Not our business. Focus." She slid a photograph across the table. A pile of glittering gift bags taller than a Labrador. "Tell me that doesn't sing to you." Bill looked at the picture. He looked at Molly. He looked at his coffee, which had gone cold in a way that felt symbolic. "One condition," he said. "Name it." "If there's a pony this year, we take the pony." Molly grinned and raised her mug. "Obviously we take the pony." They clinked, and the smiling sun winked again.
created_at: "2026-04-24T19:41:42.782Z" (4/24/2026, 7:41:42 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 732
moderation_latency: 192
generation_time: 14748
tokens_prompt: 77
tokens_completion: 760
native_tokens_prompt: 98
native_tokens_completion: 1070
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.02724
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "ca1cfe25-f17b-487a-935c-b54701f42922"
    is_byok: false
    latency: 732
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777059702-rt2lE1zQ47XEmlj6hEQV"
session_id: (null)
api_type: "completions"
id: "gen-1777059702-cJk1MWuUQCHkovbrXN7W"
upstream_id: "ca1cfe25-f17b-487a-935c-b54701f42922"
total_cost: 0.02724
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
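The metadata reports token counts, a generation time, and a total cost, but not the per-token rates. As a rough cross-check, the sketch below assumes rates of $5 per million input tokens and $25 per million output tokens applied to the native token counts; those rates are an assumption chosen because they reproduce the reported total, not figures taken from the page. It also assumes `generation_time` is in milliseconds, which matches the 15.0s duration shown in the run details.

```python
import math

# Values copied from the metadata above.
native_tokens_prompt = 98
native_tokens_completion = 1070
generation_time_ms = 14748  # ASSUMPTION: unit is milliseconds
reported_total_cost = 0.02724

# ASSUMPTION: illustrative USD rates per million tokens, not stated in the report.
ASSUMED_INPUT_USD_PER_MTOK = 5.0
ASSUMED_OUTPUT_USD_PER_MTOK = 25.0


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  input_rate: float, output_rate: float) -> float:
    """Return the estimated cost in USD for per-million-token rates."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


estimated_cost = estimate_cost(
    native_tokens_prompt,
    native_tokens_completion,
    ASSUMED_INPUT_USD_PER_MTOK,
    ASSUMED_OUTPUT_USD_PER_MTOK,
)
matches_reported = math.isclose(estimated_cost, reported_total_cost, abs_tol=1e-9)

# Output throughput over the generation phase only.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"estimated cost: ${estimated_cost:.6f} (matches reported: {matches_reported})")
print(f"throughput: {tokens_per_second:.1f} completion tokens/s")
```

Under these assumptions the estimate lands on the reported $0.02724, which suggests billing follows the native token counts (98 and 1070) rather than the normalized ones (77 and 760).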
Evaluation details
Result | Evaluator | Details | Meta Data
90.48% | Matches word count | n/a | neededClean: false, words: 490
0.02% | Dialogue to Total Word Ratio | Ratio: 63.10%, Deviation: 13.10% | neededClean: false, wordsTotal: 496, wordsDialogue: 313
45.2512% | Overall
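The reported ratio and deviation follow directly from the word counts: 313 dialogue words out of 496 total is about 63.10%, which is 13.10 points above the 50% target. The sketch below reproduces that arithmetic and adds a hypothetical counter, `dialogue_ratio`, that treats text between straight double quotes as dialogue and splits words on whitespace. The benchmark's actual tokenization, and how it maps a deviation to the 0.02% score, are not given on the page, so neither is modeled here.

```python
import re

TARGET_RATIO = 0.50  # from the scenario: 50% dialogue

# Counts reported in the evaluator metadata.
words_dialogue = 313
words_total = 496

ratio = words_dialogue / words_total
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Hypothetical counter: words inside straight double quotes vs. all words.

    ASSUMPTION: words are whitespace-separated and dialogue is delimited by
    straight double quotes; the real evaluator may count differently.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    spoken = sum(len(span.split()) for span in quoted_spans)
    return spoken, total, (spoken / total if total else 0.0)


sample = 'Bill sighed. "Walk me through it."'
sample_spoken, sample_total, sample_ratio = dialogue_ratio(sample)
print(sample_spoken, sample_total, f"{sample_ratio:.2%}")
```

A counter like this will not necessarily reproduce the 313 and 496 figures exactly, since punctuation and quote handling change the counts; note that the two evaluators themselves report different totals (490 and 496) for the same response.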