NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTZESQ8T53WZDBE07WF9J4
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
16.6s
Cost:
$0.029490
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
17.5012%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky diner table, pushing aside a plate of congealed hash browns. Bill leaned in, squinting, a pencil tucked behind his ear like he was a real architect and not a man who had been fired from three different Chuck E. Cheeses. "Okay," Molly said, tapping the paper. "The target is here. The Hendersons' backyard. Timmy's seventh birthday." "Why are we robbing a seven-year-old?" "Because seven-year-olds get the good stuff, Bill. LEGOs. Nintendo Switches. That weird little robot dog that barks and files your taxes." "It files taxes?" "Focus." Bill scratched his chin. "What's the entry point?" "The bouncy castle." "The bouncy castle is an entry point?" "It is if you commit, Bill." He stared at her. "Molly, I weigh two hundred and forty pounds. I am not going to stealth-bounce." "You'll be in disguise." "As what?" "A clown." Bill put his head in his hands. "No." "Yes." "Absolutely not." "You already bought the shoes." "Those were for my cousin's wedding!" Molly slid a glossy photograph across the table. It showed a massive pile of wrapped presents, gleaming beneath a piñata shaped like a unicorn. Bill's eye twitched. "That," Molly whispered, "is six hundred dollars of loot, minimum. Maybe eight with the gift cards." "Gift cards can be traced." "Not Build-A-Bear gift cards, Bill. Build-A-Bear operates outside the law." Bill exhaled slowly. "Fine. Walk me through it." "Okay. 2:15 PM, the magician arrives. That's me." "You're not a magician." "I know three card tricks and I own a cape." "That is not—" "2:30, I announce the Disappearing Present Trick. I gather every gift into a large trunk. Classic misdirection." "And then?" "And then I say 'abracadabra,' and the presents vanish." "Where do they go?" "Into the back of your clown van." "I don't have a clown van." "You have a Previa." "That's my mother's!" "She said you could borrow it." Bill slammed his palm on the table. A waitress glanced over, then decided she didn't get paid enough. "Molly. I am begging you. Can we, for one job, have a plan that doesn't involve me disappointing my mother?" "Bill." Molly took his hand, her eyes shining with a criminal's sincerity. "This is the heist of the year. Kindergarten Class. Little Timmy turns seven. The whole neighborhood is there. Parents distracted by seven-layer dip. We waltz in, we waltz out, and by 4:00 PM we are sipping margaritas bought with the liquidated assets of a child's birthday." Bill was quiet for a long moment. "What flavor margaritas?" "Mango." "Goddammit." "So you're in?" "I'm in." He picked up the clown nose sitting beside the ketchup bottle and stared at it like it was Yorick's skull. "But if that kid cries, Molly, I swear to God—" "He won't cry." "He better not." "He'll be too busy watching the magic." Bill squeezed the nose. It honked, mournfully, like a funeral trumpet.
created_at
"2026-04-24T13:30:58.244Z"
4/24/2026, 1:30:58 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
977
moderation_latency
(null)
generation_time
16447
tokens_prompt
77
tokens_completion
803
native_tokens_prompt
98
native_tokens_completion
1160
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02949
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"8d660b28-3612-4ed7-ada9-cfed63c1b647"
is_byok
false
latency
977
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037458-vDMgUOBYdG5FOHvXQjlQ"
session_id
(null)
api_type
"completions"
id
"gen-1777037458-WZP2Lk56EDadCJgDUlXk"
upstream_id
"8d660b28-3612-4ed7-ada9-cfed63c1b647"
total_cost
0.02949
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
482
0.00%
Dialogue to Total Word Ratio
Ratio: 66.26%, Deviation: 16.26%
neededClean
false
wordsTotal
492
wordsDialogue
326
17.5012%