NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0GG8DY155X67NAN6PBKBB7
Run Details
Model:
Claude Opus 4.7 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
16.3s
Cost:
$0.029240
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.3571%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's had seen better plans. It had also seen better pie, but Bill was working on that too, shoveling in cherry filling while Molly unrolled a blueprint across the syrup-sticky table. "This," she said, tapping the paper, "is the Kepplinger residence. Fortified. Impenetrable. A stronghold." Bill squinted. "That's a bouncy castle." "It's a *themed* stronghold." "Molly, we're robbing an eight-year-old." "We're robbing an eight-year-old's *guests*, Bill. There's a difference. It's like the difference between a bank and the people standing in line at the bank." "That is not a difference." She ignored him and circled a spot on the blueprint with a red crayon borrowed from the kids' menu. "Gift table is here. Presents arrive between one and one-thirty. Cake at two. Piñata at two-fifteen. That's our window." "The piñata?" "Chaos, Bill. Candy on the floor. Children screaming. Parents pretending they didn't just take a second mimosa. Nobody's watching the loot pile." Bill set down his fork. "And how, exactly, do we get in? The invitation list is, and I quote, 'Mrs. Henderson's third grade class and immediate family.'" "Clowns." "No." "Bill." "Molly, I said no clowns, we agreed, it was in the contract—" "There was no contract." "There was a verbal contract, and it specifically, emphatically, under-no-circumstances-ly excluded clowns." Molly pulled two red foam noses from her purse and set them on the table like poker chips. Bill stared at them the way a man stares at his own obituary. "I'll be Sprinkles," she said. "You can be Mr. Jingles." "I will not be Mr. Jingles." "Fine. Dr. Jingles. You've got a PhD now. Happy?" "What's my dissertation in?" "Balloon animals. Focus, Bill." He picked up a nose between two fingers. "And we just waltz in? No one asks questions?" "Sweetheart, nobody questions clowns. That's the beauty of it. Clowns exist in a liminal space between invited and unwelcome. We are neither guests nor intruders. We are *entertainment.*" "That's disturbingly profound." "I minored in theater." Bill exhaled, long and defeated, the sigh of a man who had once dreamed of robbing armored cars and was now agreeing to juggle for first graders. "And the getaway?" "Ice cream truck." "We don't own an ice cream truck." "We will by Saturday." "Molly." "Bill." "Every time we do this, every *single* time, I end up in a costume, crying in a parking lot, holding something I never asked to hold." She smiled and slid the foam nose across the table until it bumped his hand. "And every time, we walk away rich." "We walked away with forty-two dollars and a fondue pot last time." "A *copper* fondue pot." "It was plastic, Molly." "It was copper-*colored*, and I will not have you slander our portfolio." She stood, tucked the blueprint under her arm, and dropped a ten on the table. "Saturday. One o'clock. Bring your squeaky shoes." Bill looked at the nose. The nose looked back. "Honk," he whispered, and followed her out.
created_at
"2026-04-24T19:47:08.868Z"
4/24/2026, 7:47:08 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
665
moderation_latency
(null)
generation_time
16244
tokens_prompt
77
tokens_completion
815
native_tokens_prompt
98
native_tokens_completion
1150
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02924
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"891e9949-790c-4def-a8c6-4d0c0d7345ab"
is_byok
false
latency
665
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777060028-D8Pi7nhWpGkHWR3j8xIJ"
session_id
(null)
api_type
"completions"
id
"gen-1777060028-WuXWJup38CdsKnnuZiBs"
upstream_id
"891e9949-790c-4def-a8c6-4d0c0d7345ab"
total_cost
0.02924
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
494
0.00%
Dialogue to Total Word Ratio
Ratio: 64.75%, Deviation: 14.75%
neededClean
false
wordsTotal
505
wordsDialogue
327
49.3571%