NC Bench
Run: 2026-04-24-01KPZTYY45M1YSCWQSGCFJCKF7
Run Details
Model: Claude Opus 4.7
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 18.8s
Cost: $0.028515
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator | Details Link
51.93% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
25.9628% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky diner table, pinning down the corners with a ketchup bottle, a salt shaker, and two half-eaten pancakes. Bill leaned in, squinting through his reading glasses, which were held together with a paperclip and what appeared to be chewed gum.
"Okay," Molly said. "The Hendersons' backyard. Saturday. Fourteen hundred hours."
"That's two o'clock, Molly. Just say two o'clock."
"It sounds more professional the other way."
"We're stealing from a seven-year-old."
"Eight," Molly corrected. "Little Timmy turns eight on Saturday, which is precisely why the loot will be at peak volume. Bikes. Tablets. Possibly a Nintendo."
Bill scratched his beard. "I still feel weird about this."
"Last week you stole a kidney from a hospital."
"That was different."
"How?"
"The kidney didn't have feelings."
Molly sighed and tapped the blueprint. "Focus. Bouncy castle here. Clown station here. Cake table here. The presents get piled on this folding table by the oak tree, guarded only by Grandma Henderson, who, according to my research, peaks at three mimosas before noon."
"How did you research that?"
"I befriended her at Zumba."
"You go to Zumba?"
"I go where the intel is, Bill."
Bill whistled, impressed despite himself. "And the plan?"
"Simple. You dress as the clown."
"Absolutely not."
"Bill."
"I have a phobia, Molly. You know this."
"You have a phobia of clowns, and your solution is to refuse to be one?"
"That's exactly how phobias work!"
Molly rubbed her temples. "Fine. You're the magician. Pull a rabbit out of a hat, distract the children, while I swoop in with the duffel bag and extract the presents through the hedge."
"What about the parents?"
"The parents," Molly said, smiling slowly, "will be occupied by the pony."
"What pony?"
"The pony I rented."
"You rented a pony?"
"I rented a pony, Bill. Keep up."
Bill took off his glasses, which immediately fell apart. "And when the kids realize all their presents are gone?"
"By then we'll be halfway to the fence. Tommy from Fencing and Fenceworks is giving us forty cents on the dollar for anything still in the box."
"His name is literally Tommy Fence?"
"It's a family business."
Bill stared at the blueprint for a long moment. A single tear formed in the corner of his eye, though it might have been from the pancake syrup fumes. "My mother would be so disappointed."
"Your mother robbed an armored car in 1987."
"Yes, but she had standards. She wouldn't take candy from a baby."
"We're not taking candy," Molly said gently, squeezing his hand. "We're taking a PlayStation, a Razor scooter, and, if the gift registry is accurate, a small trampoline."
Bill nodded slowly, convinced. "Okay. I'm in."
"Good." Molly rolled up the blueprints. "One more thing."
"Yeah?"
"You're still the clown. The magician quit this morning. Allergic to doves."
Bill's scream echoed through the diner, startling a nearby toddler into tears.
created_at: "2026-04-24T13:30:41.172Z" (4/24/2026, 1:30:41 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3767
moderation_latency: (null)
generation_time: 18624
tokens_prompt: 77
tokens_completion: 803
native_tokens_prompt: 98
native_tokens_completion: 1121
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.028515
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "771ad915-1079-4651-9d93-382f78444263"
    is_byok: false
    latency: 3767
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777037441-pQL5Sz4KCdFiAeop1Dd4"
session_id: (null)
api_type: "completions"
id: "gen-1777037441-JD9l4wb9kIiFvvSzd7nJ"
upstream_id: "771ad915-1079-4651-9d93-382f78444263"
total_cost: 0.028515
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
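The cost and timing fields in the metadata can be cross-checked with a little arithmetic. The sketch below is illustrative only. The per-million-token rates (5 USD for input, 25 USD for output) are assumptions, chosen because they reproduce the reported total_cost from the native token counts; the page itself does not state any rates. It also assumes generation_time is in milliseconds, which is consistent with the 18.8s duration shown under Run Details. The function names are hypothetical.

```python
# Hypothetical helpers for sanity-checking the run metadata.
# ASSUMPTION: rates are USD per one million tokens and are not stated on the page.
ASSUMED_INPUT_RATE = 5.0
ASSUMED_OUTPUT_RATE = 25.0


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  input_rate: float = ASSUMED_INPUT_RATE,
                  output_rate: float = ASSUMED_OUTPUT_RATE) -> float:
    """Estimate the cost in USD from token counts and per-million-token rates."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Output throughput, ASSUMING generation_time is reported in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


# Values copied from the metadata: native_tokens_prompt, native_tokens_completion,
# and generation_time.
cost = estimate_cost(98, 1121)
rate = tokens_per_second(1121, 18624)
print(f"estimated cost: ${cost:.6f}")
print(f"throughput: {rate:.1f} tokens/s")
```

Under these assumed rates the estimate agrees with the reported total_cost of 0.028515, which suggests billing was based on the native token counts rather than the normalized tokens_prompt and tokens_completion values.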
Evaluation details
Result | Evaluator | Details | Meta Data
51.93% | Matches word count | n/a | neededClean: false, words: 484
0.00% | Dialogue to Total Word Ratio | Ratio: 71.46%, Deviation: 21.46% | neededClean: false, wordsTotal: 487, wordsDialogue: 348
25.9628%
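The reported ratio and deviation follow directly from the word counts: 348 dialogue words out of 487 total is about 71.46%, which is 21.46 percentage points away from the 50% target. The sketch below reproduces that arithmetic and shows one possible way to count dialogue words. It is not the benchmark's evaluator: the counting rule (whitespace-separated words inside straight double quotes) and all function names are assumptions, and the real evaluator may tokenize differently, as the two evaluators' differing totals (484 and 487) suggest.

```python
import re


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (total_words, dialogue_words).

    ASSUMPTION: a word is a whitespace-separated token, and dialogue is
    any text between straight double quotes.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    return total, dialogue


def ratio_and_deviation(dialogue_words: int, total_words: int,
                        target: float = 0.5) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute deviation from the target."""
    ratio = dialogue_words / total_words if total_words else 0.0
    return ratio, abs(ratio - target)


# Reproduce the reported figures from wordsDialogue and wordsTotal.
ratio, deviation = ratio_and_deviation(348, 487)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Small usage example on a made-up sentence.
sample = 'Molly nodded. "Two o\'clock," she said. "Bring the bag."'
print(count_dialogue_words(sample))
```

The overall score appears consistent with a simple mean of the two evaluator scores: averaging the rounded values 51.93% and 0.00% gives 25.965%, close to the reported 25.9628%, with the small gap plausibly due to rounding of the displayed word-count score.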