NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTYBMDMVZ3H2ZH4S1PTRZH
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
17.1s
Cost:
$0.031415
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
6.09%
Matches word count
(details)
9.20%
Dialogue to Total Word Ratio
(details)
7.6476%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the kitchen table, pinning down the corners with a half-eaten Pop-Tart and three empty coffee mugs. "Behold," she said. "The Hendersons' backyard. Ground zero." Bill squinted. "That's a drawing of a bouncy castle." "It's a *tactical* drawing of a bouncy castle." "You drew a smiley face on it." "That's the entry point, Bill. Focus." She tapped the paper. "The party starts at two. Cake at two-thirty. Piñata at three. That's our window." "Why the piñata?" "Because," Molly said, leaning back triumphantly, "when twenty hopped-up eight-year-olds are swinging a bat at a papier-mâché donkey, no one is watching the present table." Bill chewed his lip. "I don't love this, Moll." "What's not to love?" "We're stealing from children." "We're stealing from *rich* children. There's a difference." "Is there?" "Morally? Enormously." Bill picked up the Pop-Tart, examined it, put it down. "What if a kid sees us?" "I've thought of that." Molly produced two rubber masks from a duffel bag and slapped them on the table. "Clowns." "Absolutely not." "Bill—" "I said when we started this partnership, no clowns. That was my one rule." "Clowns are disarming." "Clowns are *terrifying*. That's literally their whole deal." "Fine." She huffed and tossed him a different mask. "Happy?" Bill held it up. "This is Shrek." "Kids love Shrek." "Kids loved Shrek in 2003." "Shrek is timeless!" "Molly, I'm not wearing a Shrek mask to commit larceny." "Then wear the clown." "*Fine.* Shrek it is." She grinned and rolled up the blueprints. "Okay. Getaway vehicle?" "Ice cream truck." "Too slow." "It's literally designed to drive through neighborhoods full of children without raising suspicion. It's perfect." Molly paused. "That's... actually a decent point." "I have decent points sometimes." "You have *occasional* points. Don't oversell it." She grabbed a notepad. "Alright. I grab the big stuff—bikes, scooters, that mountain of American Girl dolls Aunt Linda keeps posting about on Facebook." "You've been researching Aunt Linda?" "I've been researching the *target*, Bill. Aunt Linda is a goldmine of intel. Did you know the birthday girl is getting a PlayStation 5?" "How do you—" "Aunt Linda cannot keep a secret to save her life." "God bless Aunt Linda." "God bless Aunt Linda." Molly raised a coffee mug. Bill clinked his against hers. "One question," he said. "Shoot." "What if they have a dog?" "They have a Pomeranian named Biscuit." "Oh no." "What?" "I can't rob a house with a Biscuit in it, Molly. I have a code." "You'll wear Shrek but you won't face a Pomeranian?" "Biscuits are innocent. Shrek is a choice." Molly rubbed her temples. "I'll bring turkey slices. Biscuit gets a feast. Everyone wins." "Everyone except the birthday girl." "She's turning nine, Bill. Builds character." "You are a terrible person." "I'm a *thorough* person." She rolled up the blueprints. "Suit up, Shrek. We've got a party to crash."
created_at
"2026-04-24T13:30:22.236Z"
4/24/2026, 1:30:22 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
714
moderation_latency
(null)
generation_time
17031
tokens_prompt
77
tokens_completion
839
native_tokens_prompt
98
native_tokens_completion
1237
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.031415
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"bc4589b4-2c88-43d2-9480-5324e0853669"
is_byok
false
latency
714
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037422-L49dGTL9njqPh88tdjAH"
session_id
(null)
api_type
"completions"
id
"gen-1777037422-zhW1UGjgYIGW6oZQ6J4S"
upstream_id
"bc4589b4-2c88-43d2-9480-5324e0853669"
total_cost
0.031415
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
6.09%
Matches word count
n/a
neededClean
false
words
477
9.20%
Dialogue to Total Word Ratio
Ratio: 76.91%, Deviation: 6.91%
neededClean
false
wordsTotal
485
wordsDialogue
373
7.6476%