NC Bench
Run: 2026-04-24-01KQ0G7FWB8ZDBH2X98AAR874M
Run Details
Model: Claude Opus 4.7 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 16.5s
Cost: $0.027665
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
0.49% | Matches word count
1.96% | Dialogue to Total Word Ratio
1.2242% | Overall
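The Overall figure is numerically consistent with a plain average of the two evaluator scores. The page does not document how Overall is computed, so the sketch below is only an observation under that assumption; the tolerance accounts for the displayed per-evaluator scores being rounded to two decimals.

```python
from statistics import mean

# Scores as displayed in the evaluation summary (percent values).
evaluator_scores = {
    "Matches word count": 0.49,
    "Dialogue to Total Word Ratio": 1.96,
}
reported_overall = 1.2242

# ASSUMPTION: Overall is the unweighted mean of the evaluator scores.
# The source does not state the aggregation rule.
overall = mean(evaluator_scores.values())

# The displayed scores are rounded, so compare with a small tolerance.
matches = abs(overall - reported_overall) < 0.005
print(f"mean of displayed scores: {overall:.4f}, reported: {reported_overall}, consistent: {matches}")
```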
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the kitchen table, anchoring the corners with half-empty beer cans. Bill leaned over it, squinting like a man reading a ransom note written in crayon. Which, technically, he was.
"You got this from where, again?"
"Timmy's older brother. Cost me forty bucks and a vape pen."
"Molly. Molly, sweetheart. This is a drawing of a house. With a dragon in the backyard."
"The dragon's artistic license. Focus on the party zone."
Bill jabbed a finger at a lopsided rectangle. "And this? What's this blob?"
"Bouncy castle."
"And this blob?"
"Also bouncy castle. There's two. Timmy's dad is going through a divorce and overcompensating."
"Beautiful. A man after my own heart." Bill straightened up, cracking his back. "Okay, walk me through it."
"Party starts at two. Presents get stacked on the dining room table by two-fifteen. Cake at three. Piñata at three-thirty. That's our window."
"Why the piñata?"
"Bill, think. Twenty kids, blindfolded, swinging a bat in a confined space. Every adult in that house is going to be watching that piñata like it owes them money. Nobody's looking at the presents."
"Huh." Bill scratched his chin. "That's actually smart."
"I have my moments."
"What about the clown?"
"What clown?"
"There's always a clown, Molly. Birthday party, kid turning seven, you better believe there's a clown. Clowns are wild cards. Unpredictable. I knew a guy got bit by one in Reno."
"There's no clown. I checked the Evite."
"You saw the Evite?"
"Timmy's brother forwarded it. I'm RSVP'd as Aunt Deborah."
Bill stared at her. "You are something else."
"I'm a professional." She tapped the blueprint. "You come in through the side gate dressed as the magician. I'm already inside being Aunt Deborah. When the piñata drops, you distract with a card trick, I load the presents into the minivan."
"Why do I have to be the magician?"
"Because you look like a guy who owns a cape."
"That's fair."
"Also, you actually do own a cape."
"Separate issue."
Bill pulled a notepad from his pocket. "Escape route?"
"Down Maple, left on Sycamore, ditch the van behind the Home Depot, switch to the Civic. We're home eating the spoils by five."
"The spoils being..."
"Bill. It's a seven-year-old's birthday. We're looking at Legos, maybe a Nintendo Switch, gift cards, possibly cash in birthday envelopes from the grandparents. One kid at my nephew's party got a savings bond."
"A savings bond." Bill's eyes went misty. "God bless boomer grandparents."
"So we're in?"
He rolled up the blueprint and tucked it under his arm, stepping back with the grave dignity of a man about to commit a federal crime against a minor. "Molly, I haven't robbed a child since the Santa job of '09. I'm a little rusty."
"You'll be fine."
"Should I bring the cape?"
"Bill. Always bring the cape."
created_at: "2026-04-24T19:42:21.584Z" (4/24/2026, 7:42:21 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 836
moderation_latency: (null)
generation_time: 16361
tokens_prompt: 77
tokens_completion: 768
native_tokens_prompt: 98
native_tokens_completion: 1087
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.027665
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "ddb53d72-f9d4-45d9-a46f-124c821b4728"
    is_byok: false
    latency: 836
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777059741-i3EWwGmGl7tdwLaKCnHH"
session_id: (null)
api_type: "completions"
id: "gen-1777059741-uoeIKVvIptAFEbbHKPL8"
upstream_id: "ddb53d72-f9d4-45d9-a46f-124c821b4728"
total_cost: 0.027665
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.49% | Matches word count | n/a | neededClean: false, words: 473
1.96% | Dialogue to Total Word Ratio | Ratio: 78.87%, Deviation: 8.87% | neededClean: false, wordsTotal: 478, wordsDialogue: 377
1.2242% | Overall
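The reported ratio and deviation for the dialogue evaluator can be reproduced from the wordsTotal and wordsDialogue counts above. The following is a minimal sketch, not the benchmark's own code: the function names are hypothetical, the 70% target is taken from the scenario text, and treating the deviation as an absolute difference in percentage points is an assumption. How the 1.96% score is derived from the deviation is not stated on the page, so it is not modeled here.

```python
# Target share of dialogue words, from the scenario "Write 500 words with 70% dialogue".
TARGET_RATIO = 0.70


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def deviation_points(ratio: float, target: float = TARGET_RATIO) -> float:
    """Distance from the target, in percentage points.

    ASSUMPTION: the evaluator reports an absolute difference.
    """
    return abs(ratio - target) * 100


# Counts reported in the Meta Data column for this run.
ratio = dialogue_ratio(words_dialogue=377, words_total=478)
summary = f"Ratio: {ratio:.2%}, Deviation: {deviation_points(ratio):.2f}%"
print(summary)
```

With the counts from this run, the printed line matches the Details column: 377 of 478 words is about 78.87%, which is 8.87 percentage points above the 70% target.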