NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZV9AMNBVK69VQ7DF2845YG
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
15.9s
Cost:
$0.027765
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
14.30%
Matches word count
(details)
36.50%
Dialogue to Total Word Ratio
(details)
25.3997%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unfolded the blueprints across the hood of the rusted Buick, pinning down the corners with a half-eaten churro and three loose batteries. The paper crinkled in the late afternoon breeze, threatening to take flight like a startled pigeon. Bill leaned in, squinting through glasses held together with a piece of chewed gum and desperate hope. "This is the Pemberton residence," Molly said, tapping the center of the page. "Seven-year-old's birthday party starts at two. Bouncy castle arrives at one-thirty." "What's the theme?" "Dinosaurs." Bill whistled low. "Dinosaurs means foil balloons. Foil balloons mean static electricity. We go in wearing polyester, we light up like a Christmas tree." Molly nodded gravely, as if he'd just quoted scripture. She'd been working with Bill for six years now, and she'd learned to take his party-related concerns seriously. The man had once been tackled by a clown at a bar mitzvah. He didn't talk about it, but the scar on his elbow did. "Cotton only," she agreed. "And no squeaky shoes. The gift table is here, by the piƱata. Mrs. Pemberton is loaded. We're talking name-brand LEGO sets. American Girl dolls. Possibly a Nintendo Switch." "A Switch," Bill breathed, his eyes going misty. He'd always wanted one, though he'd never admit it out loud, because a fifty-three-year-old man with a neck tattoo of a skeleton had certain standards to maintain. The plan was simple, which is to say it was ludicrous and would almost certainly result in arrest. Bill would infiltrate as Uncle Gary, a relative nobody remembered inviting but everyone was too polite to question. Molly would pose as the face-painting lady, a vocation she'd prepared for by watching a YouTube tutorial at 1.5x speed. "What if the real Uncle Gary shows up?" "There is no real Uncle Gary," Molly said. "I checked the family tree on three ancestry sites. There's a Great Aunt Gertrude, but she's dead, and also a nun, so probably not coming." "Probably?" "Ninety-two percent probably." Bill chewed his lip. He pulled a small notebook from his jacket and began writing, his handwriting the frantic scrawl of a man who had once tried to rob a bank with a water pistol and still felt he'd been unfairly judged for it. "Extraction route?" "Ice cream truck. I've got Rico driving. He'll circle the block playing 'The Entertainer' until we flash the signal." "Which is?" "Two thumbs up and a sneeze." "Why a sneeze?" "Adds authenticity." Bill accepted this with the resigned sigh of a man who had long ago stopped questioning Molly's creative flourishes. He rolled up the blueprints, tucking them under his arm like a baton of mischief. The sun was sinking behind the strip mall, gilding the dumpsters in gold. "Molly," he said, quietly, "we're about to steal from children." "Rich children." "Still children." She considered this for a long moment, then shrugged. "They'll build character."
created_at
"2026-04-24T13:36:21.667Z"
4/24/2026, 1:36:21 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1119
moderation_latency
(null)
generation_time
15811
tokens_prompt
77
tokens_completion
767
native_tokens_prompt
98
native_tokens_completion
1091
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.027765
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"6b483790-0938-4455-9d3e-e614c6af05ff"
is_byok
false
latency
1119
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037781-p6RrxBvy9M5kk75uA8bQ"
session_id
(null)
api_type
"completions"
id
"gen-1777037781-L0WMgGFDjCaFXJue9Zh3"
upstream_id
"6b483790-0938-4455-9d3e-e614c6af05ff"
total_cost
0.027765
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
14.30%
Matches word count
n/a
neededClean
false
words
479
36.50%
Dialogue to Total Word Ratio
Ratio: 34.49%, Deviation: 4.49%
neededClean
false
wordsTotal
490
wordsDialogue
169
25.3997%