NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTSTK0TX6GTVKJR0CAVCRH
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
14.3s
Cost:
$0.026240
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
68.10%
Matches word count
(details)
0.33%
Dialogue to Total Word Ratio
(details)
34.2152%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the sticky diner table, narrowly avoiding a puddle of maple syrup that had been there, by Bill's estimation, since the Carter administration. The blueprint was, in fact, a hand-drawn map on the back of a Chuck E. Cheese placemat, but Molly had insisted on calling it a blueprint ever since she'd watched Ocean's Eleven for the fourth time last Tuesday. "The target," she announced, tapping the map with a crayon, "is the Henderson residence. Saturday, fourteen hundred hours. Little Tommy Henderson turns seven." Bill squinted at the drawing. "Why is the house shaped like a potato?" "It's not a potato, it's the house." "It's a potato, Molly." "It's reconnaissance art, Bill. I was sketching from the bushes and a sprinkler went off." She crossed her arms. "Do you want in on this or not?" Bill stirred his coffee with the wrong end of a pen. He had been Molly's partner for six years, and in that time they had successfully stolen exactly one thing: a garden gnome, which Molly had returned a week later because it kept staring at her while she slept. The rest of their capers had ended variously in sprained ankles, misdemeanor charges, and one memorable incident involving a petting zoo that neither of them spoke about anymore. "Presents, Bill," Molly continued, eyes glittering. "Think about it. Upper middle class neighborhood. Grandparents flying in from out of state. We're talking PlayStations. Tablets. Those little hoverboard things that catch fire." "Those catch fire, Molly." "Then we fence them quick." He sighed and picked up the crayon. "What's the plan?" "Simple. We show up in disguise. Blend in. Identify the gift table. Extract the loot during cake time when every adult in that house will be staring at a seven-year-old trying to blow out candles with the lung capacity of a damp kazoo." "Disguise how?" Molly produced, from a grocery bag at her feet, two enormous fuzzy costumes. One was a purple dinosaur of questionable copyright status. The other appeared to be a dog, or possibly a bear, or possibly a manifestation of something that had gone wrong in a factory overseas. "I'm not wearing that," Bill said. "You are the dog." "That is not a dog." "Children's entertainment, Bill. The mother booked a character performer. I called and canceled. We are the replacements." "You called and canceled a child's birthday entertainment." "For the greater good." Bill held up the brown costume. One of its eyes was noticeably higher than the other, giving it the expression of something that had recently been electrocuted. He thought about his life choices. He thought about his mother, who had wanted him to be a dentist. He thought about how much a PlayStation went for these days. "Fine," he said. "But I get first pick of the loot bags." Molly grinned and raised her coffee cup. "To crime." "To crime," Bill muttered, and clinked.
created_at
"2026-04-24T13:27:53.708Z"
4/24/2026, 1:27:53 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1397
moderation_latency
(null)
generation_time
14261
tokens_prompt
77
tokens_completion
762
native_tokens_prompt
98
native_tokens_completion
1030
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02624
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"6a33ff63-7c01-4820-854d-eb24885338e7"
is_byok
false
latency
1396
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037273-d8Q5B0JRryBp8KE0YTNU"
session_id
(null)
api_type
"completions"
id
"gen-1777037273-N3LKG4JMMv04fSAGRJek"
upstream_id
"6a33ff63-7c01-4820-854d-eb24885338e7"
total_cost
0.02624
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
68.10%
Matches word count
n/a
neededClean
false
words
486
0.33%
Dialogue to Total Word Ratio
Ratio: 40.70%, Deviation: 10.70%
neededClean
false
wordsTotal
489
wordsDialogue
199
34.2152%