NC Bench
Run: 2026-04-24-01KQ0G8KF42BGSA6SD2S5RS8NY
Run Details
Model: Claude Opus 4.7 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 14.4s
Cost: $0.026140
Provider: Amazon Bedrock
Evaluation summary
Score      Evaluator
99.92%     Matches word count
0.00%      Dialogue to Total Word Ratio
49.9609%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's smelled like burnt coffee and regret, which Molly felt was appropriate for the caliber of crime they were about to commit. She spread the blueprints across the table, knocking over a syrup dispenser in the process. "Those aren't blueprints," Bill said. "That's a placemat with a maze on it." "It's the same maze, Bill. The suburbs are just one big maze designed to crush the human spirit." She tapped a syrup-stained corner. "This is the Hendersons' house. Saturday. Two o'clock. Tobias Henderson turns seven." Bill squinted. "We're robbing a seven-year-old." "We're robbing twenty seven-year-olds. There's a difference, and the difference is math." He leaned back, running a hand through hair that had seen better decades. The fluorescent light above them buzzed like it was trying to warn him. Molly had that look again, the one she'd had right before the goat incident, right before the wedding incident, right before the incident at the aquarium that they had both agreed never to discuss. "What's the play?" he finally said. "Bouncy castle." "What about it?" "I'm inside it." Bill waited. When nothing else came, he said, "That's the whole plan?" "That is the first act of the plan, Bill, and if you'd let me finish, I would tell you the second act, which involves a clown." "Absolutely not." "You're the clown." "Molly." "You already own the shoes." "I own them for personal reasons," Bill said, with the dignity of a man who had lost this argument many times before. He took a long pull of his coffee. "And the presents?" "Gift table is by the sliding door. While the kids are beating the piñata, you distract the parents with balloon animals—" "I only know the snake." "—with balloon snakes, and I come out of the bouncy castle with a garbage bag and clear the table. Ninety seconds, tops. We're in the minivan before anyone notices the gifts are gone, and by then Tobias is elbow-deep in a Spider-Man piñata and couldn't care less." "His parents will care." 
"His parents have a Peloton and a koi pond. They'll be fine." Bill stared at the placemat. A small voice in his head, which sounded suspiciously like his mother, told him this was the moral event horizon, the actual line in the sand, the bright shining red stop sign of his criminal career. A louder voice, which sounded like rent, told him to shut up. "What kind of presents does a seven-year-old even get?" he asked. Molly's eyes sparkled in a way that was either inspirational or diagnosable. "Nintendo Switches. Bill. Every single kid now shows up with a Nintendo Switch. It's an arms race out there. Twenty Switches, three hundred bucks a pop, we walk away with six grand in plastic." Bill closed his eyes. He could already feel the wig. "Fine," he said. "But I'm not doing the voice." "You're doing the voice." "I'm doing the voice," he agreed, and signaled for the check.
created_at: "2026-04-24T19:42:58.024Z" (4/24/2026, 7:42:58 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 669
moderation_latency: 264
generation_time: 14036
tokens_prompt: 77
tokens_completion: 770
native_tokens_prompt: 98
native_tokens_completion: 1026
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.02614
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "a69e92f7-19f5-4f53-8f74-c62920352675"
    is_byok: false
    latency: 669
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777059778-F4tQZ76Yjm50AaVrVQF1"
session_id: (null)
api_type: "completions"
id: "gen-1777059778-hBo5tXfTvQklkNCfivYV"
upstream_id: "a69e92f7-19f5-4f53-8f74-c62920352675"
total_cost: 0.02614
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
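The reported cost can be sanity-checked against the native token counts. The sketch below is only a consistency check: the per-token prices are assumptions (they do not appear anywhere in this report), chosen because they happen to reproduce the reported total_cost of 0.02614. It also assumes that billing uses native_tokens_prompt and native_tokens_completion rather than the smaller tokens_prompt and tokens_completion values.

```python
# Token counts copied from the run metadata above.
NATIVE_TOKENS_PROMPT = 98
NATIVE_TOKENS_COMPLETION = 1026
REPORTED_TOTAL_COST = 0.02614

# ASSUMPTION: prices in USD per million tokens. These are not stated in the
# report; they are illustrative values that happen to match the reported cost.
ASSUMED_PRICE_IN_PER_MTOK = 5.00
ASSUMED_PRICE_OUT_PER_MTOK = 25.00


def estimate_cost(tokens_in: int, tokens_out: int,
                  price_in_per_mtok: float, price_out_per_mtok: float) -> float:
    """Return an estimated request cost in USD from token counts and prices."""
    cost_in = tokens_in * price_in_per_mtok / 1_000_000
    cost_out = tokens_out * price_out_per_mtok / 1_000_000
    return cost_in + cost_out


estimated_cost = estimate_cost(
    NATIVE_TOKENS_PROMPT,
    NATIVE_TOKENS_COMPLETION,
    ASSUMED_PRICE_IN_PER_MTOK,
    ASSUMED_PRICE_OUT_PER_MTOK,
)
print(f"estimated: ${estimated_cost:.6f}  reported: ${REPORTED_TOTAL_COST:.6f}")
```

Under these assumed prices, almost all of the cost comes from the 1026 completion tokens; the 98 prompt tokens contribute well under a tenth of a cent.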
Evaluation details
Result: 99.92%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 497

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 44.47%, Deviation: 14.47%
Meta Data: neededClean: false, wordsTotal: 506, wordsDialogue: 225

Overall: 49.9609%
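The ratio and deviation figures follow directly from the reported word counts, as the short sketch below shows. Two parts are assumptions rather than anything the report states: that the deviation is the absolute difference from the 30% target in the scenario, and that the overall score is the plain mean of the two evaluator scores. The report does not say how a deviation is converted into a score, so that step is left out.

```python
# Values copied from the evaluation details above.
WORDS_DIALOGUE = 225
WORDS_TOTAL = 506
TARGET_DIALOGUE_PCT = 30.0   # from the scenario: "30% dialogue"
WORD_COUNT_SCORE = 99.92     # "Matches word count" result
DIALOGUE_SCORE = 0.00        # "Dialogue to Total Word Ratio" result
REPORTED_OVERALL = 49.9609


def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the total, as a percentage."""
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float) -> float:
    """ASSUMPTION: deviation is the absolute distance from the target, in points."""
    return abs(ratio_pct - target_pct)


ratio = dialogue_ratio_pct(WORDS_DIALOGUE, WORDS_TOTAL)
deviation = deviation_pct(ratio, TARGET_DIALOGUE_PCT)

# ASSUMPTION: the overall score is the unweighted mean of the evaluator scores.
overall_estimate = (WORD_COUNT_SCORE + DIALOGUE_SCORE) / 2

print(f"ratio: {ratio:.2f}%  deviation: {deviation:.2f}%")
print(f"overall (mean of scores): {overall_estimate:.4f}%  reported: {REPORTED_OVERALL}%")
```

The mean of the displayed scores agrees with the reported overall value to two decimal places; the remaining difference is consistent with the word count score being rounded for display. The two evaluators also report different totals (497 and 506 words), which suggests they count words differently.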