NC Bench
Run: 2026-04-24-01KPZV694Y14VTCGB79SYS60WZ
Run Details
Model: Claude Opus 4.7
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 16.1s
Cost: $0.028815
Provider: Amazon Bedrock
Evaluation summary
Score      Evaluator
93.65%     Matches word count
0.00%      Dialogue to Total Word Ratio
46.8248%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky diner table, pinning the corners with a ketchup bottle, a saltshaker, and two half-eaten pancakes. Bill leaned in, squinting through his reading glasses, which were held together with a paperclip and considerable optimism. "Okay," Molly said, tapping the paper. "The target is a bounce house. Specifically, the bounce house at 14 Maplewood Drive, this Saturday, at precisely two o'clock in the afternoon." Bill blinked. "You realize this is a seven-year-old's birthday party." "I realize that." "And you realize we're grown adults." "Bill. Focus." He sighed and peered at the diagram, which Molly had meticulously labeled in colored pencil. There was a gift table marked in red, a piñata marked in yellow, and a suspicious cluster of X's near the snack zone. "What are the X's?" "Clowns." "Clowns?" "Two of them. Professionally hired. We have to assume they're armed." "With what, Molly? Balloon swords?" "I've seen what a balloon animal can do in the wrong hands." Bill rubbed his temples. He had robbed a jewelry store in Zurich. He had once out-maneuvered Interpol in a gondola. And now he was being briefed on clown neutralization by a woman eating a pancake with her fingers. "Fine," he said. "Walk me through it." Molly grinned and produced a second sheet. "Two o'clock, the parents gather the kids for musical chairs. That's our window. The gift table will be unattended for approximately four minutes. You enter through the side gate dressed as Uncle Greg." "Who's Uncle Greg?" "Doesn't matter. Every family has an Uncle Greg. Nobody ever questions Uncle Greg. He just shows up, eats shrimp, and leaves." "I don't look like an Uncle Greg." "You absolutely look like an Uncle Greg." Bill decided not to fight this one. "Meanwhile," Molly continued, "I'll be on the roof of the garage with the duffel bag. You toss the presents up to me, one by one, and I load the getaway wagon." "The getaway wagon." "It's a Radio Flyer. I painted it matte black." "Of course you did." "Stealth, Bill. Stealth is everything." He stared at her. "What's the score here, Molly? What are we actually looking at? A Nintendo? Some Legos? A gift card to Build-A-Bear?" Molly leaned forward, her eyes gleaming with the specific intensity of a woman who had made several poor life choices and intended to make one more. "The birthday boy's grandfather," she whispered, "is Reginald Worthington the Third." Bill straightened. "The Worthington?" "The very one. And word on the street is that grandpa's gift is wrapped in silver paper with a blue bow. Inside? A vintage Mickey Mantle rookie card. Mint condition. Six figures, easy." Bill was quiet for a long moment. Then he reached for his coffee, took a slow, thoughtful sip, and set the mug down with the gentle finality of a man abandoning his last shred of dignity. "Alright," he said. "I'll be Uncle Greg." Molly beamed. "I'll bring the wagon."
created_at
"2026-04-24T13:34:41.836Z"
4/24/2026, 1:34:41 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
674
moderation_latency
(null)
generation_time
16026
tokens_prompt
77
tokens_completion
790
native_tokens_prompt
98
native_tokens_completion
1133
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.028815
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"9aa758ff-bf4a-4ad0-965c-05160449f6a1"
is_byok
false
latency
670
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037681-CAzTlah7ttzoOVSOBv0b"
session_id
(null)
api_type
"completions"
id
"gen-1777037681-jtW5I6w2CC0F23z4KQD4"
upstream_id
"9aa758ff-bf4a-4ad0-965c-05160449f6a1"
total_cost
0.028815
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
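The timing, token, and cost fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not NC Bench's or the provider's own calculation. It assumes generation_time is in milliseconds (consistent with the 16.1s duration under Run Details). The per-million-token rates are hypothetical values chosen for illustration; they happen to reproduce the reported total_cost but are not stated anywhere on this page.

```python
# Values copied from the Meta fields above.
NATIVE_TOKENS_PROMPT = 98
NATIVE_TOKENS_COMPLETION = 1133
GENERATION_TIME_MS = 16026  # assumed to be milliseconds
TOTAL_COST_USD = 0.028815

# Hypothetical rates in USD per million tokens (an assumption, not from the page).
ASSUMED_INPUT_RATE = 5
ASSUMED_OUTPUT_RATE = 25


def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Completion tokens generated per second of generation time."""
    return completion_tokens / (generation_time_ms / 1000)


def estimated_cost(prompt_tokens: int, completion_tokens: int,
                   input_rate: int, output_rate: int) -> float:
    """Cost in USD given per-million-token rates.

    tokens * (USD per million tokens) yields micro-dollars, so the sum is
    kept as an integer and divided once at the end.
    """
    micro_dollars = prompt_tokens * input_rate + completion_tokens * output_rate
    return micro_dollars / 1_000_000


throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
cost = estimated_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION,
                      ASSUMED_INPUT_RATE, ASSUMED_OUTPUT_RATE)

print(f"throughput: {throughput:.1f} tokens/s")
print(f"estimated cost: ${cost:.6f} (reported: ${TOTAL_COST_USD:.6f})")
```

The calculation uses the native token counts rather than tokens_prompt and tokens_completion, on the assumption that billing follows the provider's own tokenizer.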
Evaluation details
Result    Evaluator                     Details                            Meta Data
93.65%    Matches word count            n/a                                neededClean: false; words: 491
0.00%     Dialogue to Total Word Ratio  Ratio: 56.45%, Deviation: 26.45%   neededClean: false; wordsTotal: 496; wordsDialogue: 280
46.8248%  Overall
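The ratio and deviation reported for the dialogue evaluator follow from the two word counts in its metadata. The sketch below reproduces them under two assumptions that this page does not document: that the deviation is the absolute difference between the measured ratio and the 30% target from the scenario, and that Overall is the unweighted mean of the two evaluator scores. How each evaluator turns its measurement into a score is not shown here and is not modeled.

```python
# Values copied from the evaluation details above.
WORDS_TOTAL = 496
WORDS_DIALOGUE = 280
TARGET_RATIO = 0.30      # "30% dialogue" from the scenario
SCORES = [93.65, 0.00]   # evaluator scores as displayed, in percent


def dialogue_ratio(dialogue_words: int, total_words: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return dialogue_words / total_words


ratio = dialogue_ratio(WORDS_DIALOGUE, WORDS_TOTAL)
deviation = abs(ratio - TARGET_RATIO)   # assumed definition
overall = sum(SCORES) / len(SCORES)     # assumed: unweighted mean

print(f"ratio: {ratio:.2%}")
print(f"deviation: {deviation:.2%}")
print(f"overall: {overall:.4f}%")
```

The mean of the displayed scores is 46.825, slightly above the reported 46.8248%. That gap would be consistent with the word-count score being rounded for display, though that too is an assumption.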