NC Bench
Run: 2026-04-24-01KQ0G6Y2HRAAMDEGPXCCC7S6X
Run Details
Model: Claude Opus 4.7 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.4s
Cost: $0.028465
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
20.19% | Matches word count
0.02% | Dialogue to Total Word Ratio
10.1043% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth at Denny's had seen better plans, but none more ambitious. Molly spread the blueprints across the sticky table, weighing down the corners with a ketchup bottle, a saltshaker, and two half-eaten Moons Over My Hammy. "This," she said, tapping the paper, "is the Hendersons' backyard. Bounce house here. Cake table here. Present mountain—right here, under the magician's table." Bill squinted. He'd drawn the map himself on a placemat, and even he couldn't read it. A brown smear obscured most of the patio. "What's that blob?" "Gravy. Or the dog. I forget." "They have a dog?" "Bill." He raised his hands in surrender and slurped his coffee. Outside, rain tapped against the window in a rhythm that felt vaguely judgmental. They were about to rob a seven-year-old named Tyler, and somewhere, deep in Bill's chest, a very small conscience was clearing its throat. "Remind me again," he said, "why we're doing this." "Because rich kids get too much stuff. It's practically charity." "It's practically a felony." "Those aren't mutually exclusive." Molly produced a laminated party invitation—Paw Patrol themed, glitter still clinging to her thumb. She'd lifted it from the Hendersons' mailbox three weeks ago, back when this was just a daydream muttered over microwave burritos. Now it was Operation Birthday Bandit, and Bill was wearing a fake mustache in the parking lot as practice. "Two o'clock, the magician arrives," she said. "Two-fifteen, he does the rabbit bit. Two-twenty, every adult is filming their child's reaction. That's our window." "How long's the rabbit bit?" "Eleven minutes, according to his Yelp reviews." "You Yelped the magician." "I'm thorough, Bill." He chewed the end of his straw. "What about security?" "One nanny. Early twenties. Addicted to her phone. Also, a clown." "A clown is not security." "Have you ever tried to tackle a clown? The shoes alone." Bill conceded the point. He'd tackled a mime once in '09 and still didn't like to talk about it. 
Molly slid him a tote bag from under the table. It was pink and said WORLD'S OKAYEST AUNT. "Camouflage," she explained. "You'll blend in with the other relatives. Grab presents, stuff the bag, waddle out to the van." "Waddle?" "Casually. Like someone carrying a casserole." He practiced his casserole waddle in the booth. It was not convincing. The waitress, topping off his coffee, gave him the kind of look usually reserved for men on watchlists. "One more thing," Molly said, leaning in. Her voice dropped, conspiratorial, sacred. "No matter what happens—no matter how loud Tyler cries—we do not return the Lego set. Are we clear?" Bill nodded, solemn as a priest. "Clear." She folded the blueprint. Outside, the rain stopped, and a thin ribbon of sunlight slid across their table like a spotlight cueing the curtain. "To childhood," Molly said, raising her mug. "To childhood," Bill echoed, and meant it, mostly.
created_at
"2026-04-24T19:42:03.35Z"
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
705
moderation_latency
(null)
generation_time
15378
tokens_prompt
77
tokens_completion
786
native_tokens_prompt
98
native_tokens_completion
1119
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.028465
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"a99a3350-f743-4c11-b88e-124ec9ffe9fc"
is_byok
false
latency
703
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059723-giBeczJL7J6RT9NL4ebp"
session_id
(null)
api_type
"completions"
id
"gen-1777059723-recxIphm8BvJzxr2fXsJ"
upstream_id
"a99a3350-f743-4c11-b88e-124ec9ffe9fc"
total_cost
0.028465
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
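The reported total_cost can be cross-checked against the native token counts above. The following is a minimal sketch, assuming per-million-token rates of $5 for input and $25 for output. These rates are not stated anywhere in the run record; they are an assumption chosen only because they reproduce the reported figure, and the function name is hypothetical.

```python
# Native token counts as reported in the run metadata.
NATIVE_TOKENS_PROMPT = 98
NATIVE_TOKENS_COMPLETION = 1119

# ASSUMPTION: USD per million tokens. Not stated in the run record.
ASSUMED_INPUT_RATE = 5
ASSUMED_OUTPUT_RATE = 25


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  input_rate: int = ASSUMED_INPUT_RATE,
                  output_rate: int = ASSUMED_OUTPUT_RATE) -> float:
    """Return the estimated cost in USD for one request."""
    # Work in millionths of a dollar first to keep the arithmetic in integers.
    micro_usd = prompt_tokens * input_rate + completion_tokens * output_rate
    return micro_usd / 1_000_000


cost = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
print(f"${cost:.6f}")
```

Under these assumed rates the estimate agrees with the reported cost of $0.028465, with nearly all of it coming from completion tokens.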
Evaluation details
Result | Evaluator | Details | Meta Data
20.19% | Matches word count | n/a | neededClean: false, words: 480
0.02% | Dialogue to Total Word Ratio | Ratio: 43.09%, Deviation: 13.09% | neededClean: false, wordsTotal: 485, wordsDialogue: 209
10.1043% | Overall
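The ratio and deviation in the details can be reproduced from the reported word counts. The sketch below is illustrative only: it assumes dialogue means text between straight double quotes, that words are whitespace-separated, and that deviation is the absolute distance from the 30% target in the scenario. The benchmark's actual counting rules and the formula that turns the deviation into the 0.02% score are not given on this page, so neither is modelled here. All function names are hypothetical.

```python
import re

TARGET_RATIO = 0.30  # the scenario asks for 30% dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated words (an assumed definition of 'word')."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (an assumed definition of dialogue)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute deviation from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation details: wordsDialogue and wordsTotal.
ratio, deviation = ratio_and_deviation(209, 485)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With 209 dialogue words out of 485, the response overshoots the requested 30% by roughly 13 percentage points, which matches the reported details line.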