NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTX2DKCXXKCS15FG3BC24Y
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
15.7s
Cost:
$0.029015
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
6.09%
Matches word count
(details)
0.48%
Dialogue to Total Word Ratio
(details)
3.2859%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the card table and tapped it with a chipped fingernail. The paper was a hand-drawn map of the Hendersons' backyard, complete with a stick figure labeled "CLOWN (ARMED?)." "Okay, Bill. Walk me through it one more time." Bill cracked his knuckles with the solemnity of a surgeon. "At fourteen hundred hours, Timmy Henderson turns seven." "Tragic." "At fourteen-thirty, presents get stacked on the gift table near the bouncy castle." "And that's when we strike." "That's when we strike." Molly leaned back and folded her arms. "Bill. We're stealing from a child." "We're stealing from a child's *parents*, technically. Those gifts haven't been legally transferred yet. It's a gray area." "Oh, it's a gray area now?" "Very gray. Charcoal, even." She sighed. "Fine. What's our cover?" "I'm the magician. You're my lovely assistant." "Absolutely not." "Molly—" "I'm not wearing sequins, Bill. We agreed. After the cruise ship job, no more sequins." "Fine. You're the magician. I'll be the lovely assistant." "You're six-foot-four and you have a neck tattoo of a scorpion." "Kids love scorpions." "Kids are *terrified* of scorpions, Bill, that's the whole point of a scorpion." He rubbed his jaw. "Okay, new plan. Balloon animals. I do balloons, you case the gift table, and when the piñata goes up—" "The piñata." "The piñata is our diversion. Every kid at that party will be swinging a bat with a blindfold on. Total chaos. We grab the presents, we go out through the side gate, we're on the 405 before they find the Tootsie Rolls." Molly stared at him. "You want to commit grand larceny during a piñata." "I want to commit grand larceny during the *perfect* piñata. There's a difference." "What about the clown?" "What about him?" "You wrote 'armed' with a question mark, Bill. I need you to resolve that question mark before we go in." Bill scratched his head. "Look, I called the party rental place. Pretending to be a concerned grandmother. The clown's name is Sprinkles and he's a retired accountant." "An accountant." "From Glendale." "So he's not armed." "He's armed with *balloons*, Molly. And, like, emotional damage, probably. Accountants, you know." She folded the blueprint in half, then in half again. "One more question." "Shoot." "What do we do with the presents? They're gonna be, what, a light-up yo-yo? A stuffed giraffe? We can't fence a stuffed giraffe." Bill grinned, the slow grin he got before every disaster. "Molly. Have you seen what kids get these days? iPads. Drones. One of those little Peloton things for toddlers." "Toddlers have Pelotons?" "Rich toddlers do. And Timmy Henderson's dad is a dentist." She was quiet for a long moment. Then she picked up the blueprint, tucked it into her jacket, and stood. "Bill." "Yeah?" "If I have to punch a clown named Sprinkles, I'm taking it out of your cut." "Deal."
created_at
"2026-04-24T13:29:40.033Z"
4/24/2026, 1:29:40 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
676
moderation_latency
(null)
generation_time
15636
tokens_prompt
77
tokens_completion
797
native_tokens_prompt
98
native_tokens_completion
1141
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.029015
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"40501280-985a-4a9d-a5d3-3d8fc78300ca"
is_byok
false
latency
676
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037380-BSMH8sNUktmNOus3GTFY"
session_id
(null)
api_type
"completions"
id
"gen-1777037380-plHudW4WJSqDf6ItKi4v"
upstream_id
"40501280-985a-4a9d-a5d3-3d8fc78300ca"
total_cost
0.029015
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
6.09%
Matches word count
n/a
neededClean
false
words
477
0.48%
Dialogue to Total Word Ratio
Ratio: 80.33%, Deviation: 10.33%
neededClean
false
wordsTotal
483
wordsDialogue
388
3.2859%