NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZV3HEJ1SBKEM3W8KCYZCNS
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
14.2s
Cost:
$0.026115
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
98.21%
Dialogue to Total Word Ratio
(details)
50.9185%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the folding card table, weighing down the corners with a juice box, a half-eaten granola bar, and two decorative ceramic frogs she'd stolen from a dentist's office last Tuesday. The fourth corner kept curling up, so she placed her elbow on it and glared at Bill, who was eating cereal directly from the box. "This is the layout of the Pemberton residence," she said. "Twin boys. Turning seven. Combined guest list of approximately forty-two children." Bill crunched thoughtfully. "Forty-two kids is a lot of kids, Molly." "Forty-two kids is forty-two times the presents, Bill." He considered this, then nodded, because the math was compelling. The Pemberton twins, according to Molly's meticulous research, were being celebrated with a pirate-themed extravaganza featuring a bounce house, a balloon artist named Señor Flamingo, and a rented pony whose name was, inexplicably, Jeff. The cake alone cost more than Bill's car. The presents, stacked on a long buffet table near the rose garden, were rumored to include three Nintendo Switches, a remote-control helicopter, and something Molly's informant had described only as "the big one." "What's the big one?" Bill asked. "I don't know. That's what's exciting about it." "Could be a pony." "They already have a pony. His name is Jeff." "Could be another pony." Molly pinched the bridge of her nose. She had been working with Bill for six years, and roughly forty percent of that time had been spent explaining to Bill that not everything could be a pony. The plan, she explained, was elegant in its stupidity, which was the only kind of elegance Bill could reliably execute. They would arrive disguised as entertainers: Molly as a face painter, Bill as a clown. During the piñata sequence — which Molly had identified as the window of maximum chaos — they would slip around to the buffet table, load the presents into two enormous laundry bags disguised as pirate treasure sacks, and walk out the side gate to the waiting van, which Bill had painted to look like an ice cream truck. "The van plays music," Bill said, proudly. "The van plays one song, Bill, and it's the theme from Jaws." "Kids love Jaws." "Kids have never seen Jaws." Bill shrugged, unbothered, crunching another fistful of cereal. He had the unshakable confidence of a man who had never been correct about anything. Molly tapped the blueprint. "The key is Señor Flamingo. He's the only real professional in the building. If he makes us, we're done. He used to work for Cirque du Soleil." "What happened?" "Unclear. Something with a trapeze." Bill whistled, low and respectful. He respected anybody who had problems with a trapeze. "So we're clear on the plan?" "Face paint. Clown. Piñata. Presents. Jaws." "Perfect." Outside, somewhere in the distance, a child was already screaming about balloons, and Molly took it as a blessing.
created_at
"2026-04-24T13:33:12.031Z"
4/24/2026, 1:33:12 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
784
moderation_latency
(null)
generation_time
14091
tokens_prompt
77
tokens_completion
769
native_tokens_prompt
98
native_tokens_completion
1025
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.026115
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"84b040b7-c802-4a40-b054-96a016b81b75"
is_byok
false
latency
784
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037592-lt3ghkJQ9nZmifzRhOQA"
session_id
(null)
api_type
"completions"
id
"gen-1777037592-iEZpcMsLIc1jLchcr1xn"
upstream_id
"84b040b7-c802-4a40-b054-96a016b81b75"
total_cost
0.026115
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
476
98.21%
Dialogue to Total Word Ratio
Ratio: 29.40%, Deviation: 0.60%
neededClean
false
wordsTotal
483
wordsDialogue
142
50.9185%