NC Bench
Run: 2026-04-24-01KPZV5BTBNP9PE4PK0KYT0TVE
Run Details
Model: Claude Opus 4.7
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.2s
Cost: $0.026640
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
43.38% | Matches word count
0.53% | Dialogue to Total Word Ratio
21.9524% | Overall
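The page does not say how the Overall score is derived from the two evaluator scores. One plausible reading, offered here only as an assumption, is an unweighted mean. The sketch below checks whether the displayed numbers are consistent with that reading, allowing for the fact that the per-evaluator scores are shown rounded to two decimals.

```python
# Scores as displayed in the evaluation summary (rounded to 2 decimals on the page).
scores = {
    "Matches word count": 43.38,
    "Dialogue to Total Word Ratio": 0.53,
}
reported_overall = 21.9524

# ASSUMPTION (not stated on the page): Overall is the unweighted mean
# of the individual evaluator scores.
mean_score = sum(scores.values()) / len(scores)

# Each displayed score may differ from its unrounded value by up to 0.005,
# so the mean of the displayed scores may differ by up to 0.005 as well.
tolerance = 0.005
consistent = abs(mean_score - reported_overall) <= tolerance

print(f"mean of displayed scores: {mean_score:.4f}")
print(f"consistent with reported overall: {consistent}")
```

The check passes under this assumption, which suggests but does not prove that the benchmark averages its evaluators with equal weight.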
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
AI
Molly spread the blueprints across the sticky diner table, narrowly avoiding a puddle of syrup that had been there long enough to develop its own ecosystem. Bill leaned in, squinting at the hand-drawn map like it was the Dead Sea Scrolls, if the Dead Sea Scrolls had been illustrated by a seven-year-old with a grudge against architecture.
"This is the Henderson house," Molly said, tapping a lopsided rectangle. "Saturday at two. Theme is unicorns."
"Unicorns." Bill rubbed his stubble. "So we're stealing from a six-year-old."
"She's turning seven, Bill. Have some respect."
Bill had been in the business for twenty-three years. He'd robbed casinos, art galleries, and once, memorably, a yacht belonging to a Latvian pop star. He had not, until this moment, considered the ethical implications of taking an American Girl doll from a child named Madison.
"The presents pile up on the dining room table," Molly continued. "Average suburban birthday haul is eight hundred bucks, tax-free. We hit fifteen parties this summer, we clear twelve grand, easy."
"And how exactly do we get in? I don't think I blend with the under-four-foot crowd."
Molly grinned and slid a photograph across the table. It showed a limp, deflated costume of a purple dinosaur with one eye hanging off by a thread. The other eye conveyed decades of trauma.
"Absolutely not."
"Bill."
"Molly, no. I have dignity."
"You stole a commemorative spoon from a retirement home last month."
"That spoon had historical significance!"
She sipped her coffee, unmoved. "You're the entertainment. I'm the assistant. While Barney the Dinosaur distracts the kids with a sad little dance—"
"I will not dance."
"—I quietly load the loot into my magician's trunk and we walk out the front door to a round of applause."
Bill stared at the costume photograph. The dinosaur stared back, pleading for release from whatever hell it inhabited.
"What if a kid hugs me?"
"Then you hug them back, Bill. You are a professional."
"What if a kid cries?"
"Children cry constantly. It's their primary means of communication. Ignore it."
"What if the *parents* ask questions? They book these things months in advance. They'll know we're not the real Unicorn Princess Party Squad."
Molly reached into her purse and produced a laminated business card, complete with a glittery logo and a phone number that rang through to a burner in her glove compartment. She had a voicemail greeting recorded in what she called her "suburban mom" voice. It was unsettlingly convincing.
Bill exhaled slowly, the way a man does when he realizes his life has taken a turn he cannot explain to his mother.
"One question."
"Shoot."
"Why me in the dinosaur suit? Why not you?"
Molly folded the blueprints with the crisp efficiency of a woman who had already won the argument.
"Because, Bill," she said, standing and dropping a crumpled five on the table, "I look terrible in purple."
Meta
created_at: "2026-04-24T13:34:11.799Z" (4/24/2026, 1:34:11 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 637
moderation_latency: (null)
generation_time: 15124
tokens_prompt: 77
tokens_completion: 776
native_tokens_prompt: 98
native_tokens_completion: 1046
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.02664
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "441207ca-fa19-4644-8fce-f9cf2513f5a9"
    is_byok: false
    latency: 637
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777037651-CYNbcMFG0u2UFqxv0Qv5"
session_id: (null)
api_type: "completions"
id: "gen-1777037651-C6ZltJozrFqRWnhoqyaG"
upstream_id: "441207ca-fa19-4644-8fce-f9cf2513f5a9"
total_cost: 0.02664
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
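A few derived figures can be read off the metadata above. The sketch below assumes that generation_time is in milliseconds (which matches the 15.2s duration shown in the run details) and computes output throughput and a blended cost per 1,000 native tokens. It also checks the recorded cost against a pair of per-million-token rates; those rates do not appear anywhere on this page and are purely illustrative assumptions that happen to reproduce the total.

```python
# Values copied from the run metadata.
generation_time_ms = 15124      # ASSUMPTION: unit is milliseconds
native_tokens_prompt = 98
native_tokens_completion = 1046
total_cost = 0.02664            # USD

# Output throughput over the generation window.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Blended cost across prompt and completion tokens.
total_native_tokens = native_tokens_prompt + native_tokens_completion
cost_per_1k_tokens = total_cost / total_native_tokens * 1000

# ASSUMED rates (USD per million tokens), not taken from the page.
ASSUMED_INPUT_RATE = 5.0
ASSUMED_OUTPUT_RATE = 25.0
estimated_cost = (
    native_tokens_prompt * ASSUMED_INPUT_RATE
    + native_tokens_completion * ASSUMED_OUTPUT_RATE
) / 1_000_000

print(f"throughput: {tokens_per_second:.2f} tokens/s")
print(f"blended cost: ${cost_per_1k_tokens:.5f} per 1k native tokens")
print(f"estimated cost under assumed rates: ${estimated_cost:.5f}")
```

Note that the cost lines up with the native token counts (98 and 1046), not with the normalized tokens_prompt and tokens_completion fields (77 and 776), so any cost analysis of this kind should probably start from the native counts.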
Evaluation details
Result | Evaluator | Details | Meta Data
43.38% | Matches word count | n/a | neededClean: false; words: 483
0.53% | Dialogue to Total Word Ratio | Ratio: 40.24%, Deviation: 10.24% | neededClean: false; wordsTotal: 492; wordsDialogue: 198
21.9524% | Overall
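The ratio and deviation reported for the dialogue evaluator follow directly from the two word counts and the 30% target in the scenario. The sketch below reproduces that arithmetic. It also includes a hypothetical counter, count_dialogue_words, showing one simple way such counts could be obtained (words inside straight double quotes versus all whitespace-separated words). The benchmark's actual counting rules are not shown on this page, so that function is an illustration only, as are all names used here.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: "30% dialogue"


def dialogue_stats(words_dialogue, words_total, target=TARGET_RATIO):
    """Return (ratio, absolute deviation from target) as fractions."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


def count_dialogue_words(text):
    """Hypothetical counter, NOT the benchmark's evaluator.

    Counts words inside straight double quotes as dialogue and
    whitespace-separated tokens as the total.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    words_total = len(text.split())
    return words_dialogue, words_total


# Reproduce the reported figures from the evaluator's own counts.
ratio, deviation = dialogue_stats(198, 492)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Try the hypothetical counter on a short sample.
sample = 'Bill sighed. "I will not dance." Molly grinned.'
print(count_dialogue_words(sample))
```

A quote-based counter like this one would also count quoted phrases inside narration, such as "suburban mom" in the response above, as dialogue. Whether the real evaluator does so is not stated.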