NC Bench
Run: 2026-04-24-01KPZV4K80ZXQKRAKDYZ7FFR6Y
Run Details
Model: Claude Opus 4.7
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.8s
Cost: $0.027690
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator | Details Link
90.48% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
45.2419% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The coffee shop smelled like burnt espresso and desperation, which Molly felt was appropriate given the circumstances. She slid the blueprints across the sticky table, narrowly avoiding a puddle of what might have been syrup or might have been tears from the woman two booths over arguing with her ex-husband on speakerphone. Bill squinted at the paper. Then he turned it ninety degrees. Then another ninety. "Molly, this is a placemat from Denny's." "Other side." He flipped it, revealing a hand-drawn map of a suburban backyard, complete with a bounce house labeled DANGER ZONE and an X marked where a piñata presumably dangled like a papier-mâché treasure chest. "Okay," Bill said. "Walk me through it." "Party starts at two. Eight-year-old's birthday. Theme is pirates, which is poetic, if you think about it." "I'm not thinking about it." "The presents pile up on the gift table here." She tapped the map. "By three o'clock, we're looking at forty, maybe fifty wrapped gifts. Minimum six LEGO sets. Possibly a Nintendo Switch, because the kid's grandma is loaded and also, I suspect, trying to buy affection." Bill leaned back. His chair creaked in a way that suggested it, too, had given up. "Molly. I want to say something, and I want you to hear me." "Oh, God." "We are grown adults." "Uh-huh." "We are planning to rob children." "We are planning to rob *one* child," Molly corrected, "of presents belonging to a surplus he cannot possibly appreciate. Have you seen how kids open gifts now? They tear through them like raccoons in a dumpster. It's grotesque." "You're describing me at Christmas." "And look how you turned out." Bill considered this, then nodded, because it was fair. He picked up his mug, discovered it was empty, and set it back down with the dignified regret of a man who had peaked in high school. "What's the cover?" he asked. "Clowns." "No." "Hear me out—" "No, Molly. Last time we did clowns I was picking greasepaint out of my sideburns for a month. 
I looked like a sad raccoon. A dumpster raccoon, if you will." "Fine. Magicians." "Better." "You do the rabbit thing, I work the gift table, and by the time anyone notices the presents are gone, we're halfway to the storage unit." "And the parents?" "Distracted. Parents at kids' parties are either drunk or dissociating. Usually both." Bill tapped his fingers on the table. The couple behind them had escalated to yelling about a timeshare in Boca. Somewhere, an espresso machine screamed like a kettle being murdered. "One condition," he said finally. "We leave the birthday boy's present. Just his. From his parents." Molly stared. "Bill. You sentimental marshmallow." "I have a code." "You have a sticker book." "I have *a code*, Molly." She sighed, folded the placemat, and tucked it into her jacket. "Fine. One present stays. The rest we liberate." "Liberate," Bill repeated. "I love when you talk heist to me."
created_at: "2026-04-24T13:33:46.636Z" (4/24/2026, 1:33:46 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 629
moderation_latency: (null)
generation_time: 15729
tokens_prompt: 77
tokens_completion: 789
native_tokens_prompt: 98
native_tokens_completion: 1088
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.02769
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "002f53bf-aa50-4c45-853c-7664e8e1eded"
    is_byok: false
    latency: 629
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777037626-yZ42fgxQaJPWTSldo13b"
session_id: (null)
api_type: "completions"
id: "gen-1777037626-pv2TJKRJi3zyAZrQZ6ss"
upstream_id: "002f53bf-aa50-4c45-853c-7664e8e1eded"
total_cost: 0.02769
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
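The timing and token fields above can be combined into a rough throughput figure. The sketch below assumes that generation_time and latency are reported in milliseconds (which is consistent with the 15.8s duration shown in Run Details, but is not stated on the page) and that native_tokens_completion is the provider's own count of output tokens. The function name is illustrative only.

```python
# Values copied from the Meta fields of this run.
native_tokens_completion = 1088
generation_time_ms = 15729  # assumption: milliseconds
latency_ms = 629            # assumption: milliseconds until the first token


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average output rate over the elapsed generation time."""
    return tokens / (elapsed_ms / 1000.0)


throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"Output rate: {throughput:.1f} tokens/s")
print(f"Latency: {latency_ms / 1000.0:.3f} s")
```

Under those assumptions the run averages a little under 70 native output tokens per second.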
Evaluation details
Result | Evaluator | Details | Meta Data
90.48% | Matches word count | n/a | neededClean: false, words: 490
0.00% | Dialogue to Total Word Ratio | Ratio: 57.37%, Deviation: 27.37% | neededClean: false, wordsTotal: 495, wordsDialogue: 284
45.2419% | Overall | |
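The reported ratio, deviation, and overall figures can be reproduced from the metadata with simple arithmetic. The sketch below is not NC Bench's evaluator code. It assumes the ratio is wordsDialogue divided by wordsTotal, the deviation is the absolute distance from the 30% target in the scenario, and the overall score is the unweighted mean of the two evaluator scores. How each evaluator turns its deviation into a percentage score is not shown on the page and is not modelled here.

```python
TARGET_DIALOGUE_PCT = 30.0  # from the scenario: "500 words with 30% dialogue"


def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    return 100.0 * words_dialogue / words_total


def mean_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation)."""
    return sum(scores) / len(scores)


ratio = dialogue_ratio_pct(284, 495)           # wordsDialogue, wordsTotal
deviation = abs(ratio - TARGET_DIALOGUE_PCT)
overall = mean_score([90.48, 0.00])            # displayed evaluator scores

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Overall: {overall:.2f}%")
```

With the displayed, rounded inputs the mean comes out at 45.24%. The page's 45.2419% suggests the word-count score is slightly above 90.48% before rounding. Note also that the two evaluators report different totals for the same response (490 words versus 495), so they presumably tokenize words differently.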