NC Bench
Run: 2026-04-24-01KQ0G275FE6HA8AQQRTE4GBWA
Run Details
Model: Claude Opus 4.7 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 15.6s
Cost: $0.029215
Provider: Amazon Bedrock
Evaluation summary
Score      Evaluator
97.63%     Matches word count
1.17%      Dialogue to Total Word Ratio
49.3969%   Overall
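The page does not say how the overall figure is derived. The numbers are consistent with a plain arithmetic mean of the two evaluator scores, so the sketch below checks that reading. The function name `overall_score` is hypothetical, and the equal weighting is an assumption inferred from the displayed values, not a documented NC Bench rule.

```python
# Sketch only: assumes the overall score is the unweighted mean of the
# evaluator scores. This is inferred from the displayed numbers.

def overall_score(scores):
    """Return the arithmetic mean of a list of percentage scores."""
    return sum(scores) / len(scores)

# Scores as displayed on the page (already rounded to two decimals).
displayed_scores = [97.63, 1.17]
reported_overall = 49.3969

estimate = overall_score(displayed_scores)

# Each input is rounded to 0.01, so the mean can be off by up to 0.005.
difference = abs(estimate - reported_overall)
print(f"estimated overall: {estimate:.2f}%, difference: {difference:.4f}")
```

The small remaining difference is within what rounding of the two displayed scores would explain, which supports the mean interpretation without proving it.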
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's smelled like burnt coffee and regret, which Molly considered an appropriate ambiance for criminal planning. "Absolutely not," Bill said, stabbing a hash brown like it owed him money. "I am not robbing a seven-year-old." "She's turning eight." "Oh, well, in that case." Molly unfolded a napkin covered in diagrams. "Bill. Look at this intel. Tabitha Worthington-Price. Daddy's a hedge fund guy. Mommy's an influencer. Do you know what rich people give their kids now?" "Love and stability?" "iPads, Bill. Gaming consoles. There's a rumor going around that one of the aunts is bringing a pony." "We can't fence a pony." "I know a guy." "You always know a guy." Bill rubbed his temples. "Why does it always have to be you knowing a guy?" "Because I'm charismatic and you have the face of a disgraced accountant." Molly slid the napkin across. "Here's the layout. Bounce house, east lawn. Face painter, gazebo. Presents pile, sunroom. That's the soft target." "And the guards?" "Two clowns." "Clowns? Like, actual—" "Hired muscle dressed as clowns. Private security. Daddy Worthington is paranoid after the yacht thing." "What yacht thing?" "Doesn't matter. Point is, clowns." Bill exhaled slowly. "What's the extraction plan?" "Ice cream truck." "Molly." "Hear me out. Kids love ice cream trucks. Parents wave at ice cream trucks. Nobody questions an ice cream truck. We pull up at 2:15, which is exactly when Tabitha's mother does her Instagram Live, which means every adult with a phone is trying to angle for the background. We slip in the side gate dressed as entertainers—" "What kind of entertainers?" "Balloon artists." "I can't make balloon animals." "You can make a snake, Bill. A snake is just a balloon." "That's not—" Bill took a long sip of coffee. "Fine. I'm the snake guy. Then what?" "Then I do my magic act by the sunroom. While everyone's watching me pull a rabbit out of a hat, you load the presents into the false bottom of the ice cream truck." 
"There's a false bottom now?" "Carl's been working on it all week." "Carl has a gambling problem, Molly." "Which is exactly why he works fast." She leaned forward. "Bill. Thirty presents. Minimum six hundred bucks retail per kid at this party. We're looking at twenty grand in merchandise, easy. Plus the pony." "Stop saying the pony." "The pony is happening." Bill stared out the window at the parking lot, where a pigeon was eating a cigarette butt with tremendous conviction. "What about the birthday girl? She's gonna cry." "She has three iPads already. It'll build character." "That's a terrible thing to say." "Bill. Are you in, or are you in?" "Those aren't—" He sighed the sigh of a man who had lost this argument before it started. "What time do I need to learn to make a snake?" Molly smiled and signaled for the check. "Tuesday. Wear something cheerful. And Bill?" "What?" "Bring a carrot. For the pony."
created_at
"2026-04-24T19:39:28.82Z"
4/24/2026, 7:39:28 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
712
moderation_latency
(null)
generation_time
15508
tokens_prompt
77
tokens_completion
799
native_tokens_prompt
98
native_tokens_completion
1149
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.029215
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"c1c17d4f-3264-422f-aea8-61f50a09ea52"
is_byok
false
latency
712
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059568-OjiXh27BuZ9HcqTjMuz0"
session_id
(null)
api_type
"completions"
id
"gen-1777059568-dQYAJmCreLFi3Vu0NW4Q"
upstream_id
"c1c17d4f-3264-422f-aea8-61f50a09ea52"
total_cost
0.029215
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
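The reported cost can be cross-checked against the native token counts. The sketch below assumes per-million-token rates of $5 for input and $25 for output. These rates do not appear anywhere on this page and are an assumption chosen because they reproduce the reported `total_cost`. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# ASSUMED rates in USD per million tokens; not stated in the run metadata.
ASSUMED_INPUT_USD_PER_MTOK = Decimal("5")
ASSUMED_OUTPUT_USD_PER_MTOK = Decimal("25")

def estimate_cost(prompt_tokens, completion_tokens):
    """Estimate request cost in USD from native token counts."""
    total = (prompt_tokens * ASSUMED_INPUT_USD_PER_MTOK
             + completion_tokens * ASSUMED_OUTPUT_USD_PER_MTOK)
    return total / Decimal(1_000_000)

# native_tokens_prompt and native_tokens_completion from the metadata above.
cost = estimate_cost(98, 1149)
print(cost)
```

`Decimal` is used instead of floats so the comparison with the reported figure is exact rather than subject to binary rounding.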
Evaluation details
Result     Evaluator                      Details                            Meta Data
97.63%     Matches word count             n/a                                neededClean: false; words: 493
1.17%      Dialogue to Total Word Ratio   Ratio: 79.44%, Deviation: 9.44%    neededClean: false; wordsTotal: 496; wordsDialogue: 394
49.3969%   Overall
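The ratio and deviation reported by the dialogue evaluator follow directly from its word counts and the 70% target in the scenario. A minimal sketch, with hypothetical function names, is shown here. How the evaluator converts the deviation into its 1.17% score is not shown on the page, so that step is left out.

```python
# Sketch only: recomputes the displayed ratio and deviation from the
# evaluator's word counts. The scoring curve itself is not reproduced.

def dialogue_ratio(words_dialogue, words_total):
    """Percentage of words that fall inside dialogue."""
    return 100.0 * words_dialogue / words_total

def deviation_from_target(ratio, target):
    """Absolute distance, in percentage points, from the target ratio."""
    return abs(ratio - target)

TARGET_RATIO = 70.0  # from the scenario: "70% dialogue"

ratio = dialogue_ratio(394, 496)       # wordsDialogue, wordsTotal
deviation = deviation_from_target(ratio, TARGET_RATIO)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

Note that the two evaluators count the response differently (493 words versus 496), so the ratio uses the dialogue evaluator's own total.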