NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0G31VRXH4HX8XMR7SY5N8G
Run Details
Model:
Claude Opus 4.7 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
14.8s
Cost:
$0.028140
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
45.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's smelled like burnt coffee and bad decisions, which suited Molly just fine. She unrolled a napkin across the table and smoothed it out like a treasure map. "Okay, Bill. Saturday. Fourteen hundred hours. The Pendergast residence." Bill squinted at the napkin. "Why's there a syrup stain on Rhode Island?" "That's the bouncy castle." "Rhode Island is the bouncy castle?" "Bill, focus. The target is turning seven. His name is Tyler. His parents are loaded, and according to the Evite, there will be, and I quote, 'a mountain of presents.'" "You got invited to an Evite?" "I got forwarded an Evite," Molly said, stabbing a hash brown. "Big difference. Legally." Bill leaned back, rubbing the stubble on his chin. "I don't know, Moll. Kids' presents? That feels like a new low. Even for us." "We stole a kidney last March." "That kidney was going to a bad person." "The kidney was going to a hospital." "Allegedly." Molly sighed and pulled out a second napkin. This one had a crayon drawing of what appeared to be a clown. "Here's our guy. Bingo the Birthday Clown. Real name Gerald. Sixty-two, two DUIs, desperate for cash. He's our inside man." "You recruited a clown." "I recruited a professional entertainer." "Who honks a horn for a living." "Bill." "I'm just saying, Moll, if this goes sideways, I don't want my mugshot next to a guy named Bingo." Molly tapped the napkin. "Gerald distracts the kids with balloon animals in the backyard. You and I slip in through the side gate dressed as caterers. We load the presents into the minivan. In and out in eight minutes." "We have a minivan?" "I'm stealing one Friday." "From where?" "A different children's party." Bill stared at her for a long moment, then picked up his coffee. "You know, when we met, I thought you were the sensible one." "And when I met you, I thought you could pick a lock without crying." "That was one time, and the lock was emotionally complicated." "Bill." "Fine." He set down the mug. "What about the parents?" "Dad's on grill duty. Mom's refilling the juice boxes. Neither of them has made eye contact with their son since 2019. We're ghosts." "And if a kid sees us?" "Kids are unreliable witnesses. Last year a six-year-old told a cop I looked like Shrek." "You kinda did that day." "Bill." "Sorry." Molly folded the napkins and slid them into her purse. "So. Are you in, or do I call Derek?" "Don't call Derek. Derek ate the last job." "Derek literally ate the evidence." "Derek is a liability." "Then we're agreed." She extended her hand across the table. "Operation Party Favor." Bill shook it reluctantly. "I want it on record that I have a bad feeling about this." "Noted." "And I'm not wearing the apron this time." "You're absolutely wearing the apron." "Molly." "It has pockets, Bill. Think of the pockets."
created_at
"2026-04-24T19:39:56.156Z"
4/24/2026, 7:39:56 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
660
moderation_latency
(null)
generation_time
14766
tokens_prompt
77
tokens_completion
784
native_tokens_prompt
98
native_tokens_completion
1106
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02814
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"2d53c255-6fc3-4986-b9ce-6e62a336a393"
is_byok
false
latency
659
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059596-1ILquBDcsOhycCo5Dpmi"
session_id
(null)
api_type
"completions"
id
"gen-1777059596-9d7Rn3awWXA0MaqSpLSS"
upstream_id
"2d53c255-6fc3-4986-b9ce-6e62a336a393"
total_cost
0.02814
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
490
0.00%
Dialogue to Total Word Ratio
Ratio: 76.47%, Deviation: 26.47%
neededClean
false
wordsTotal
493
wordsDialogue
377
45.2419%