NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0G6HY3C63N7KARCECCH8ZQ
Run Details
Model:
Claude Opus 4.7 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
15.3s
Cost:
$0.026890
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's smelled like burnt coffee and regret, which Molly considered an appropriate atmosphere for criminal planning. She spread a crayon-drawn map across the sticky table and weighed down the corners with creamer cups. Bill squinted at it. "Why is the bounce house shaped like a kidney?" "It's a castle." "It looks like a kidney." "Bill, focus. The party starts at two. Cake at two-thirty. Presents at three." She tapped the map with a chewed pencil. "That's our window." "Our window is a kid's birthday party." "Our window is a gold mine. Do you know what kids are getting these days? Tablets. Drones. One of my nephews got a real, actual pony. A horse, Bill." Bill set down his fork. "We are not stealing a pony." "I'm not saying we steal the pony. I'm saying the economy has changed." He rubbed his eyes. Six years ago they had knocked over a jewelry store in Reno. Now he was reviewing the floor plan of a backyard belonging to someone named Tyler, age seven. "What's our cover?" he asked. "Entertainers. I'm the clown. You're the magician." "Why am I the magician?" "Because you have the sad face of a man who does card tricks alone." "That is specifically hurtful." Molly slid a paper bag across the table. Inside were two wigs, a red foam nose, and what appeared to be a cape sewn from an old shower curtain. "Absolutely not," Bill said. "It's Tuesday. The party is Saturday. You have four days to learn three tricks and stop complaining." "And the extraction?" "Gift table is here." She circled a spot near the kidney. "While I'm twisting balloon animals, you do the disappearing act. Except what disappears is the presents. Into a duffel. Into the van. Into our retirement." "Our retirement is stolen Bluey merchandise." "Our retirement is diversified." She sipped her coffee. "Also there's a piñata." "Why does the piñata matter?" "Parents stuff them with gift cards now. It's a whole thing. We bust it open during the chaos and scoop." Bill stared at her for a long moment. A waitress refilled his cup without asking. Somewhere a child was screaming about pancakes, which felt like foreshadowing. "Molly," he said carefully, "at what point did our lives become this." "Around the time the bank job went sideways and we owed Desmond eighteen grand." "Right. That." "So Saturday. Two o'clock. You're Mysterio the Marvelous." "I'm not calling myself that." "The invoice is already printed." She handed him a business card featuring a clip-art rabbit and his face, badly Photoshopped, wearing a top hat. Bill looked at it the way a man looks at his own obituary. "Fine," he said. "But if one of those kids cries, we abort." "If one of those kids cries, we lean into it. Chaos is cover." He pocketed the nose. Molly grinned, and outside the window, the sun set over a Target parking lot like a promise.
created_at
"2026-04-24T19:41:50.921Z"
4/24/2026, 7:41:50 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
660
moderation_latency
(null)
generation_time
15230
tokens_prompt
77
tokens_completion
768
native_tokens_prompt
98
native_tokens_completion
1056
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02689
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"278c1da6-229a-4d62-86e2-778b20ed5806"
is_byok
false
latency
660
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059710-h6vl7zJiaEauirl2XwlU"
session_id
(null)
api_type
"completions"
id
"gen-1777059710-i6HHHhI4nAQuZxtQIkan"
upstream_id
"278c1da6-229a-4d62-86e2-778b20ed5806"
total_cost
0.02689
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
492
0.00%
Dialogue to Total Word Ratio
Ratio: 56.16%, Deviation: 26.16%
neededClean
false
wordsTotal
495
wordsDialogue
278
47.9934%