NC Bench
Run: 2026-04-24-01KQ0G39J6DNSJ2V20DVH6287W
Run Details
Model: Claude Opus 4.7 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.1s
Cost: $0.027565
Provider: Amazon Bedrock
Evaluation summary
Matches word count: 81.27%
Dialogue to Total Word Ratio: 0.00%
Overall: 40.6364%
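The page does not say how the overall score is derived from the two evaluator scores. The sketch below assumes an unweighted mean, which lands within rounding distance of the reported 40.6364%. The small gap is presumably because the word count score is displayed rounded to two decimals. The variable names are illustrative, not taken from NC Bench.

```python
from statistics import mean

# Evaluator scores as displayed in the evaluation summary (percent).
evaluator_scores = {
    "Matches word count": 81.27,
    "Dialogue to Total Word Ratio": 0.00,
}

# ASSUMPTION: the overall score is the unweighted mean of the evaluator scores.
overall = mean(list(evaluator_scores.values()))

reported_overall = 40.6364
print(f"mean of displayed scores: {overall:.4f}%")
print(f"difference from reported: {abs(overall - reported_overall):.4f} points")
```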
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's at 2 AM smelled like burnt coffee and bad decisions, which Molly figured was appropriate, considering. She spread the blueprint across the table, knocking over a sugar caddy in the process. "Is that a floor plan or a place mat?" Bill asked, squinting. "It's both. I traced it onto the place mat because I left the actual floor plan in the other pants." "The other pants." "Bill, focus." She tapped the crayon outline with a fingernail. "Bouncy castle here. Piñata here. Presents table — and this is the money shot — right next to the cake, which is shaped like a unicorn, according to Evite." Bill leaned back, arms crossed over his Members Only jacket. "I want it on the record that I have reservations." "Noted." "Serious reservations." "Also noted." "We are stealing from a seven-year-old." "We are stealing from a seven-year-old's *parents,*" Molly corrected, "who, according to their registry, are buying her a pony. A real one. Named Cinnamon. These people deserve what's coming to them." Bill sighed the sigh of a man who had, over the course of three decades, been talked into worse. "What's the play?" "Simple. You're the clown." "No." "Bill." "No, Molly, we talked about this, the last time I was a clown a child bit me, I still have the—" "You're the clown because nobody looks at the clown. Clowns are furniture. Clowns are wallpaper with feet. You waltz in through the side gate, the mom hands you a juice box, you mingle." "And you?" "I'm the magician." She produced, from inside her hoodie, a collapsible top hat and snapped it open with a flourish that dislodged a half-eaten breadstick. "Abracadabra. While I'm pulling scarves out of little Timmy's ear, you're loading presents into the garbage bags we've stashed in the bouncy castle." "Why are the bags in the bouncy castle?" "Because nobody suspects the bouncy castle. The bouncy castle is Switzerland." Bill rubbed his temples. "And the getaway?" "Ice cream truck." "We don't have an ice cream truck." 
"We will by Saturday. Craigslist. Guy in Reseda. He threw in the jingle for free, though the jingle is — I want to prepare you — it's *Taps.*" "Taps." "Played on a xylophone. It's distinctive." "It's a funeral song, Molly." "It's *our* funeral song." She grinned, and Bill had to admit the grin still did something to him, even after the incident in Tulsa, even after the parrot. "Think about it. Fifty gifts, minimum. Apple Watches for kids. iPads. Some kid's gonna give her a Fabergé egg, I can feel it." "Fabergé eggs are not a common birthday gift." "You don't know these people." Bill stared at the crayon map for a long moment, at the crude unicorn Molly had drawn where the cake was supposed to be. He picked up his fork and pointed it at her. "I'm not wearing the big shoes." "Bill. Baby. You're absolutely wearing the big shoes."
created_at
"2026-04-24T19:40:04.044Z"
4/24/2026, 7:40:04 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
659
moderation_latency
(null)
generation_time
15027
tokens_prompt
77
tokens_completion
773
native_tokens_prompt
98
native_tokens_completion
1083
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.027565
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"aca62090-701d-4a32-8926-b4522202a556"
is_byok
false
latency
659
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059604-e8Z89TcnV8QJKeel3yV3"
session_id
(null)
api_type
"completions"
id
"gen-1777059604-fGSMx4RyJyJNCz0PobUT"
upstream_id
"aca62090-701d-4a32-8926-b4522202a556"
total_cost
0.027565
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
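The reported total_cost can be cross-checked against the native token counts in the metadata. The sketch below assumes linear per-token pricing with rates of $5 per million prompt tokens and $25 per million completion tokens. These rates are an assumption and are not stated anywhere on this page. They are used here only because they reproduce the reported figure.

```python
# Values copied from the run metadata.
native_tokens_prompt = 98
native_tokens_completion = 1083
reported_total_cost = 0.027565  # USD

# ASSUMPTION: per-million-token rates in USD; not stated on this page.
ASSUMED_INPUT_USD_PER_MTOK = 5
ASSUMED_OUTPUT_USD_PER_MTOK = 25


def estimate_cost_usd(prompt_tokens, completion_tokens, input_rate, output_rate):
    """Linear token pricing: tokens times rate, summed, scaled from per-million."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


estimated = estimate_cost_usd(
    native_tokens_prompt,
    native_tokens_completion,
    ASSUMED_INPUT_USD_PER_MTOK,
    ASSUMED_OUTPUT_USD_PER_MTOK,
)
print(f"estimated ${estimated:.6f} vs reported ${reported_total_cost:.6f}")
```

Almost all of the estimate comes from the completion tokens, so the response length dominates the cost of this run under these assumed rates.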
Evaluation details
Matches word count: 81.27%
Details: n/a
Meta Data: neededClean false, words 488

Dialogue to Total Word Ratio: 0.00%
Details: Ratio: 66.94%, Deviation: 36.94%
Meta Data: neededClean false, wordsTotal 493, wordsDialogue 330

Overall: 40.6364%
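The ratio and deviation reported by the dialogue evaluator follow directly from wordsDialogue and wordsTotal and the 30% target in the scenario. The sketch below reproduces that arithmetic. The count_dialogue_words helper is hypothetical: it treats text inside straight double quotes as dialogue and splits on whitespace, which may not match the evaluator's actual method. The two evaluators report different totals (488 and 493), so their word counting presumably differs as well.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: 30% dialogue


def dialogue_stats(words_dialogue, words_total, target=TARGET_RATIO):
    """Return (ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def count_dialogue_words(text):
    """Hypothetical counter: words inside straight double quotes vs. all words."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    words_total = len(text.split())
    return words_dialogue, words_total


# Figures from the evaluation details.
ratio, deviation = dialogue_stats(330, 493)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# The helper applied to a short excerpt of the response.
sample = 'Bill rubbed his temples. "And the getaway?" "Ice cream truck."'
print(count_dialogue_words(sample))
```

With 330 of 493 words counted as dialogue, the response is more than double the requested share, which is consistent with the 0.00% score for this evaluator.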