NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0GBS5C0KRHDB5B3GZZ1HG0
Run Details
Model:
Claude Opus 4.7 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
14.4s
Cost:
$0.026915
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
93.65%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
46.8250%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back booth of Denny's smelled like burnt coffee and bad decisions, which Molly considered the ideal atmosphere for criminal planning. She unfolded a napkin across the table and smoothed it with the palm of her hand like it was a blueprint from a Swiss bank. "Okay," she said. "The target is a seven-year-old named Tristan." Bill stopped chewing his hash browns. "Tristan." "Tristan." "We're robbing a Tristan." "His mother rented a bouncy castle, Bill. A *themed* bouncy castle. Do you know what that costs? These are not people who buy their child socks." Bill set down his fork. "Molly. Sweetheart. Love of my life. We are adult human beings." "Speak for yourself." "I have a 401k." "You have a coffee can buried in your mom's yard." "That is a 401k to me," Bill said, with real dignity. "And I'm just saying, maybe we aim higher than a birthday party. Maybe a jewelry store. Maybe a nice, grown-up liquor mart." Molly leaned forward. "Bill. Listen to me. Last year at his sister's party, the parting gifts included actual Nintendo Switches. For the *guests*. Do you understand what I'm telling you? The loot table on this job is insane." "And what's the plan? We kick in the door? We shoot a clown?" "No one is shooting a clown." "I want that in writing." Molly slid him a second napkin, on which she had drawn something that looked vaguely like a house, or possibly a dog. "I go in as Auntie Denise. Nobody knows Auntie Denise. Every family has one. You arrive twenty minutes later as the magician." "I am not a magician." "You own a cape." "I own a *robe*, Molly, and it has a hole in it where I—" "Close enough. You do three card tricks, you herd the kids into the living room for the show, and I clean out the present table. Gift bags too. All of it goes into the minivan." "We have a minivan?" "We will have stolen a minivan." Bill rubbed his forehead with both hands. "So to summarize. To steal toys from a child, we are going to commit grand theft auto, impersonate a relative, impersonate a performing artist, and deceive approximately fifteen minors." "And eat cake." "And eat cake?" "There is always cake, Bill. Factor in the cake." He was quiet for a long moment. A waitress refilled their coffees and moved on, sensing things. "What kind of cake?" Bill finally said. "Ice cream cake. Confirmed. I drove past the Carvel on the way here." Bill exhaled the sigh of a man whose moral compass had been rotating freely since 2008. He picked up his fork, speared a hash brown, and pointed it at her like a gavel. "Fine," he said. "But if there's a piñata, I get the piñata." "Bill." "Non-negotiable." "Fine. You get the piñata." They clinked coffee mugs across the napkin, and somewhere across town, a seven-year-old named Tristan slept, unaware.
created_at
"2026-04-24T19:44:42.161Z"
4/24/2026, 7:44:42 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
675
moderation_latency
(null)
generation_time
13800
tokens_prompt
77
tokens_completion
752
native_tokens_prompt
98
native_tokens_completion
1057
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.026915
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"89ad692a-17b9-4080-91aa-ffed78d11077"
is_byok
false
latency
674
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059882-7LKaX0Tg4L7Csc82vxLt"
session_id
(null)
api_type
"completions"
id
"gen-1777059882-LDY9ArO2k6altjulFlUY"
upstream_id
"89ad692a-17b9-4080-91aa-ffed78d11077"
total_cost
0.026915
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
93.65%
Matches word count
n/a
neededClean
false
words
491
0.00%
Dialogue to Total Word Ratio
Ratio: 65.79%, Deviation: 15.79%
neededClean
false
wordsTotal
497
wordsDialogue
327
46.8250%