NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WYWD21FY20JPCNJS7W2KM
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 47s
Cost:
$0.232885
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.99%
Dialogue to Total Word Ratio
(details)
99.9956%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn blueprint across the hood of their dented minivan, weighing down the corners with a juice box and a handful of loose Cheerios. The blueprint was, in fairness, drawn on the back of a Denny's placemat, but she'd used two different colored crayons, which she felt lent the whole operation a professional edge. The afternoon sun beat down on the strip mall parking lot where they'd agreed to meet, casting harsh shadows from a nearby shopping cart corral. "The target is the Henderson residence on Maple Street," she said. "Kid turns seven at two o'clock. We're looking at twenty-odd guests, each bringing a wrapped gift." Bill squinted at the blueprint, turning it sideways. He was a large man with a soft face and the general demeanor of someone who had been confused since birth and had simply learned to live with it. "Is that a dog or a security guard?" he asked, pointing at a brown blob near what Molly had labeled BACK DORE. "That's a bouncy castle." "Ah." Molly had been casing the Henderson house for three days, which mostly involved sitting in the minivan eating gas station sandwiches and watching a woman in yoga pants carry inflatable flamingos inside. She had identified two points of entry, one structural weakness, and a deeply troubling number of lawn gnomes. "Here's how it goes down," she said. "Party kicks off at two. By two-fifteen, every kid is hopped up on sugar and screaming. Parents will be hiding in the kitchen. Nobody watches the present table." "What about the clown?" Bill asked. "What clown?" "Every kids' party has a clown, Molly. You can't just ignore the clown." She stared at him for a long moment. He had a point, and she hated that. "Fine. If there's a clown, you handle the clown." "How exactly do I handle a clown?" "Engage him in conversation. Ask about the economy. Use your imagination." Bill nodded slowly, the way he always did when he was storing information in the swampy filing cabinet behind his eyes. He pulled a small notebook from his jacket and wrote CLOWNS in block letters, underlining it twice. The plan itself was elegant in its simplicity. Molly would enter through the side gate disguised as a party entertainer, carrying a bag large enough to hold a small pony. Bill would create a diversion in the backyard, the nature of which Molly had left deliberately vague because Bill's diversions tended to be more effective when unscripted. Last Fourth of July, he'd somehow started a conga line at a funeral, and they'd walked out with an entire catering spread. "We grab the gifts, load the van, and we're gone before they cut the cake," Molly said. "What if the cake is good?" Bill asked, his eyes suddenly earnest. "What if it's ice cream cake?" Molly folded the placemat with the careful precision of a woman who had planned eleven heists and regretted nine of them. "Then God help us both," she said.
created_at
"2026-02-24T22:40:17.324Z"
2/24/2026, 10:40:17 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1422
moderation_latency
276
generation_time
106628
tokens_prompt
77
tokens_completion
2518
native_tokens_prompt
92
native_tokens_completion
9297
native_tokens_completion_images
(null)
native_tokens_reasoning
1759
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.232885
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"7d728b0a-f239-4b43-92ac-eff92021985b"
is_byok
false
latency
1389
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972817-ItokFu6Cjmt7vu35mQdT"
upstream_id
"7d728b0a-f239-4b43-92ac-eff92021985b"
total_cost
0.232885
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
501
99.99%
Dialogue to Total Word Ratio
Ratio: 29.96%, Deviation: 0.04%
neededClean
false
wordsTotal
504
wordsDialogue
151
99.9956%