NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGZ06M1V0ZWE2H60QC2RK6PM
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn blueprint across the sticky table at Denny's, knocking over a small tower of creamer cups. The crayon lines were smudged in places where she'd erased with her thumb, but the layout was unmistakable: a suburban backyard complete with a bouncy castle, a folding table marked PRESENTS in aggressive red lettering, and what appeared to be a pony standing near the fence. "There's a pony?" Bill asked, leaning forward. "There's a pony," Molly confirmed. "His name is Butterscotch, and he bites." "How do you know his name?" "I did reconnaissance. I dressed up as a clown and walked past the house three times. The mother came outside and told me to stop frightening the neighbors." Bill rubbed his temples. He had not slept well. He had not slept well in years, but last night was particularly bad because Molly had called him at two in the morning to say she'd identified a soft entry point near the recycling bins. "Okay, walk me through the timeline," he said. "The party starts at noon. Pin the tail on the donkey runs from twelve fifteen to twelve thirty. That's our window." "Fifteen minutes? That's tight, Molly." "It's plenty. Those kids will be blindfolded and spinning. They won't see a thing. Half of them will be crying. Trust me, I've been to children's parties. It's chaos. Beautiful, exploitable chaos." "And the parents?" "Wine. They'll have wine by twelve oh five. I've studied the Facebook group. These parents drink like they're trying to forget something." Bill stared at the blueprint. Something nagged at him. He pointed to a series of small X marks along the perimeter of the yard. "What are these?" "Dogs." "How many dogs?" "Eleven. It's also a dog's birthday, apparently. The family is unhinged." Bill sat back in the booth and let out a long breath. A waitress refilled his coffee without making eye contact, which he appreciated. He was wearing a black turtleneck at nine in the morning and did not want to explain himself. "Why are we doing this again?" he asked. "Because little Aiden Peterson told my nephew that Santa isn't real, and his mother laughed about it at book club. This is justice, Bill." "This is felony larceny." "It's misdemeanor larceny at best. I priced the gifts. We're looking at maybe two hundred dollars in Legos and gift cards to Target." "So we're risking arrest for two hundred dollars in Legos." "And the principle of the thing. Don't forget the principle." Bill picked up a crayon and drew a small arrow near the recycling bins. He hated himself for how quickly his mind started solving the logistics. The bouncy castle provided natural sound cover. The pony would be a distraction. If they timed it right, they could load everything into a wagon and walk out looking like late-arriving guests. "We'll need disguises," he said quietly. Molly grinned and reached under the table, producing two full clown costumes. "Way ahead of you."
created_at
"2026-02-08T16:09:09.622348+00:00"
Sunday, February 8th, 2026 at 4:09:09 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1411
moderation_latency
160
generation_time
18207
tokens_prompt
61
tokens_completion
653
native_tokens_prompt
75
native_tokens_completion
745
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.019
router
(null)
provider_responses
0
id
"eb67a0aa-6132-4472-a711-0339bf90fcbd"
status
200
is_byok
false
latency
1409
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770566930-3jFtB1KLmcj57wHoJV76"
upstream_id
"eb67a0aa-6132-4472-a711-0339bf90fcbd"
total_cost
0.019
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9840%
Matches word count
n/a
neededClean
false
words
498
78.3529%
Dialogue to Total Word Ratio
Ratio: 47.79%, Deviation: 2.21%
neededClean
false
wordsTotal
498
wordsDialogue
238
89.1684%