NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYYF5TZS59NHD57EKQFVQ29
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of the minivan, smoothing out the crayon marks with her palm. The blueprint had cost her three juice boxes and a pack of fruit snacks, courtesy of a very entrepreneurial six-year-old named Timmy who lived next door to the target. "Alright, here's what we're looking at," she said, tapping the paper. "The Hendersons' backyard. Party starts at two. We've got a bouncy castle here, face painting station here, and the present table is right next to the sliding glass door." Bill leaned over the map and squinted. "Is that a dragon or a dog?" "That's Mrs. Henderson. Timmy isn't exactly Rembrandt. Focus, Bill." "I'm focused. I'm just saying, if the woman actually looks like that, we've got bigger problems than security." Molly pulled a pair of binoculars from her fanny pack and handed them to him. "I've been doing surveillance for three days. The gift haul is going to be massive. We're talking thirty kids, all from the wealthy part of the subdivision. These parents don't show up with anything less than fifty bucks a pop." Bill did the math on his fingers, his eyes widening as the number climbed. "That's fifteen hundred dollars in presents." "Retail value, easy. We fence them at forty cents on the dollar, and we're still walking away with a nice payday." "And you're sure about the getaway route?" "Positive. We go through the side gate, cut across the Petersons' yard, and the van will be parked on Maple Street. Thirty seconds, tops." Bill scratched his chin, looking genuinely troubled for the first time. The afternoon sun caught his receding hairline and made it gleam like a warning beacon. He shifted his weight from one worn sneaker to the other. "What about the kids, Molly? I mean, these are children. They're going to cry." "They're eight-year-olds, Bill. They have the emotional memory of goldfish. By Monday they won't even remember what they were upset about." "I don't know. My therapist says I need to stop avoiding guilt." "Your therapist also told you to start journaling, and you used that notebook to plan the pharmacy job." He couldn't argue with that. She watched him wrestle with what remained of his conscience, a quick and uneventful match. "Fine. But I'm not wearing the clown costume again." "You have to wear the clown costume. That's the whole plan. You distract them with balloon animals while I load the presents into garbage bags." "Last time I wore that thing, a kid bit me." "You made him a snake. He wanted a giraffe. That's on you." Bill sighed, the deep sigh of a man who had made peace with a life of increasingly absurd crime. He folded the map carefully, respecting Timmy's artistry despite everything. "What time do we move?" Molly checked her watch and smiled the kind of smile that had launched a thousand bad decisions. "Right after cake. Nobody leaves during cake."
created_at
"2026-02-08T15:38:58.486449+00:00"
Sunday, February 8th, 2026 at 3:38:58 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3965
moderation_latency
(null)
generation_time
24301
tokens_prompt
61
tokens_completion
641
native_tokens_prompt
75
native_tokens_completion
730
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.018625
router
(null)
provider_responses
0
id
"msg_019NvU2tun3evy1UHWJNoDbY"
status
200
is_byok
false
latency
3965
endpoint_id
"c926c048-563e-44b0-8434-95965bee924c"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770565113-KAtrKUNmT12vbNU3e2SF"
upstream_id
"msg_019NvU2tun3evy1UHWJNoDbY"
total_cost
0.018625
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9190%
Matches word count
n/a
neededClean
false
words
497
0.3046%
Dialogue to Total Word Ratio
Ratio: 60.76%, Deviation: 10.76%
neededClean
false
wordsTotal
497
wordsDialogue
302
50.1118%