NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYYM8N4WX1A1X9HBW0DWXH0
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the sticky table of the doughnut shop, pushing aside two cruller wrappers and a coffee cup that had seen better days. The map was done in colored pencil, mostly because Bill had forgotten to buy actual pens, and the legend in the corner featured a tiny skull and crossbones next to the words "bouncy castle" as if it were a genuine hazard. "The party starts at two," Molly said, tapping a purple square that apparently represented the birthday boy's house. "We've got a forty-five-minute window between the gift table reaching maximum capacity and the moment they start opening presents." Bill leaned forward and squinted. He rotated the map twice before settling on an orientation that may or may not have been correct. "Why is the dog circled in red?" "Because the dog is a problem, Bill. The dog is always a problem. Last Tuesday I did recon disguised as a meter reader, and that animal chased me into a recycling bin. I was in there for twenty minutes breathing in old milk cartons." Bill nodded solemnly, as though this were a perfectly normal occupational hazard. In fairness, for them it was. Their last job had involved stealing a six-foot teddy bear from a carnival, and Molly had ended up dangling from a Ferris wheel while Bill argued with a clown about property rights. "So we neutralize the dog," Bill said, cracking his knuckles with the confidence of a man who had never successfully neutralized anything. "We don't neutralize the dog. We distract the dog. I bought seventeen dollars' worth of bacon treats. You're going to lure him into the garage while I slip through the side gate." "What if the garage is locked?" "It won't be. The dad leaves it open because he smokes cigars in there when his wife isn't looking. I've seen him do it three times. The man has a routine." Bill studied the map again, tracing the route from the side gate to the gift table with his finger. The path wound past the bouncy castle, through a cluster of symbols that Molly explained were lawn chairs, and around something she had labeled "slip zone" due to a garden sprinkler that ran on a timer. "How do we carry the presents out?" Bill asked. "There could be dozens." "IKEA bags. The big blue ones. They hold everything. I once fit an entire drum set in two of them." "That was a dark day for music." "Focus, Bill." She pulled out a photograph she had taken from across the street, zoomed in just enough to show a folding table draped in superhero-themed wrapping paper. Beneath the cheerful images of caped figures were boxes of varying sizes, each one representing someone's thoughtful gesture toward a seven-year-old named Tristan. Molly felt nothing about this. She had bills to pay and a fence in Burbank who would move Hot Wheels sets without asking questions. "We go in quiet, we go in fast, and we do not engage with any child," she said firmly. Bill saluted. They were going to get caught. They always got caught. But hope springs eternal in the hearts of idiots.
created_at
"2026-02-08T15:41:41.114368+00:00"
Sunday, February 8th, 2026 at 3:41:41 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1258
moderation_latency
158
generation_time
20009
tokens_prompt
61
tokens_completion
649
native_tokens_prompt
75
native_tokens_completion
732
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.018675
router
(null)
provider_responses
0
id
"03823e3a-ddc1-41f5-ad93-a81e43e81a44"
status
200
is_byok
false
latency
1257
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770565280-fSDLAoeNeY1rldZ8Xieh"
upstream_id
"03823e3a-ddc1-41f5-ad93-a81e43e81a44"
total_cost
0.018675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
537
1.3334%
Dialogue to Total Word Ratio
Ratio: 39.29%, Deviation: 9.29%
neededClean
false
wordsTotal
537
wordsDialogue
211
0.6667%