Model

Claude Opus 4.6

Test

Dialogue tags

Scenario

Write 500 words with 30% dialogue

Run ID

2026-02-08-01KGYYM8N4WX1A1X9HBW0DWXH0

Model input

User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly spread the hand-drawn map across the sticky table of the doughnut shop, pushing aside two cruller wrappers and a coffee cup that had seen better days. The map was done in colored pencil, mostly because Bill had forgotten to buy actual pens, and the legend in the corner featured a tiny skull and crossbones next to the words "bouncy castle" as if it were a genuine hazard.

"The party starts at two," Molly said, tapping a purple square that apparently represented the birthday boy's house. "We've got a forty-five-minute window between the gift table reaching maximum capacity and the moment they start opening presents."

Bill leaned forward and squinted. He rotated the map twice before settling on an orientation that may or may not have been correct. "Why is the dog circled in red?"

"Because the dog is a problem, Bill. The dog is always a problem. Last Tuesday I did recon disguised as a meter reader, and that animal chased me into a recycling bin. I was in there for twenty minutes breathing in old milk cartons."

Bill nodded solemnly, as though this were a perfectly normal occupational hazard. In fairness, for them it was. Their last job had involved stealing a six-foot teddy bear from a carnival, and Molly had ended up dangling from a Ferris wheel while Bill argued with a clown about property rights.

"So we neutralize the dog," Bill said, cracking his knuckles with the confidence of a man who had never successfully neutralized anything.

"We don't neutralize the dog. We distract the dog. I bought seventeen dollars' worth of bacon treats. You're going to lure him into the garage while I slip through the side gate."

"What if the garage is locked?"

"It won't be. The dad leaves it open because he smokes cigars in there when his wife isn't looking. I've seen him do it three times. The man has a routine."

Bill studied the map again, tracing the route from the side gate to the gift table with his finger. The path wound past the bouncy castle, through a cluster of symbols that Molly explained were lawn chairs, and around something she had labeled "slip zone" due to a garden sprinkler that ran on a timer.

"How do we carry the presents out?" Bill asked. "There could be dozens."

"IKEA bags. The big blue ones. They hold everything. I once fit an entire drum set in two of them."

"That was a dark day for music."

"Focus, Bill." She pulled out a photograph she had taken from across the street, zoomed in just enough to show a folding table draped in superhero-themed wrapping paper. Beneath the cheerful images of caped figures were boxes of varying sizes, each one representing someone's thoughtful gesture toward a seven-year-old named Tristan. Molly felt nothing about this. She had bills to pay and a fence in Burbank who would move Hot Wheels sets without asking questions.

"We go in quiet, we go in fast, and we do not engage with any child," she said firmly.

Bill saluted.

They were going to get caught. They always got caught. But hope springs eternal in the hearts of idiots.

created_at: "2026-02-08T15:41:41.114368+00:00"
model: "anthropic/claude-4.6-opus-20260205"
app_id: 182717
external_user: null
streamed: true
cancelled: false
latency: 1258
moderation_latency: 158
generation_time: 20009
tokens_prompt: 61
tokens_completion: 649
native_tokens_prompt: 75
native_tokens_completion: 732
native_tokens_completion_images: null
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: null
num_input_audio_prompt: null
num_media_completion: 0
num_search_results: null
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.018675
router: null
provider_responses:
  0:
    id: "03823e3a-ddc1-41f5-ad93-a81e43e81a44"
    status: 200
    is_byok: false
    latency: 1257
    endpoint_id: "a8708fb9-08ea-43bb-a3b1-165c87ef09df"
    provider_name: "Amazon Bedrock"
    model_permaslug: "anthropic/claude-4.6-opus-20260205"
api_type: "completions"
id: "gen-1770565280-fSDLAoeNeY1rldZ8Xieh"
upstream_id: "03823e3a-ddc1-41f5-ad93-a81e43e81a44"
total_cost: 0.018675
cache_discount: null
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"

Evaluation details

Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 537
1.3334% | Dialogue to Total Word Ratio | Ratio: 39.29%, Deviation: 9.29% | neededClean: false, wordsTotal: 537, wordsDialogue: 211
0.6667%
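
The Ratio and Deviation figures reported for the dialogue evaluator follow from the wordsDialogue and wordsTotal counts against the 30% target in the scenario. The Python sketch below reproduces that arithmetic. The function names are hypothetical, and the counting rules in `count_words` (whitespace-separated words, dialogue as anything between straight double quotes, deviation as an absolute difference) are assumptions, since the benchmark's own counting code is not shown here. How the percentage in the Result column is derived from the deviation is not stated, so the sketch does not attempt it.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: "30% dialogue"


def ratio_from_counts(words_dialogue: int, words_total: int,
                      target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (ratio, deviation) as fractions.

    Assumption: deviation is the absolute difference from the target.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def count_words(text: str) -> tuple[int, int]:
    """Return (words_dialogue, words_total) for a piece of prose.

    Assumption: a word is a whitespace-separated token, and dialogue is
    any text enclosed in straight double quotes. The benchmark's real
    rules may differ (for example, for curly quotes or scare quotes).
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return words_dialogue, words_total


# Counts taken from the Meta Data column of this run.
ratio, deviation = ratio_from_counts(211, 537)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 39.29%, Deviation: 9.29%"
```

Note that a naive quote-based counter like `count_words` would also treat quoted labels such as "bouncy castle" and "slip zone" as dialogue, so its counts for the response above may not match the reported 211 exactly.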