NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGZ05K4VNJTJNS34CZNCGMBZ
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn blueprint across the sticky table of the diner booth, knocking over a bottle of ketchup in the process. The blueprint, rendered in crayon on the back of a Denny's placemat, depicted the Thompson residence in what could charitably be called architectural detail. "Alright, here's the situation," Molly said, tapping the placemat with a french fry. "Little Timmy Thompson turns seven on Saturday. His parents have gone absolutely berserk. We're talking premium gifts from thirty-two confirmed attendees." Bill squinted at the drawing. "Is that a dog or a security camera?" "That's Mrs. Thompson's mother. She sits on the porch. Doesn't move much but sees everything. Think of her as a fleshy surveillance system." "Wonderful. So how do we get past Grandma Panopticon?" "She naps at two. Like clockwork. That gives us a window of exactly forty minutes to infiltrate, locate the gift table, and extract the packages." Bill leaned back and crossed his arms. He had the look of a man who had robbed many stupid places but was beginning to suspect this might be the stupidest. "And we're doing this because?" "Because last year those little monsters got a combined haul worth four thousand dollars in toys, Bill. Four thousand. You know what we pulled on our last job? Eleven hundred dollars and a decorative gourd." "I liked that gourd." "The gourd is not the point. The point is that there's a mountain of unopened merchandise just sitting there in a suburban living room, guarded by nothing but balloon animals and a man in a rented Elmo costume." Bill picked up the placemat and rotated it, trying to make sense of the scribbled entry points. Molly had drawn arrows in four different colors, none of which she had bothered to explain in a legend. There was a small star near what appeared to be the backyard that simply read DANGER ZONE in block letters. "What's the danger zone?" he asked. "Bouncy castle. If we get near it, we're finished. Those kids will swarm. They can smell adult fear." "You're telling me our primary tactical threat is a bouncy castle." "I'm telling you to respect the bouncy castle, Bill. I've seen what happens to people who don't." He set the placemat down and rubbed his temples. Outside, a bus wheezed past the diner window, and a pigeon strutted along the sidewalk like it owned the mortgage on the whole block. "Fine. What's our disguise situation?" Molly reached under the table and produced a plastic bag. She pulled out two matching purple shirts emblazoned with the words PRINCESS SPARKLE PARTY ENTERTAINMENT. "We go in as the hired magicians. I already called and canceled the real ones." "I don't know any magic tricks, Molly." "Neither do real magicians. That's the whole point of magic." Bill stared at her for a long moment, then picked up his coffee and drained it like a man heading to war. "I want a bigger cut than the gourd job." "Done."
created_at
"2026-02-08T16:08:36.444857+00:00"
Sunday, February 8th, 2026 at 4:08:36 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1345
moderation_latency
163
generation_time
18871
tokens_prompt
61
tokens_completion
644
native_tokens_prompt
75
native_tokens_completion
731
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01865
router
(null)
provider_responses
0
id
"01730856-e0a6-47c4-833e-89bc896dca82"
status
200
is_byok
false
latency
1343
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770566896-ICZlJmSbdFHcZD8gXhZ6"
upstream_id
"01730856-e0a6-47c4-833e-89bc896dca82"
total_cost
0.01865
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9990%
Matches word count
n/a
neededClean
false
words
499
43.0042%
Dialogue to Total Word Ratio
Ratio: 54.11%, Deviation: 4.11%
neededClean
false
wordsTotal
499
wordsDialogue
270
71.5016%