NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYYE99EQXR2EYR4RJ2SXDX9
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn blueprint across the sticky table at Denny's, pushing aside two plates of picked-over Grand Slams to make room. The crayon-scrawled map showed every critical detail of the Hendersons' backyard, from the inflatable bounce house to the folding table where, according to Molly's intelligence, the presents would be stacked in a magnificent pyramid of gift wrap and curly ribbon. "We hit the party at exactly two fifteen," Molly said, tapping the map with a syrup-smeared finger. "That's when the clown arrives. Every kid, every parent, every grandparent with a camcorder is going to be watching a man in oversized shoes make balloon animals. Nobody watches the present table during balloon animals. It's a scientific fact." Bill leaned forward and squinted at the map. He turned it sideways, then upside down. "Did a child draw this?" "I drew it, Bill." "With crayons?" "I was out of pens. Focus, please." Bill focused. Or at least he made the face he always made when he was trying to focus, which involved a great deal of nostril flaring and very little actual comprehension. He had the look of a man who had peaked intellectually somewhere around the tenth grade and had been coasting on charm and flexible morals ever since. Molly had recruited him for his van, which was windowless and forgettable, and for his surprising ability to carry large quantities of things while running at full speed. She did not recruit him for strategy sessions, and yet here he was, poking holes in her crayon cartography. "What's this brown blob?" he asked. "That's the dog. Labradoodle. Name's Biscuit. Friendly but loud. I've got a pound of bacon in the van to keep him occupied." "And this orange blob?" "That's the piñata. Do not engage the piñata, Bill. The moment you touch that thing, you'll have fifteen six-year-olds descending on you like hyenas on a zebra carcass." Bill nodded slowly, absorbing this with the gravity of a man being briefed on a bank vault. He cracked his knuckles one at a time, a habit Molly found both disgusting and oddly reassuring. It meant he was committing. She outlined the exit strategy. They would load the presents into two large IKEA bags she had purchased specifically for the job, cut through the neighbor's yard where a conveniently broken fence panel provided passage, and reach the van parked on Mulberry Street in under ninety seconds. She had timed the route three times already, once while wearing the novelty sunglasses and floppy hat that would constitute their disguises. "What if a kid sees us?" Bill asked. "Kids see everything, Bill. They just don't process it correctly. We'll look like parents helping move presents inside. Confidence is the disguise." "And what's our take looking like?" Molly grinned. She pulled out her phone and showed him the birthday boy's mother's Instagram page, where every gift had been lovingly photographed and tagged. "LEGO sets, Bill. Twelve of them. A drone. Two iPads. And a karaoke machine." Bill's eyes went wide with a greed normally reserved for casino floors and all-you-can-eat buffets. "I'm in." He had been in since the bacon.
created_at
"2026-02-08T15:38:29.986955+00:00"
Sunday, February 8th, 2026 at 3:38:29 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2052
moderation_latency
(null)
generation_time
25053
tokens_prompt
61
tokens_completion
677
native_tokens_prompt
75
native_tokens_completion
790
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.020125
router
(null)
provider_responses
0
id
"msg_01FxSYPDZzzbMH3JB425trQg"
status
200
is_byok
false
latency
2050
endpoint_id
"c926c048-563e-44b0-8434-95965bee924c"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770565084-HzAWkQqsTtjqf47sZ0hj"
upstream_id
"msg_01FxSYPDZzzbMH3JB425trQg"
total_cost
0.020125
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0028%
Matches word count
n/a
neededClean
false
words
532
2.2293%
Dialogue to Total Word Ratio
Ratio: 38.72%, Deviation: 8.72%
neededClean
false
wordsTotal
532
wordsDialogue
206
1.1160%