NC Bench
Model: Claude Opus 4.6
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Run ID: 2026-02-08-01KGYY4WK6W6TYRRX2H5BZNT89
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly spread the blueprint across the sticky table at Denny's, knocking over a syrup dispenser in the process. The architectural plans for Chuck E. Cheese had cost her forty dollars and a footlong sub, which she considered highway robbery, but that was beside the point. They were the robbers now.
"Walk me through it again," Bill said, adjusting his reading glasses and squinting at the paper. "I want to make sure I understand the extraction protocol."
"It's a children's birthday party, Bill, not a diamond vault in Monaco."
"That's exactly what they want you to think." He tapped the blueprint with a grave finger. "You see this room here? That's the party zone. Saturday at two o'clock, little Timmy Henderson turns seven. His parents are loaded. We're talking high-end presents. LEGOs, Bill. The expensive LEGO sets."
Molly leaned back and crossed her arms. "I've had my eye on those Star Wars ones for years. You know what they retail for?"
"More than my car is worth," Bill admitted, which was not saying much since his car had been declared legally dead by three separate mechanics.
"So here's the plan. You go in through the front dressed as a party entertainer. I put you down as Bingo the Magnificent."
Bill's face darkened. "I told you, I don't do clown work anymore. Not after Phoenix."
"You're not a clown. You're a magician. There's a difference."
"The greasepaint doesn't know the difference, Molly."
She sighed and rubbed her temples. The waitress refilled their coffees without making eye contact, which Molly appreciated. They looked like exactly what they were: two middle-aged people plotting something ill-advised over a Grand Slam breakfast.
"Fine. No face paint. You wear the top hat, you do card tricks, and while every kid in that room is mesmerized by your stunning mediocrity, I slip in through the emergency exit and load the presents into the van."
"What about the parents?"
"They'll be at the salad bar. Trust me. The parents always go to the salad bar. It's the only edible thing in the building."
Bill folded his arms and stared at the ceiling, performing what Molly recognized as his thinking ritual. It looked physically painful. After a long moment, he lowered his gaze.
"What about the animatronic band? Those things have cameras in their eyes now."
"No they don't, Bill."
"I read it online."
"You read that the moon is a hologram online. Focus, please."
He picked up a crayon that had been left behind by a previous diner's child and began marking routes on the blueprint. His tongue poked out of the corner of his mouth. Molly watched him draw what appeared to be a series of increasingly confident arrows that led directly into the women's restroom.
"That's the bathroom, Bill."
He rotated the blueprint ninety degrees.
"That's still the bathroom."
He set down the crayon with dignity. "I'll drive the van."
"You'll drive the van," she agreed, and stole his last piece of bacon.
created_at: "2026-02-08T15:33:19.47048+00:00" (Sunday, February 8th, 2026 at 3:33:19 PM GMT+00:00)
model: "anthropic/claude-4.6-opus-20260205"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1850
moderation_latency: 133
generation_time: 22460
tokens_prompt: 61
tokens_completion: 641
native_tokens_prompt: 75
native_tokens_completion: 730
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.018625
router: (null)
provider_responses
0
id: "msg_01M9Awh7MKPQvLfKhSYnWdGJ"
status: 200
is_byok: false
latency: 1850
endpoint_id: "c926c048-563e-44b0-8434-95965bee924c"
provider_name: "Anthropic"
model_permaslug: "anthropic/claude-4.6-opus-20260205"
api_type: "completions"
id: "gen-1770564776-v3PdgBn686xepzZYoFmA"
upstream_id: "msg_01M9Awh7MKPQvLfKhSYnWdGJ"
total_cost: 0.018625
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
Evaluation details
Result | Evaluator | Details | Meta Data
99.9990% | Matches word count | n/a | neededClean: false, words: 499
83.4248% | Dialogue to Total Word Ratio | Ratio: 51.90%, Deviation: 1.90% | neededClean: false, wordsTotal: 499, wordsDialogue: 259
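The dialogue figures can be checked by hand: 259 dialogue words out of 499 total gives a ratio of about 51.90%, which is 1.90 percentage points away from the 50% target. The following is a minimal sketch of how such a ratio could be measured, assuming dialogue means any text between straight double quotes and words are whitespace-separated. The function names are hypothetical, and the evaluator's real word-splitting rules and the mapping from a 1.90% deviation to the 83.4248% score are not given in this record. Separately, the closing 91.7119% figure matches the mean of the two evaluator results, although the record does not say how it is computed.

```python
def dialogue_ratio(text: str, quote: str = '"') -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Assumption: dialogue is whatever sits between pairs of straight
    double quotes, and a word is any whitespace-separated token.
    """
    segments = text.split(quote)
    # Odd-indexed segments lie between an opening and a closing quote.
    dialogue_words = sum(len(seg.split()) for seg in segments[1::2])
    total_words = sum(len(seg.split()) for seg in segments)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Small illustrative sample, not the benchmark text.
sample = 'She sighed. "Focus, please." He nodded.'
print(dialogue_ratio(sample)[:2])  # (2, 6)

# Reproduce the reported figures from the recorded word counts.
reported_ratio = 259 / 499
print(f"{reported_ratio:.2%}")                         # 51.90%
print(f"{deviation_from_target(reported_ratio):.2%}")  # 1.90%

# The two evaluator results average to the closing figure.
print(round((99.9990 + 83.4248) / 2, 4))  # 91.7119
```

A quote-splitting approach like this miscounts if the text contains unbalanced quotes or curly quotes, so a production evaluator would likely need a cleaning pass first (which may be what the neededClean flag records).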
91.7119%
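The cost fields in the run metadata (usage and total_cost, both 0.018625) can be cross-checked against the native token counts (75 prompt, 730 completion). The sketch below assumes per-token rates of 5 USD per million input tokens and 25 USD per million output tokens; these rates are not stated anywhere in this record and are an assumption, but with them the arithmetic reproduces the recorded cost. It also derives an output rate, assuming generation_time is in milliseconds, which the record does not confirm. All function and constant names are hypothetical.

```python
# Assumed rates in USD per million tokens; NOT taken from the run record.
ASSUMED_USD_PER_MILLION_INPUT = 5.00
ASSUMED_USD_PER_MILLION_OUTPUT = 25.00


def estimate_cost(
    native_prompt_tokens: int,
    native_completion_tokens: int,
    in_rate: float = ASSUMED_USD_PER_MILLION_INPUT,
    out_rate: float = ASSUMED_USD_PER_MILLION_OUTPUT,
) -> float:
    """Estimate request cost in USD from native token counts."""
    total = native_prompt_tokens * in_rate + native_completion_tokens * out_rate
    return total / 1_000_000


def tokens_per_second(tokens: int, generation_time_ms: int) -> float:
    """Output rate, assuming generation_time is reported in milliseconds."""
    return tokens / (generation_time_ms / 1000)


# Values copied from the run metadata.
print(f"{estimate_cost(75, 730):.6f}")            # 0.018625
print(f"{tokens_per_second(730, 22460):.1f}")     # 32.5
```

The native token counts (75 and 730) rather than the normalized ones (61 and 641) are what reproduce the recorded cost under these assumed rates.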