NC Bench
Model: Claude Opus 4.5
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Run ID: 2026-02-08-01KGYX0YQ0GTX5DN4X9TH49JT8
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled like stale coffee and regret, which Molly had come to recognize as the official scent of their operations. She spread the hand-drawn map across the dashboard, squinting at Bill's artistic interpretation of the Henderson property.

"Is that a dog or a swimming pool?" she asked, tapping a brown blob near what she assumed was the back entrance.

Bill leaned over, his reading glasses sliding down his nose. "That's Mrs. Henderson. I drew her to scale."

Molly decided not to pursue that line of questioning. Through the windshield, she could see the target: a modest suburban home currently being invaded by a swarm of seven-year-olds hopped up on sugar and the promise of cake. Balloons bobbed cheerfully from the mailbox, mocking their criminal intentions.

"Run me through the timeline again," she said.

Bill consulted his phone, which displayed a schedule he'd obtained through what he called "advanced reconnaissance" and what Molly called "stalking a mommy blog." He cleared his throat with the gravity of a general briefing troops before battle.

"The party started at two. They've done the bounce house and the face painting. Pin the tail on the donkey is happening now, which means the kids are blindfolded and disoriented. Perfect cover."

"And the presents?"

"Stacked in the dining room. I counted at least fifteen through the window. We're talking premium haul here, Molly. Legos, probably some video games, maybe even one of those robot dogs that does backflips."

Molly watched a father stumble out the front door, his face painted like a butterfly, his eyes hollow with exhaustion. He sat on the porch steps and stared at nothing, a broken man. She felt a kinship with him.

The plan was simple, as all their plans started out. They would enter through the side gate during the cake ceremony, when all attention would be focused on the birthday boy and his sugar-crusted moment of glory. Bill would create a distraction if needed, though his distraction ideas historically ranged from questionable to legally actionable. Molly would grab what she could carry, they'd rendezvous at the van, and they'd be three towns over before anyone noticed the gift table had been liberated.

"What's our exit strategy if things go sideways?" Molly asked.

Bill grinned, producing two items from his jacket pocket. "I brought backup. Noise-canceling headphones and a bag of organic fruit snacks. If any kid spots us, we bribe them. If they start screaming, we can't hear it."

"That's not how noise-canceling headphones work."

"It's not?"

Molly rubbed her temples. She had a master's degree in art history. She had once been employee of the month at a respectable auction house. Now she was plotting to steal presents from a child named Tyler who, according to the mommy blog, really really really wanted a dinosaur tent.

"Let's just do this," she sighed.
created_at: "2026-02-08T15:13:38.965409+00:00" (Sunday, February 8th, 2026 at 3:13:38 PM GMT+00:00)
model: "anthropic/claude-4.5-opus-20251124"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1484
moderation_latency: (null)
generation_time: 19395
tokens_prompt: 61
tokens_completion: 601
native_tokens_prompt: 75
native_tokens_completion: 684
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.017475
router: (null)
provider_responses:
  0:
    id: "msg_01QHw7prWFjowjuconeF1RbW"
    status: 200
    is_byok: false
    latency: 1484
    endpoint_id: "be883404-eb42-4b2d-b6e4-c7daa3aa8d62"
    provider_name: "Anthropic"
    model_permaslug: "anthropic/claude-4.5-opus-20251124"
api_type: "completions"
id: "gen-1770563599-4E9XTkgg5UUXTtwrfCfO"
upstream_id: "msg_01QHw7prWFjowjuconeF1RbW"
total_cost: 0.017475
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 20.1897% | Matches word count | n/a | neededClean: false, words: 480 |
| 96.5874% | Dialogue to Total Word Ratio | Ratio: 30.83%, Deviation: 0.83% | neededClean: false, wordsTotal: 480, wordsDialogue: 148 |
58.3885%
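The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator follow directly from the two word counts (148 dialogue words out of 480 total, against the 30% target in the prompt). The Python sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The function names, the whitespace tokenization, and the rule that dialogue is whatever sits between straight double quotes are assumptions made for illustration; the page does not describe how NC Bench counts words or how it converts the deviation into the percentage score.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (total_words, dialogue_words, ratio) for a prose passage.

    Assumption: words are whitespace-separated tokens, and dialogue is any
    text enclosed in a pair of straight double quotes.
    """
    total_words = len(text.split())
    # Collect every span that sits between two straight double quotes.
    spoken = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(chunk.split()) for chunk in spoken)
    ratio = dialogue_words / total_words if total_words else 0.0
    return total_words, dialogue_words, ratio


def deviation_from_target(ratio: float, target: float = 0.30) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Reproduce the reported figures from the counts in the Meta Data column.
reported_ratio = 148 / 480
print(f"Ratio: {reported_ratio:.2%}")                             # Ratio: 30.83%
print(f"Deviation: {deviation_from_target(reported_ratio):.2%}")  # Deviation: 0.83%
```

A quote-based counter like this one also counts quoted phrases that are not speech, such as "advanced reconnaissance" in the response above, so its totals may not match the evaluator's own counts.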