NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYXPMW2CFR049KFAAS9FJAE
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The basement of Molly's apartment smelled like stale pizza and ambition, which Bill had always thought was a peculiar combination but one that seemed to follow his partner wherever she went. Molly spread the hand-drawn map across her coffee table, pushing aside three empty energy drink cans and what appeared to be a taxidermied squirrel she claimed was for another job entirely. "The target is Timmy Henderson's seventh birthday party," she announced with the gravitas of a general planning an invasion. "Saturday at two. The presents will be staged in the living room, northwest corner, approximately four feet from the bounce house perimeter." Bill squinted at the map. "Is that supposed to be a dog or a security guard?" "It's a labradoodle named Mr. Sprinkles. He's the only variable I'm worried about." "You're worried about a labradoodle." "He bit a mailman last spring. The man needed six stitches, Bill. Six." Bill leaned back in his chair, which creaked ominously under his weight. Molly had purchased her furniture from a going-out-of-business sale at a funeral home, and everything in her apartment seemed to be in a constant state of mourning for its own existence. "Walk me through the entry point again," he said. Molly traced her finger along a crude rendering of the Henderson backyard. "We go in through the side gate during the magic show. Every parent will be watching some guy in a cape pull rabbits out of things. Meanwhile, we slip in dressed as party entertainers." "What kind of entertainers?" "Clowns." Bill felt his stomach drop. "Absolutely not. We agreed after the circus incident that I would never wear face paint again." "That was different. That was mime paint. This is clown paint. Totally separate categories." "Paint is paint, Molly." She rolled her eyes with such theatrical exaggeration that Bill worried they might get stuck. "Fine. You can be a balloon animal artist. No paint required. Just a vest and a vague sense of inadequacy." Bill considered this. He did own several vests, and inadequacy was something he carried with him like a wallet. The plan itself was beautifully simple in that way Molly's plans always started before becoming catastrophically complicated. They would infiltrate during peak distraction, load the presents into large canvas bags disguised as equipment carriers, and exit through the back gate before anyone noticed that little Timmy's haul had vanished. "What about the parents?" Bill asked. "Wine. There's always wine at these things. By three o'clock, they'll be too busy complaining about their mortgages to notice anything." Molly pulled out two laminated badges that read "Party Pals Entertainment" in cheerful rainbow letters. "I had these made at the copy shop on Fifth. The teenager working there didn't even ask questions." Bill picked up his badge and examined the photo. "This isn't me." "Close enough. You both have faces." He couldn't argue with that logic, mostly because arguing with Molly was like trying to nail jelly to a cloud.
created_at
"2026-02-08T15:25:30.550815+00:00"
Sunday, February 8th, 2026 at 3:25:30 PM GMT+00:00
model
"anthropic/claude-4.5-opus-20251124"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1573
moderation_latency
(null)
generation_time
19751
tokens_prompt
61
tokens_completion
621
native_tokens_prompt
75
native_tokens_completion
715
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01825
router
(null)
provider_responses
0
id
"msg_01BDhkpuiB9frAEDfcRi8UQo"
status
200
is_byok
false
latency
1560
endpoint_id
"be883404-eb42-4b2d-b6e4-c7daa3aa8d62"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.5-opus-20251124"
api_type
"completions"
id
"gen-1770564309-Og5Ela6wXGikIY0XSZ7b"
upstream_id
"msg_01BDhkpuiB9frAEDfcRi8UQo"
total_cost
0.01825
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9190%
Matches word count
n/a
neededClean
false
words
497
0.0005%
Dialogue to Total Word Ratio
Ratio: 45.67%, Deviation: 15.67%
neededClean
false
wordsTotal
497
wordsDialogue
227
49.9597%