NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.5
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYY74MJN6NJ7PKV25FPCHMW
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The basement of Molly's apartment smelled like stale pizza and desperation, which Bill always found oddly comforting. He spread the hand-drawn map across the coffee table, knocking over three empty energy drink cans in the process. Molly squinted at his artwork. "Is that supposed to be a bouncy castle or a tumor?" "It's the inflatable obstacle course. Very important tactical consideration." Bill tapped the crude drawing with his pencil. "Kids go in one side, come out the other looking like they've survived a war. That's our window." She pulled her reading glasses down from her hair, which made her look like a librarian who'd made several questionable life choices. The glasses were stolen, naturally. Everything in Molly's life was stolen except her cat, and she had serious suspicions about where that animal came from. "Walk me through it again," she said. Bill cleared his throat and adopted his planning voice, which was slightly deeper and made him sound like a GPS navigator with trust issues. "Birthday party starts at two. Parents drop kids off like they're releasing hostages. By two-thirty, pure chaos. Screaming. Sugar. Someone will definitely vomit." "That's your in?" "The presents get stacked on a table near the back door. Everyone's focused on little Timmy or whatever his name is." "It's actually Bradley." Bill waved dismissively. "We slip in dressed as entertainment, load the gifts into the van, and we're gone before anyone notices." Molly leaned back and crossed her arms. The gesture reminded Bill of his mother, which was troubling for several reasons. "And what entertainment, exactly, are we supposed to be?" He reached into his backpack and produced two rainbow wigs and a pair of oversized shoes that looked like they'd been rejected from a circus for being too sad. "Absolutely not." "Clowns are invisible at children's parties. It's science." "It's humiliating, is what it is." She grabbed one of the wigs anyway, examining it like a detective inspecting evidence at a crime scene. The synthetic hair felt like it had been harvested from a very cheap doll. "Where did you even get these?" "You don't want to know." She really didn't. That was the foundation of their partnership. Bill acquired things through methods that Molly deliberately never questioned, and she provided the common sense that kept them both out of prison. So far the system had worked reasonably well, minus that incident in Tampa that they'd agreed never to discuss. The party was scheduled for Saturday at a community center on Maple Street. Bill had done reconnaissance, which in his case meant driving past twice and making assumptions. The presents at these events were substantial. Parents in that neighborhood competed through gift-giving like it was an Olympic sport. Molly studied the map again, calculating angles and exits and the likelihood that this whole thing would collapse into a spectacular failure. The odds weren't great, but they never were. "Fine," she said. "But I'm not making balloon animals."
created_at
"2026-02-08T15:34:31.161978+00:00"
Sunday, February 8th, 2026 at 3:34:31 PM GMT+00:00
model
"anthropic/claude-4.5-opus-20251124"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2139
moderation_latency
(null)
generation_time
20418
tokens_prompt
61
tokens_completion
615
native_tokens_prompt
75
native_tokens_completion
684
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.017475
router
(null)
provider_responses
0
id
"msg_01GMETFzBWaimASxtg4G9Kxo"
status
200
is_byok
false
latency
2139
endpoint_id
"be883404-eb42-4b2d-b6e4-c7daa3aa8d62"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.5-opus-20251124"
api_type
"completions"
id
"gen-1770564850-RqJx0k3C4iytPdgpiMuX"
upstream_id
"msg_01GMETFzBWaimASxtg4G9Kxo"
total_cost
0.017475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
99.3769%
Matches word count
n/a
neededClean
false
words
495
91.7396%
Dialogue to Total Word Ratio
Ratio: 31.31%, Deviation: 1.31%
neededClean
false
wordsTotal
495
wordsDialogue
155
95.5583%