NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.5
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYYY7MMAT1RQHQD17V0MQ47
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crayon-drawn map across the sticky diner table, knocking over the ketchup bottle with her elbow. "Alright, Bill, this is serious business. Little Timmy Henderson is turning seven, and his parents have gone all out." Bill squinted at the colorful scribbles. "Did you draw this yourself?" "I traced it from his sister's homework folder. Focus, Bill. We've got a window of exactly two hours between the arrival of Bongo the Clown and the cutting of the cake." Bill scratched his stubbled chin. "Why are we doing this again? These are children, Molly. Actual small humans." "Because little Timmy's father is Marcus Henderson, the man who fired me from Henderson Electronics for allegedly stealing printer paper." "You did steal printer paper." "That's beside the point. The point is revenge, served cold, wrapped in dinosaur paper." The waitress refilled their coffees without making eye contact. She had learned not to ask questions about the strange pair who always ordered pancakes and spoke in hushed tones about bounce houses. Bill studied the map more closely. "What's this red X in the backyard?" "That's the present table. Sandra Henderson always puts it near the inflatable slide. The woman is predictable as clockwork." "And the blue circles?" "Those are the parents. They'll be clustered around the cooler, drinking wine from plastic cups and pretending their lives aren't hollow." Bill leaned back in the booth, crossing his arms. "How do we get past them without being noticed?" Molly grinned, pulling a crumpled shopping bag from beneath her seat. "Costumes. I managed to acquire two official Bongo the Clown assistant outfits." "There's no way I'm wearing that." "You absolutely are wearing it. The polka dots are slimming." "Molly, I have dignity." "You have a criminal record and a timeshare in Delaware. Dignity left the building years ago." Bill grabbed the bag and peered inside, his face falling at the sight of the rainbow wig. He imagined himself in the getup, squeaking across suburban grass while small children watched with their sticky fingers and judging eyes. "Fine. But I want sixty percent of whatever we fence." "These are children's toys, Bill. We're not fencing them." "Then what exactly is the plan?" Molly's eyes sparkled with mischief. She pulled out a second map, this one printed on actual paper. "We donate everything to the Riverside Orphanage. Every last present ends up with kids who actually need them." Bill stared at her for a long moment. "We're stealing from a birthday party to give to orphans?" "Think of it as aggressive philanthropy." "That's the most ridiculous thing I've ever heard." "You're still going to help me, aren't you?" Bill sighed heavily, grabbing his rainbow wig and placing it experimentally on his head. "What time does this disaster start?" "Saturday at noon. Bring your squeaky shoes."
created_at
"2026-02-08T15:47:04.467208+00:00"
Sunday, February 8th, 2026 at 3:47:04 PM GMT+00:00
model
"anthropic/claude-4.5-opus-20251124"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1928
moderation_latency
153
generation_time
16597
tokens_prompt
61
tokens_completion
617
native_tokens_prompt
75
native_tokens_completion
713
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0182
router
(null)
provider_responses
0
id
"8d4d613c-e491-486c-902b-6ba5e42772f2"
status
200
is_byok
false
latency
1924
endpoint_id
"de32581c-2e11-4f54-8d29-e8a6e19d3038"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-opus-20251124"
api_type
"completions"
id
"gen-1770565607-f2sOkm3zYz8CxbpmRTzL"
upstream_id
"8d4d613c-e491-486c-902b-6ba5e42772f2"
total_cost
0.0182
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0028%
Matches word count
n/a
neededClean
false
words
468
0.0019%
Dialogue to Total Word Ratio
Ratio: 64.74%, Deviation: 14.74%
neededClean
false
wordsTotal
468
wordsDialogue
303
0.0023%