NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2025-02-24-01JMX18V1CEREC7T6N7VHR7P1E
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly drummed her fingers on the table, staring at the crude map Bill had sketched on a coffee-stained napkin. The children's party venue looked like a fortress on paper, albeit one decorated with balloons and streamers. "So explain this to me one more time," she said, twirling a lock of red hair around her finger. "We're seriously planning to rob a bunch of six-year-olds?" Bill leaned forward, his eyes bright with excitement. "Not the kids themselves, Mols. The presents. Do you have any idea how much loot is sitting in that gift pile? We're talking tablets, gaming systems, designer clothes - rich parents trying to outdo each other." "Seems pretty low, even for us," Molly replied, though her interest was clearly piqued. "What's the setup again?" Bill pointed to various spots on his napkin diagram. "The birthday boy is Timmy Winchester, son of that tech mogul. Party's at Funland Adventure Zone from two to five on Saturday. Gift table is here, by the rock climbing wall." "And security?" Molly asked, raising an eyebrow. "One bored teenager checking wristbands at the entrance. Parents drop the kids off and most leave to enjoy their freedom. The few that stay are usually face-deep in their phones in the waiting area." Molly took a sip of her coffee. "So what's the plan? We can't just walk in and grab everything." "That's where this gets good," Bill said, grinning. "We go in as entertainment. I called the place pretending to be Winchester's assistant. Said the boss wanted to surprise everyone with extra characters." "Characters?" Molly's eyes narrowed suspiciously. "I got us dinosaur costumes. T-Rex to be specific." "You want me to dress as a dinosaur?" Molly's voice rose sharply. "Have you completely lost your mind?" "Think about it!" Bill insisted. "Arms are hidden inside, perfect for swiping presents. We entertain the kids for fifteen minutes, then one of us creates a distraction while the other hits the gift table. We stuff everything in the costume tails and waddle out." "This is ridiculous," Molly muttered. "What kind of distraction?" Bill shrugged. "I figured you could pretend to get sick. Nothing scares parents more than vomit near their precious offspring. They'll all be looking at you while I clean out the table." "Charming. And the costumes? Where exactly did you get those?" "My cousin works in theater." "The same cousin who got us those police uniforms that fell apart during the jewelry store job?" Bill shifted uncomfortably. "He's improved since then." Molly stared at him for a long moment before sighing. "You know what the worst part is? This might actually work. But if those dinosaur costumes rip, or if one kid sees us taking their precious new PlayStation..." "It's foolproof," Bill said confidently. "In and out in thirty minutes with a haul that'll keep us comfortable for months." "Fine," Molly conceded. "But next time, we're back to banks. At least they have adults' money." "So we're on?" Bill asked, his face lighting up. "Yeah," Molly said, draining her coffee. "Let's go steal from some kindergartners."
id
"gen-1740436173-JbzfFdoy8Ld8eyYRpwLB"
upstream_id
"msg_01ERzzL3XaMHzk7b3sF2S745"
total_cost
0.012015
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:29:49.467787+00:00"
Monday, February 24th, 2025 at 10:29:49 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
347
moderation_latency
(null)
generation_time
14360
tokens_prompt
68
tokens_completion
683
native_tokens_prompt
75
native_tokens_completion
786
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.012015
Evaluation details
Result
Evaluator
Details
Meta Data
81.2727%
Matches word count
n/a
neededClean
false
words
512
0.0000%
Dialogue to Total Word Ratio
Ratio: 74.41%, Deviation: 24.41%
neededClean
false
wordsTotal
512
wordsDialogue
381
40.6364%