NC Bench
Overview
About
Tests
Models
Model
Claude 2.0
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2024-07-19-01J35SS8M3HJGMPTPJWN9XR4F1
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Here is a 491 word scene with 30% dialogue between Molly and Bill planning a heist: Molly paced back and forth in the alleyway behind the community center, wringing her hands together nervously. "I don't know about this, Bill. It seems pretty risky, even for us." Bill leaned against the brick wall, arms folded casually across his chest. "Oh come on, it'll be easy. A quick in and out job." He paused, raising an eyebrow. "Unless you're getting cold feet?" "No, of course not!" Molly frowned, stopping her pacing to face him. "It's just...all those kids. And their parents. Seems like an awful lot could go wrong." "Yeah, if we screw up. But we won't." Bill shrugged. "Besides, rich folks throwin' a big fancy party in a place like this? They're just askin' to get ripped off if you ask me." Molly sighed, glancing around the corner towards the community center's entrance. Groups of overdressed parents and hyper children were filing inside, arms loaded with colorfully wrapped packages. "I guess you've got a point. They do seem like easy targets." "Exactly. We'll slip in with the crowd, snag the gifts, and disappear before anyone's the wiser." Bill mimed the smooth lift of a present. "Easy money." "Okay, okay. You've convinced me." Molly squared her shoulders. "But we'll have to do this carefully. I'll keep watch while you grab the loot. With all those people, it'll be pure chaos in there." Bill frowned. "Now hold on. I'm the better pickpocket and you know it. Why do I have to be the one grabbing presents while you stand there looking innocent?" "Because I'm a better lookout," Molly retorted. "I can distract people and keep their attention away from you. We need to play to our strengths here." Rolling his eyes, Bill reluctantly nodded. "Fine. But you owe me one after this." Molly grinned. "Deal. Now come on, looks like most of the guests have arrived. Let's slip inside and scope things out." They joined the queue of partygoers entering the community center. Inside was a raucous scene - throngs of children running amok while harried parents tried in vain to corral them. Molly shot Bill a knowing look as they stepped further inside. Perfect chaos for them to take advantage of. They melded into the crowd, glancing casually around as they formulated a plan. Across the room was a large table piled high with presents just waiting to be snatched. "There's our score," Bill murmured, jerking his chin towards it. "Told ya this would be easy money." Molly nodded, mind already whirring with ideas. "You circulate and grab small packages first. I'll head over to the snacks and cause a distraction if I need to." "Sounds good. Let's get to work." Bill rubbed his hands together eagerly before disappearing into the sea of people. Smirking to herself, Molly headed in the opposite direction. If all went smoothly, they'd be making off with quite the profitable haul. Just another day's work for a couple of expert thieves.
id
"gen-6nUkmqABhlL6hOVIaAfAKOVfKehu"
total_cost
0.018064
upstream_id
"msg_012wNKBSDUeowK1VgNMM4x3D"
created_at
"2024-07-19T15:30:04.955699+00:00"
Friday, July 19th, 2024 at 3:30:04 PM GMT+00:00
model
"anthropic/claude-2.0:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
46251
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
688
native_tokens_prompt
74
native_tokens_completion
728
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.018064
Evaluation details
Result
Evaluator
Details
Meta Data
75.1556%
Matches word count
n/a
neededClean
true
words
487
0.0000%
Dialogue to Total Word Ratio
Ratio: 52.57%, Deviation: 22.57%
neededClean
true
wordsTotal
487
wordsDialogue
256
37.5778%