NC Bench
Overview
About
Tests
Models
Model
Claude 2.0
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2024-07-19-01J35T2RVAGVX3CTNN3QD32BXT
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Here is a 499 word scene with 30% dialogue between Molly and Bill planning a heist: Molly tapped her foot impatiently as she waited in the alley behind Jolly Jack's Jamboree. She checked her watch again and rolled her eyes. Bill was late, as usual. This was an important meeting to finalize their plans for the heist tonight, and he was probably off goofing around somewhere instead of taking this seriously. Finally, she heard footsteps approaching and looked up to see Bill sauntering towards her, a ridiculous grin on his face. "Heya Moll Doll, sorry I'm late. I got a little caught up with some business." Molly glared at him. "Business? Do you mean you were busy stuffing your face with pastries at the bakery again? We've got a job to pull off tonight. I need you focused." "Aw don't get your knickers in a twist, I'm focused, I'm focused," Bill said, putting his hands up in mock surrender. "So what's the plan, boss lady?" "Okay, here's the deal," Molly said, lowering her voice. "There's a big birthday party happening at six o'clock tonight for some rich kid named Mikey. All his fancy little friends will be there and so will a mountain of presents." Bill's eyes lit up. "Presents you say? Those are my favorite two syllables." Molly nodded. " Exactly. While the kids are busy stuffing cake in their faces and the parents are distracted, we sneak in through the back door of Jolly Jack's. I'll pick the lock while you stand lookout. Then we grab as many presents as we can carry and sneak back out to load up the van." "And then it's off to Fence Freddy's with our loot," Bill chuckled while rubbing his hands together gleefully. "This is our biggest haul yet if we pull it off," Molly said. "So I need you focused tonight. No goofing off!" "You can count on me!" Bill declared, puffing out his chest. "I'm a model of professionalism." Molly looked at him skeptically. "Uh huh. Well just stick to the plan and keep the joking around to a minimum tonight, okay?" "You got it, boss," Bill said, giving her a mock salute. Molly checked her watch again. "Okay, we've got three hours until party time. Let's go over the plan again to make sure we didn't miss anything." They hunched over the crude map Molly had drawn in the dirt and talked through the plan step-by-step. Where they would park the van, how to get to the back door without being seen, what excuses to use if they did get caught. After hashing out the details, Molly felt confident they had covered everything. This was going to be their biggest payout ever. The van was ready, their costumes were prepped. Now they just had to wait for the right moment to strike. As they slipped out of the alley and went their separate ways to prepare, Molly called over her shoulder, "And don't be late this time! I'll leave without you if I have to." Bill just chuckled and waved as he sauntered off down the street.
id
"gen-38I7WWLlO7AfqrssOBEanC7Lsi5N"
total_cost
0.018136
upstream_id
"msg_015886RjXRtVK7WEC5U1Decs"
created_at
"2024-07-19T15:35:29.109605+00:00"
Friday, July 19th, 2024 at 3:35:29 PM GMT+00:00
model
"anthropic/claude-2.0:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
45091
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
675
native_tokens_prompt
74
native_tokens_completion
731
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.018136
Evaluation details
Result
Evaluator
Details
Meta Data
99.7443%
Matches word count
n/a
neededClean
true
words
504
0.0000%
Dialogue to Total Word Ratio
Ratio: 50.60%, Deviation: 20.60%
neededClean
true
wordsTotal
504
wordsDialogue
255
49.8722%