NC Bench
Model: Claude 3 Haiku
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Run ID: 2024-07-19-01J35T7GMYBXFA5P7BCG18F6GX
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched in the shadows, their eyes fixed on the brightly lit house ahead. The air was thick with the sounds of laughter and music, a telltale sign that a children's party was in full swing. "This is it, Bill," Molly whispered, a mischievous grin spreading across her face. "All those presents, just waiting to be taken." Bill scratched his chin, a rare moment of hesitation flickering in his eyes. "I don't know, Molly. A children's party? That seems a little... low, even for us." Molly rolled her eyes and nudged him playfully. "Don't be such a softie. Think of all the valuable stuff those kids are getting. We can't let it go to waste, can we?" "I suppose not," Bill conceded, his resolve wavering. "But how do we get in without being seen?" Molly's eyes sparkled with excitement. "Leave that to me. I've got a plan." She crept forward, Bill reluctantly following, and they scouted the perimeter of the house. Molly spotted a partially open window on the second floor and nodded to Bill. "That's our way in. You boost me up, and I'll unlock the door from the inside." Bill sighed, hoisting Molly up to the window. "Why do I always end up being the muscle in these schemes?" "Because you're good at it, darling," Molly called out, disappearing through the window. A few tense minutes passed before Molly's head popped back out, a triumphant grin on her face. "All clear. Let's go." They slipped inside the house, their footsteps muffled by the sound of laughter and music from the party below. Molly led the way, her eyes scanning the hallway for any signs of trouble. "Alright, Bill, you take the bedrooms and see what you can find. I'll handle the presents downstairs." "Got it," Bill whispered, already heading towards the nearest door. Molly crept down the stairs, her heart racing with excitement. She could almost taste the thrill of the heist. Reaching the living room, she spotted a large pile of brightly wrapped gifts, each one more tempting than the last. 
"Jackpot," she murmured, already planning which ones to take. Suddenly, a high-pitched squeal echoed from the hallway, and Molly froze. "What was that?" a voice called out, and Molly cursed under her breath. "Bill, what did you do?" she hissed, her eyes darting towards the stairs. A moment later, Bill came barreling down the stairs, his arms full of stuffed animals and toy cars. "Run, Molly! They found me!" Molly didn't need to be told twice. Snatching a few of the choicest presents, she bolted after Bill, their laughter and the sounds of pursuit echoing through the house. As they burst out the front door and into the night, Molly couldn't help but feel a twinge of regret. "So much for a clean getaway," she muttered, but the adrenaline coursing through her veins was undeniably exhilarating.
id: "gen-GesDZIGPLVbk8gtum4HgqJHVM1wU"
total_cost: 0.0009675
upstream_id: "msg_01DPLNJUpRY4EfxxTFmBWgrA"
created_at: "2024-07-19T15:37:12.187931+00:00" (Friday, July 19th, 2024 at 3:37:12 PM GMT+00:00)
model: "anthropic/claude-3-haiku:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 6475
moderation_latency: (null)
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 647
native_tokens_prompt: 75
native_tokens_completion: 759
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0009675
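The reported total_cost can be cross-checked against the native token counts. The sketch below is not part of the benchmark. It assumes the per-token prices published for Claude 3 Haiku at the time (0.25 USD per million input tokens, 1.25 USD per million output tokens), and the function and constant names are hypothetical.

```python
# Assumed prices (not stated in this run record): USD per million tokens.
PRICE_IN_PER_MTOK = 0.25
PRICE_OUT_PER_MTOK = 1.25


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one request from its native token counts."""
    micro_usd = (prompt_tokens * PRICE_IN_PER_MTOK
                 + completion_tokens * PRICE_OUT_PER_MTOK)
    return micro_usd / 1_000_000


# native_tokens_prompt and native_tokens_completion from the metadata above.
cost = estimate_cost(75, 759)
print(f"{cost:.7f}")  # 0.0009675
```

Under those assumed prices the estimate agrees with the total_cost and usage values, which suggests billing is based on the native token counts (75 and 759) rather than the normalized ones (68 and 647).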
Evaluation details

Result: 43.3783%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 483

Result: 97.9614%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 30.64%, Deviation: 0.64%
Meta Data: neededClean: false, wordsTotal: 483, wordsDialogue: 148

70.6699%
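The ratio and deviation figures can be reproduced from the reported word counts. The sketch below is a rough illustration, not the benchmark's evaluator: it assumes dialogue means words inside straight double quotes and that words are whitespace-separated tokens, and the function name is hypothetical. It also checks the final 70.6699% figure, which is consistent with a plain average of the two evaluator results, although the page does not say how that figure is derived.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.30) -> dict:
    """Count words inside straight double quotes against all words (assumed method)."""
    total = len(text.split())
    spoken = sum(len(quote.split()) for quote in re.findall(r'"([^"]*)"', text))
    ratio = spoken / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": spoken,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


# Tiny illustrative sample, not the full model response.
sample = '"This is it, Bill," Molly whispered. "All clear."'
stats = dialogue_stats(sample)

# Reproduce the reported figures from wordsDialogue and wordsTotal.
reported_ratio = 148 / 483
reported_deviation = abs(reported_ratio - 0.30)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
# Ratio: 30.64%, Deviation: 0.64%

# Assumption: the last figure is the mean of the two evaluator results.
overall = (43.3783 + 97.9614) / 2
```

How the evaluators turn a 483-word count or a 0.64% deviation into their percentage results is not stated on the page, so the sketch does not attempt to reproduce them.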