NC Bench
Overview
About
Tests
Models
Model
Hermes 3 405B
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2024-08-26-01J68D15H5Y62TKWA51H67A42S
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill sat hunched over a rickety table in their hideout, a dilapidated apartment in the wrong part of town. The flickering fluorescent light cast an eerie glow on their faces as they pored over the blueprints of their next target: a lavish children's party at the opulent Vanderpshire mansion. "Okay, so here's the plan," Molly began, tracing her finger along the blueprint. "I'll infiltrate the party disguised as a clown, and you'll be my loyal assistant." Bill raised an eyebrow. "A clown? Really? That's the best you could come up with?" "Hey, don't knock it 'til you try it," Molly retorted. "Besides, who's gonna suspect a clown of stealing presents?" "I guess you have a point," Bill conceded. "So, what's my role in this little charade?" "You'll be in charge of the getaway vehicle," Molly explained. "Once I've loaded up all the presents, we'll make a run for it." "And how exactly do you propose I distract a hoard of screaming children while you're cleaning out the gift table?" Bill asked. Molly grinned mischievously. "That's where my clowning skills come in handy. I'll keep them entertained with balloon animals and face painting while you work your magic." Bill sighed, his skepticism evident. "Fine, but if this goes sideways, I'm blaming you." "Duly noted," Molly replied, unfazed. "Now, let's go over the guest list and see if there are any potential obstacles." As they studied the list, Molly's eyes widened. "Well, would you look at that? The mayor's kid is going to be there. Talk about a high-profile target." Bill groaned. "Great, just what we need. Extra security and a bunch of politicians' offspring running around." "Relax," Molly assured him. "I've got a plan for that too. We'll create a diversion to keep the security occupied while we make our move." "And what kind of diversion are we talking about here?" Bill asked warily. Molly's eyes twinkled with mischief. "Let's just say it involves a strategically placed whoopee cushion and a well-timed release of a dozen doves." Bill couldn't help but chuckle. "You're insane, you know that?" "But it's why you love me," Molly replied with a wink. "Now, let's get our gear together and go over the plan one more time. We've got a party to crash and some presents to steal." As they gathered their supplies, Molly couldn't help but feel a surge of excitement. This was going to be their biggest heist yet, and she was determined to pull it off without a hitch. With Bill by her side, she knew they could handle anything that came their way – even a room full of sugar-fueled children and overprotective parents. Little did they know, however, that the real challenge would come in the form of an unexpected guest – a pint-sized party-goer with a knack for uncovering secrets and a penchant for mischief.
id
"gen-DboRej8WOn28n7VSo8E3VkCYuoef"
total_cost
0
upstream_id
"chatcmpl-265752e6-959b-496c-be3f-574aa96d6727"
created_at
"2024-08-26T22:31:51.035917+00:00"
Monday, August 26th, 2024 at 10:31:51 PM GMT+00:00
model
"nousresearch/hermes-3-llama-3.1-405b"
app_id
182717
streamed
true
cancelled
false
provider_name
"Lambda"
latency
1781
moderation_latency
(null)
generation_time
34209
finish_reason
"stop"
tokens_prompt
67
tokens_completion
647
native_tokens_prompt
83
native_tokens_completion
648
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0
Evaluation details
Result
Evaluator
Details
Meta Data
20.1897%
Matches word count
n/a
neededClean
false
words
480
0.0000%
Dialogue to Total Word Ratio
Ratio: 56.88%, Deviation: 26.88%
neededClean
false
wordsTotal
480
wordsDialogue
273
10.0948%