NC Bench
Overview
About
Tests
Models
Model
Llama 3.2 90B (Vision)
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2024-09-26-01J8Q0WA35Q2WX6MVX346G2B9V
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly poured herself a glass of wine as she studied the schematics of the children's play center on her laptop. Bill, her partner in crime, paced back and forth behind her, rubbing his hands together in excitement. "Alright, so we've got the party schedule, the security system, and the layout of the place. What's the plan, Stan?" Bill asked, using his favorite alias. Molly raised an eyebrow. "Stan is a middle-aged man, Bill. You're not fooling anyone with that name." "Hey, it's a classic. Okay, fine. What's the plan, Molly?" Molly smiled sweetly. "We go in as the piñata fillers. They're always hiring new staff, so they won't suspect a thing." "And then we make our move during the kid's playtime chaos?" "Exactly. The noise, the screaming, the sugar highs – it's the perfect cover. We slip into the present room, grab the good stuff, and make a quick exit through the emergency door." Bill chuckled. "I love how they think they can secure anything with a flimsy door and a sign that says 'Authorized Personnel Only.'" Molly grinned. "Amateurs. Now, let's discuss our disguises. I've got a wig, a tutu, and a name tag that says 'Binky."' Bill snorted. "You're going as a children's party entertainer. What's my excuse for being there?" "You're my trusty sidekick, Bubbles. You'll be wearing a bright orange vest and a name tag that says 'Bubble Blower Extraordinaire.'" Molly handed Bill a folder full of ridiculous-looking ID badges and outfits. Bill leafed through them, making incredulous faces. "Bubble Blower Extraordinaire? You know I'm a professional, right?" "Hey, someone's got to fill the bubble machines, and it's not going to be me. I have to keep the kids distracted while you do your thing." Bill shook his head. "You're going to get us caught with that wig. It's...unsettling." Molly adjusted her bright pink afro. "I can blend in with the best of them. Now let's go over the checklist one more time. We've got three minutes to get in, grab the goods, and get out." Bill nodded. "I've got it memorized. Okay, so we enter at 2 PM, present room is third door on the left, disable the security cameras, grab the presents, and out the emergency door by 2:05 PM." Molly raised her glass in a toast. "To a successful heist and a kid-friendly exit strategy." Bill clinked his glass against hers. "May the odds be ever in our favor...or at least may the kids be distracted." The two thieves shared a laugh before their eyes turned serious. They knew that in their line of work, a well-executed plan was the difference between success and a group of sugar-fueled kids tackling them to the ground. Molly stood, grabbed her bag, and headed for the door. "Let's go make some bad decisions and get some great presents."
id
"gen-gISkhA8DJul2Ky2jryJAvMojCqKg"
total_cost
0.0006615
upstream_id
"5207ba63-e28a-4b70-98eb-330f00d75e98"
created_at
"2024-09-26T11:19:21.699289+00:00"
Thursday, September 26th, 2024 at 11:19:21 AM GMT+00:00
model
"meta-llama/llama-3.2-90b-vision-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Fireworks"
latency
511
moderation_latency
(null)
generation_time
28158
finish_reason
"stop"
tokens_prompt
68
tokens_completion
663
native_tokens_prompt
71
native_tokens_completion
664
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0006615
Evaluation details
Result
Evaluator
Details
Meta Data
3.6234%
Matches word count
n/a
neededClean
false
words
476
0.0000%
Dialogue to Total Word Ratio
Ratio: 68.28%, Deviation: 18.28%
neededClean
false
wordsTotal
476
wordsDialogue
325
1.8117%