NC Bench
Model: Llama 3.2 90B (Vision)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Run ID: 2024-09-26-01J8Q0TDCMEWZE2BZR28HZQ9QX
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sun was setting over the suburban streets, casting a warm orange glow over the neatly manicured lawns and perfectly placed sprinkler systems. Molly and Bill sat in their getaway car, a nondescript gray sedan with a wonky air conditioner, parked a few houses away from their target. They were discussing the finer points of their plan, one that involved breaking into a children's birthday party and making off with all the presents. "I'm telling you, Bill, this is a goldmine," Molly said, her eyes scanning the surroundings. "Kids' parties are always crawling with expensive gifts. iPads, video games, the latest must-have toys... it's like a treasure trove of resale value." Bill rubbed his chin thoughtfully, his scruffy beard scratching against his fingers. "Yeah, but have you considered the security? I mean, these parties can get pretty wild. There's gonna be screaming kids, frazzled parents... we're gonna stick out like sore thumbs." Molly waved a hand dismissively. "Security, schmeecurity. We've got costumes, Bill. We're going in as clowns. Who's gonna suspect a pair of clowns at a kids' party?" Bill raised an eyebrow. "Clowns? You mean, like, full-on face paint and baggy pants clowns?" Molly grinned mischievously. "The works. We'll be the life of the party... until we make our move, that is." Bill sighed, running a hand through his thinning hair. "You're enjoying this a little too much, Molls. Remember, this is about the score, not about having fun." Molly's grin didn't falter. "Oh, I'm having fun, all right. And I'm going to have even more fun when we're laughing all the way to the bank." The two thieves sat in silence for a moment, watching as a group of noisy children spilled out of the party house and ran around the backyard. Molly leaned over, peering at the house through a pair of high-tech binoculars. "Okay, I see the cake table. And... oh boy, I see a table filled with presents. We're talking Lego sets, PlayStation games... 
these kids are living the high life." Bill perked up, his eyes lighting up with interest. "What about the parents? How many are there?" Molly scanned the area, her voice low and serious. "I count at least five adults, maybe six. But they're all distracted by the bouncy castle and the pony rides. We've got a small window of opportunity here." Bill nodded, a plan forming in his mind. "Alright, let's get this show on the road. Time to paint on those clowns smiles and get to work." Molly let out a whoop of excitement, grabbing Bill's arm and pulling him towards the party house. "Last one to the cake table has to wear the bright red nose!" As they slipped out of the getaway car, Bill turned to Molly with a mock-serious expression. "You know, I'm only doing this for the presents. Not because I secretly love wearing face paint and scaring small children." Molly just laughed, the sound ringing out into the evening air as they made their way towards the unsuspecting partygoers.
id: "gen-Trkpjtq8XigrtBOPfZvHoM9QxVwF"
total_cost: 0.0006777
upstream_id: "805c7717-e335-4706-9ddd-14bc7ebd2b3e"
created_at: "2024-09-26T11:18:11.777825+00:00" (Thursday, September 26th, 2024 at 11:18:11 AM GMT+00:00)
model: "meta-llama/llama-3.2-90b-vision-instruct"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Fireworks"
latency: 443
moderation_latency: (null)
generation_time: 20452
finish_reason: "stop"
tokens_prompt: 68
tokens_completion: 681
native_tokens_prompt: 71
native_tokens_completion: 682
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0006777
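The timing and cost fields above can be turned into a throughput and a per-token rate. The following is a minimal sketch, not part of the benchmark itself: it assumes `generation_time` is reported in milliseconds and that `total_cost` covers prompt and completion tokens at a single flat rate. The page states neither the units nor the pricing scheme.

```python
# Values copied from the Meta fields of this run.
meta = {
    "total_cost": 0.0006777,
    "generation_time": 20452,
    "native_tokens_prompt": 71,
    "native_tokens_completion": 682,
}

# Assumption: generation_time is in milliseconds.
seconds = meta["generation_time"] / 1000
tokens_per_second = meta["native_tokens_completion"] / seconds

# Assumption: one flat rate applies to prompt and completion tokens alike.
native_total = meta["native_tokens_prompt"] + meta["native_tokens_completion"]
cost_per_million = meta["total_cost"] / native_total * 1_000_000

print(f"{tokens_per_second:.1f} completion tokens per second")  # 33.3
print(f"{cost_per_million:.2f} per million native tokens")      # 0.90
```

Under those assumptions the run produced roughly 33 completion tokens per second, and the recorded cost matches a rate of 0.90 per million native tokens across all 753 tokens.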
Evaluation details
Result | Evaluator | Details | Meta Data
95.9868% | Matches word count | n/a | neededClean: false, words: 508
0.0000% | Dialogue to Total Word Ratio | Ratio: 53.15%, Deviation: 23.15% | neededClean: false, wordsTotal: 508, wordsDialogue: 270
47.9934% (equal to the mean of the two results above)
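The dialogue ratio reported above can be checked by hand: 270 dialogue words out of 508 total gives 53.15%, which is 23.15 percentage points above the 30% target. The sketch below shows one way such a ratio could be measured. It is an illustration only: `dialogue_stats` is a hypothetical helper, it assumes dialogue is whatever sits between straight double quotes, and it does not reproduce how the evaluator converts a deviation into a result percentage, which the page does not document.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.30) -> dict:
    """Count whitespace-separated words inside straight double quotes
    versus all words in the text (hypothetical helper)."""
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


# Small made-up sample, not taken from the run.
sample = 'Molly grinned. "We go in as clowns," she said. "Nobody suspects clowns."'
stats = dialogue_stats(sample)
print(stats["wordsDialogue"], stats["wordsTotal"])  # 8 12

# Figures recorded for this run: 270 dialogue words out of 508.
recorded_ratio = 270 / 508
recorded_deviation = recorded_ratio - 0.30
print(f"{recorded_ratio:.2%} {recorded_deviation:.2%}")  # 53.15% 23.15%
```

A real evaluator may tokenize differently (for example, handling curly quotes or dialogue tags), so counts from this sketch need not match the recorded `wordsDialogue` exactly.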