NC Bench
Model
Llama 3.2 90B (Vision)
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2024-09-26-01J8Q0QJ2XN68F8YW2HSGES8MW
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly sipped her coffee, her eyes fixed on the catering brochure in front of her. "Okay, so the party's at 2 pm on Saturday. We've got a three-hour window before the bouncy castle gets deflated."
Bill adjusted his fedora, a ridiculous gesture considering they were sitting in a dingy diner at 8 am. "And you're sure the security is lax?"
"Lax is an understatement," Molly replied, flipping through the pages. "These are parents who think a 'bouncy castle' constitutes entertainment. They'll be too busy Instagramming their kids' sugar highs to notice us."
Bill snorted. "You know, I used to think we were a little scummy for stealing from charities. But kids' parties? That's a whole new level of low."
Molly shot him a withering look. "Hey, we're not stealing from the Kids. We're liberating valuable gifts from overly indulgent parents. It's practically a public service."
The waitress refilled their cups, giving Bill a flirtatious wink that he returned with a charming smile. "You know, sweetheart, we're just trying to make the world a fairer place, one PlayStation console at a time." The waitress giggled, oblivious to the absurdity of their conversation.
As she walked away, Molly rolled her eyes. "Seriously, Bill, focus. We need to discuss the getaway car."
"Right, right. I was thinking we use the old sedan with the fake mustache and Groucho glasses attached to the windshield. That thing's got more aliases than a fugitive on the run."
Molly groaned. "Not that ridiculous thing again. What's wrong with the van?"
"Where's the fun in that?" Bill asked, his grin mischievous. "Besides, the sedan's got character. It's like the Artful Dodger on wheels."
Molly sighed. "Fine, but if we get pulled over because the mustache decides to fall off, I'm blaming you."
As they chatted, two parents walked into the diner, their entourage of screaming children in tow. Molly nodded discreetly toward them. "Check out the competition. I bet they're spending a fortune on that mini iPad for little Timmy's birthday."
Bill's eyes lit up. "Mmm... mini iPad. That's like catnip to a thief like me. Now, let's get down to business. Who's got the skills to get past the piñata?"
Molly smiled sweetly. "I've been practicing my pinata-whacking skills, don't you worry. I can take down that thing like a pro."
The waitress returned, confused, as Bill burst into laughter. "Okay, okay, you're the piñata expert. Just don't get too cocky; we don't need any, ahem, 'accidental' whacks to the getaway car's tires."
Molly kicked him under the table. "Shut up, Bill. Let's get out of here before we raise any more suspicions."
As they left the diner, Bill muttered, "You know, sometimes I wonder how we've managed to evade arrest this long."
Molly grinned, eyes glinting with mischief. "It's simple, Bill: we're professionals. Now let's go steal us some birthday presents."
id
"gen-8itb7gTaG3L68i6qdznCFsxyvxQl"
total_cost
0.000684
upstream_id
"fc2ac6fc-4ed4-4c87-a1ba-2abbd1a5e36d"
created_at
"2024-09-26T11:16:37.2963+00:00"
Thursday, September 26th, 2024 at 11:16:37 AM GMT+00:00
model
"meta-llama/llama-3.2-90b-vision-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Fireworks"
latency
287
moderation_latency
(null)
generation_time
19604
finish_reason
"stop"
tokens_prompt
68
tokens_completion
688
native_tokens_prompt
71
native_tokens_completion
689
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.000684
Evaluation details
Result | Evaluator | Details | Meta Data
6.0907% | Matches word count | n/a | neededClean: false, words: 477
0.0000% | Dialogue to Total Word Ratio | Ratio: 69.18%, Deviation: 19.18% | neededClean: false, wordsTotal: 477, wordsDialogue: 330
3.0453%
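The ratio and deviation reported above can be reproduced from the two word counts. The following is a minimal sketch, not the evaluator NC Bench actually runs: the function names are hypothetical, it assumes words are whitespace-separated tokens, it assumes dialogue is any text between straight double quotes, and it assumes the deviation is the absolute distance from the 50% target named in the scenario.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed definition of a word)."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed definition of dialogue)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in spans)


def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Tiny illustrative sample, not taken from the evaluator.
sample = 'Molly sighed. "Fine, but I\'m blaming you." Bill grinned.'
print(count_words(sample), count_dialogue_words(sample))  # 9 5

# Word counts reported in the evaluation meta data for this run.
ratio, deviation = dialogue_ratio(words_dialogue=330, words_total=477)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 69.18%, Deviation: 19.18%
```

Under these assumptions the reported figures follow directly from 330 / 477. How the deviation maps onto the 0.0000% result for this evaluator is not stated on the page, so that step is left out of the sketch.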