NC Bench
Model: Llama 3.2 90B (Vision)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Run ID: 2024-09-26-01J8Q0Q7F5TJ9RGFA4R7B2K2TY
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled in the unmarked van, parked a block away from the birthday bash. They were dressed in their best "concerned parents" attire, complete with forced smiles and awkwardly styled hairdos.
"Alright, let's review the plan," Molly said, studying the blueprint of the party venue on her phone. "We go in, blend in with the other parents, and wait for the kids to start doing the pinata thing."
"They're going to do a pinata?" Bill repeated, his eyes wide with excitement. "That's the best thing I've heard all week. I love a good pinata."
Molly shot him a withering look. "Focus, Bill. We're here to steal presents, not indulge in nostalgic revelry."
Bill held up his hands in mock defense. "I'm focused, I swear. Presents, got it. What's the estimated haul?"
Molly consulted her notes. "We're looking at around two hundred kids, each with an average of three to five gifts apiece. If we can get in and out within the hour, I estimate we can clear at least five grand worth of toys and swag."
Bill whistled. "Not bad for an afternoon's work. But have you thought about the risks? I mean, we're talking about a room full of sugared-up children and over-caffeinated parents. It's like walking into a hostage situation."
Molly shrugged. "We've handled crazier crowds. Remember the time we hit that Black Friday sale at Best Buy?"
Bill shuddered. "How can I forget? The horror, the humanity...the discounts."
Molly grinned. "Exactly. We're seasoned pros. We can handle a few screaming kids and some frazzled parents. Besides, we have our trusty disguises."
Bill adjusted his wig and fake mustache. "Yeah, because nothing says ' normal suburban parent' like a poorly applied toupee and a handlebar mustache."
Molly chuckled. "Hey, you look like a normal-ish person to me. Now let's get ready to put on a show."
As they prepared to exit the van, Bill turned to Molly with a mischievous glint in his eye. "You know, I've got a great idea for a diversion. While you're snagging the presents, I can –"
Molly held up a hand. "Don't. I don't want to know what kind of insane plan you've cooked up. Just stick to the script, okay?"
Bill pouted. "Fine, but I assure you, my plan would have involved a impressive display of breakdancing and at least three small pyrotechnic devices."
Molly rolled her eyes. "That's exactly what I'm afraid of."
id: "gen-BJK7CH87MMPvCv5qKwqKzUt6K7Su"
total_cost: 0.0005796
upstream_id: "1f7d8551-faa4-40ce-806a-2df056907d22"
created_at: "2024-09-26T11:16:23.161242+00:00" (Thursday, September 26th, 2024 at 11:16:23 AM GMT+00:00)
model: "meta-llama/llama-3.2-90b-vision-instruct"
app_id: 182717
streamed: true
cancelled: false
provider_name: "Fireworks"
latency: 480
moderation_latency: (null)
generation_time: 16223
finish_reason: "stop"
tokens_prompt: 68
tokens_completion: 572
native_tokens_prompt: 71
native_tokens_completion: 573
native_tokens_reasoning: (null)
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0005796
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 406
0.0000% | Dialogue to Total Word Ratio | Ratio: 71.92%, Deviation: 21.92% | neededClean: false; wordsTotal: 406; wordsDialogue: 292
0.0000%
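The ratio and deviation reported above follow directly from the two word counts (292 dialogue words out of 406 total, against the 50% target in the scenario). The sketch below shows one way such figures could be derived. It is not NC Bench's actual evaluator: the function names, the whitespace-based word splitting, and the rule that dialogue is any text between straight double quotes are all assumptions made for illustration, so counts on real responses may differ from the reported ones.

```python
import re


def ratio_from_counts(words_dialogue: int, words_total: int,
                      target_pct: float = 50.0) -> tuple[float, float]:
    """Return (ratio %, absolute deviation from target %), rounded to 2 places."""
    if words_total == 0:
        return 0.0, round(target_pct, 2)
    ratio_pct = words_dialogue / words_total * 100
    return round(ratio_pct, 2), round(abs(ratio_pct - target_pct), 2)


def count_words(text: str) -> tuple[int, int]:
    """Return (dialogue words, total words).

    Assumption: words are whitespace-separated tokens, and dialogue is
    whatever sits between a pair of straight double quotes.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return words_dialogue, words_total


if __name__ == "__main__":
    # Counts taken from the evaluation row above.
    print(ratio_from_counts(292, 406))

    # Hypothetical sample text, only to exercise the counting rule.
    sample = 'Molly said, "Focus, Bill." He nodded.'
    print(count_words(sample))
```

Under these assumptions, a response would land exactly on target only when half of its whitespace-separated words fall inside quotation marks; attributions such as "Molly said" count toward the total but not toward dialogue.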