NC Bench
Overview
About
Tests
Models
Model
Llama 3.2 11B (Vision)
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2024-09-26-01J8PZ4M6C4A2PXPP335S8WJ23
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her eavesdropping mirror, a fancy tool that allowed her to see the security camera without being seen herself. She panned it over the happy chaos of the children's party, the colorfully wrapped gifts stacked enticingly on the table. Bill wandered into the room, a look of focused distraction fixed on his face. "So, we're doing this, right?" he said, flopping onto the couch beside her. "Still a maybe, depending on what we can find out," Molly replied, not taking her eyes off the mirror. "Security team is supposed to come in at 4 pm for a 'safety inspection.'" Bill snorted. "These guys need a safety inspection? They've got enough razor wire to trip over and zero actual awareness of the world around them." Molly chuckled, nudging him with her elbow. "That's why they're the perfect mark. Now, help me decipher the layout. What's the high-voltage stuff around the back entrance?" Bill got up and started pacing. "Alright, let's see...the corner of Elsmont and Elm is a dead zone. Plenty of parked cars blocking the view, and the security posts are at the other end of the block. I think I can take out the cameras there without being noticed." Molly's eyes twinkled with amusement. "You and your cameras. What's your skill level on hotwiring a bicycle lock?" Bill shot her a mock-defensive look. "Two point five out of five, thank you very much. What's your angle on getting to the presents without setting off the alarms?" "Well, I took a peek at the entrance. Two guards at the door, very chatty. They're completely going to drop their posts to watch the magician's stupid balloon animal act. We just need to–" "To time it perfectly," Bill interrupted, snagging her wrist. "Your whole 'blatantly obvious' plan has a certain...appeal to it." Molly wiggled free. "Pay attention, genius. We've got to coordinate with the magician. Get him to clear a path, let us slip in unnoticed. You – " "I've already got the magician lined up," Bill said, flashing his most unconvincing big shot grin. "He's going to give us a discount on his show in exchange for me telling him some really bad jokes during the prep." Molly rolled her eyes. "Bad jokes? You're going to bribe him with crap improv." Bill winked. "Hey, my improv skills are top-notch." As the sun continued its inevitable march toward 3 pm, their chatter turned more business-like. Discussion veered to backup plan B and potential obstacles: babysitter alert systems, partygoer distractions, and an inescapable reality: one unfortunate party guest was apparently allergic to most commercial candy.
id
"gen-tmEFLoy3KLz7x0Ryt8mLtLNZcSRi"
total_cost
0.0001318
upstream_id
"10071c22-fad7-4f8f-a062-5177d3f90989"
created_at
"2024-09-26T10:48:33.391217+00:00"
Thursday, September 26th, 2024 at 10:48:33 AM GMT+00:00
model
"meta-llama/llama-3.2-11b-vision-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Fireworks"
latency
147
moderation_latency
(null)
generation_time
4937
finish_reason
"stop"
tokens_prompt
68
tokens_completion
587
native_tokens_prompt
71
native_tokens_completion
588
native_tokens_reasoning
(null)
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0001318
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
434
4.0165%
Dialogue to Total Word Ratio
Ratio: 61.98%, Deviation: 8.02%
neededClean
false
wordsTotal
434
wordsDialogue
269
2.0082%