NC Bench
Overview
About
Tests
Models
Model
Llama 3.1 405B
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2024-07-23-01J3G4ZXPXHWDT421VHBGZN48G
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly poured herself a cup of coffee and plopped down beside Bill at the kitchen table, where he was scrutinizing a layout of the children's party venue. "Okay, so we've got the bouncy castle, the ball pit, the photo booth...this is going to be a nightmare," Molly said, running her hand through her tangled brown hair. "Hey, it'll be fun," Bill said, his eyes crinkling at the corners as he smiled. "We get to relive our childhoods and make off with a bunch of awesome toys." "I don't know, man. I'm starting to feel a little bad about this one," Molly said, wincing as she took a sip of her coffee. "I mean, these are kids we're talking about. They're going to be devastated when they wake up and all their presents are gone." "Come on, Moll. We've stolen from worse people," Bill said, snatching the coffee cup out of her hand and taking a swig. "Remember that time we swiped the safes from the frat house? Now those guys were entitled. These kids, on the other hand, are just going to get more stuff from their parents anyway." "Yeah, but it's the principle of the thing," Molly said, exasperated. "The principle of the thing is that we need the cash and this is an easy score," Bill said, tapping his finger on the layout. "Now, let's get down to business. We've got the guest list, the catering schedule...oh, and I found out the party planner is a total flake, so we don't have to worry about her being on top of things." "Good, that's a relief," Molly said, standing up to rummage through the kitchen drawers for a granola bar. "I was worried we were going to have to deal with some hyper-competent Martha Stewart type." "Nah, this woman is more like...I don't know, the anti-Martha," Bill said, chuckling. "She's got a Pinterest board full of fail. Glitter explosions, cake collapses...it's like she has a reverse Midas touch." Molly snorted. "That's amazing. Okay, I'm feeling a little better about this now. What's the plan?" "Well, I was thinking we go in during the chaos of the parachute drop," Bill said, leaning back in his chair. "Everyone will be looking up, waiting for the parachuting clown to land, and we slip in unnoticed." "A parachuting clown?" Molly repeated, incredulous. "What kind of sick individual comes up with this stuff?" "I don't know, but I'm kind of excited to see it," Bill said, grinning. Molly shook her head. "You're such a weirdo. Alright, let's get to work. We've got some presents to steal."
id
"gen-lUBBn1KTBJbOo1QztubI4awEF1bg"
total_cost
0.002007
upstream_id
"b482baf4-59b3-4270-b8f1-4ea36033e449"
created_at
"2024-07-23T15:57:57.279726+00:00"
Tuesday, July 23rd, 2024 at 3:57:57 PM GMT+00:00
model
"meta-llama/llama-3.1-405b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
21608
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
102
tokens_completion
598
native_tokens_prompt
70
native_tokens_completion
599
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.002007
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
433
76.9992%
Dialogue to Total Word Ratio
Ratio: 72.29%, Deviation: 2.29%
neededClean
false
wordsTotal
433
wordsDialogue
313
38.4996%