NC Bench
Model: Llama 3.1 8B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2024-07-23-01J3G82ERF9T0FKS1T6ZWX33CZ
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the beat-up Subaru, sipping on a lukewarm cup of coffee as Bill paced back and forth across the parking lot. "So, we're in. We've got the blueprints, we've got the schedule... we've got the all-important naptime estimate," Bill said, ticking off each point on his fingers. "And we've got the best disguises ever," Molly added, grinning. "Who wouldn't trust a guy dressed as a giant frog and a woman with a haircut that says 'I'm trying to relive my MySpace days'?" Bill shot her a look, but a small smile played on his lips. "Hey, this is a legitimate business. We have investors. Okay, not investors, but we have friends who have friends who own laser tag places and think it's a great idea to invest in us." Molly snorted. "Yeah, because nothing says 'legitimate business' like a guy who invested in a solid gold toilet." "Hey, that was a tax write-off!" Bill protested. "And it was a great conversation starter at parties. 'Oh yeah, we've got a solid gold toilet in the office.' People are just intimidated by that kind of wealth." Molly raised an eyebrow. "I think you might be the only person I've ever met who's been to a party where the conversation starter is a solid gold toilet." Bill stopped pacing and leaned against the car, next to Molly. "Well, when you're a guy who's been known to show up to dates wearing a 'World's Okayest Golfer' t-shirt, you make an impression." Molly laughed. "You know, for a guy who's trying to be taken seriously as a cat burglar, you're doing a pretty bad job of it." Bill looked affronted. "Hey, those t-shirts are a compliment. They're saying, 'I'm unapologetically me, and I don't care if you judge me.'" Molly shook her head. "You're something else, Bill." Bill beamed with pride. "And that's why we're going to be the best thieves we can be. Because when we're in, everyone's going to be so focused on how ridiculous we look that they'll forget all about the presents." Molly's grin faltered for a second before she nodded. 
"Yeah, yeah. I like the plan. Okay, so what's the plan for containing the kids?" Bill thought for a moment. "I was thinking we could just tell them there's a magic show going on outside and lead them away one by one." Molly sighed. "That's going to take hours. Can't we just, I don't know, knock them out or something?" Bill's eyes went wide. "We can't knock out children! That's... that's... child endangering!" Molly rolled her eyes. "You're really selling me on this whole 'professional thief' vibe, aren't you?" Bill stood up straight, his eyes flashing with mock seriousness. "Hey, we're not just thieves, Molly. We're artists. We're the John McClanes of... of... party theft." Molly facepalmed, just as Bill's phone rang, breaking the tension. "Hey, boss," he said, picking up. "Yeah, we're set for the plan. See you in three, two... Uh-huh. Yeah, Molly's hanging up on me right now." Molly jumped away from the car, laughing. "I'm going to go put some more ridiculous clothes on and we can get ready to rock this joint."
id: "gen-hgW664DOkrFE6d95xCTjQG3aAKBL"
total_cost: 0.0001642
upstream_id: "fc79eb88-2716-4afe-bb05-4e17b104dc8f"
created_at: "2024-07-23T16:51:27.311964+00:00" (Tuesday, July 23rd, 2024 at 4:51:27 PM GMT+00:00)
model: "meta-llama/llama-3.1-8b-instruct"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Fireworks"
latency: 2816
moderation_latency: (null)
generation_time: 0
finish_reason: "stop"
tokens_prompt: 102
tokens_completion: 749
native_tokens_prompt: 70
native_tokens_completion: 751
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0001642
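As a quick sanity check on the metadata above, the reported cost can be related to the native token counts. The sketch below assumes that total_cost is in US dollars and that prompt and completion tokens are billed at one flat rate; neither assumption is stated on this page, and the function name is hypothetical.

```python
# Values copied from the run metadata above.
NATIVE_TOKENS_PROMPT = 70
NATIVE_TOKENS_COMPLETION = 751
TOTAL_COST = 0.0001642  # assumed to be USD; the page does not state the unit


def implied_rate_per_million(cost: float, prompt_tokens: int, completion_tokens: int) -> float:
    """Return the flat per-million-token rate implied by a total cost.

    Assumes prompt and completion tokens are priced identically, which is
    an assumption for illustration, not something the page confirms.
    """
    total_tokens = prompt_tokens + completion_tokens
    return cost / total_tokens * 1_000_000


rate = implied_rate_per_million(TOTAL_COST, NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)
print(f"Implied rate: {rate:.4f} per million native tokens")
```

Under those assumptions the figures work out to roughly 0.2 per million native tokens across the 821 native tokens in this run.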
Evaluation details

Evaluator: Matches word count
Result: 0.0007%
Details: n/a
Meta Data: neededClean: false; words: 533

Evaluator: Dialogue to Total Word Ratio
Result: 36.5922%
Details: Ratio: 74.48%, Deviation: 4.48%
Meta Data: neededClean: false; wordsTotal: 533; wordsDialogue: 397
18.2964%
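The ratio and deviation reported for the dialogue evaluator can be reproduced from wordsDialogue and wordsTotal. The sketch below is an illustration only: it assumes dialogue means text inside straight double quotes and that words are whitespace-separated, which may differ from how NC Bench counts them. The function names are hypothetical, and the sketch does not attempt to reproduce the percentage scores in the Result field, whose formula is not shown on this page.

```python
import re


def ratio_and_deviation(words_dialogue: int, words_total: int, target_pct: float = 70.0):
    """Return (dialogue ratio in %, absolute deviation from the target in points)."""
    ratio = 100.0 * words_dialogue / words_total
    return round(ratio, 2), round(abs(ratio - target_pct), 2)


def count_dialogue_words(text: str):
    """Count whitespace-separated words overall and inside straight double quotes.

    This is an assumed counting rule, not the benchmark's confirmed method.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return words_dialogue, words_total


# Figures from the evaluation details above.
print(ratio_and_deviation(397, 533))  # (74.48, 4.48)

# A small example of the assumed counting rule.
sample = 'Molly sighed. "That will take hours." Bill shrugged.'
print(count_dialogue_words(sample))  # (4, 8)
```

With 397 dialogue words out of 533, the ratio is 74.48%, which is 4.48 percentage points above the 70% target in the scenario.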