NC Bench
Overview
About
Tests
Models
Model
Llama 3.1 70B
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2024-07-23-01J3G81WW0T13E9S2XPXYZVS0R
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the rearview mirror, scrutinizing her reflection. "Do you think I look like a respectable children's entertainer?" Bill chuckled, "You mean, do you look like a grown adult who's not currently plotting to steal a bunch of kids' birthday presents?" He nodded, "Yeah, I think you're a natural." Molly smiled, wincing at the bright pink wig perched atop her head. "Hey, this is a masterpiece. I raided every grandma's closet in the neighborhood to get this look." The two thieves sat parked outside the local community center, surrounded by families in their Sunday best, shuffling into the annual Under-10 Extravaganza. Molly's mission was to infiltrate the festivities, crack the safe – disguised as a pinata – and relieve the children of their precious gifts. Bill was in charge of the diversion. "Okay, run through the plan one more time," Molly said, as she buttoned up the final hooks on her bright orange blouse. "Alright, I'll sneak in, 'accidentally' knock over the bounce house, and everyone will rush to gawk at the chaos," Bill explained, his eyes dancing. "Meanwhile, you slink in, slip into the birthday coordinator's office, and crack the safe. Simple." Molly's eyes gleamed with excitement. "And the password is ' Sparkles123', right?" "That's right," Bill confirmed. "Though, honestly, it's almost as if they wanted us to steal it. I mean, who uses a password as obvious as 'Sparkles123'?" Molly winked. "Desperation breeds predictability, my friend. We're the professionals. We'll show them what a real 'Sparkles123' looks like." Bill checked the clock on the dashboard. "Ten minutes to go-time. Time to create a diversion." As he climbed out of the car, Molly whispered, "Don't make too much of a mess, okay? I like this outfit. Don't want it to get ruined in the fray." Bill's response was a mischievous chuckle, followed by him sprinting toward the bounce house. Molly slid on a pair of oversized sunglasses, donned a fake smile, and headed into the fray, greeting the birthday coordinator with an inane, "HAPPY BIRTHDAY TO YOU!"
id
"gen-0kUmV48Tx8HzgHkEyZNfbghYTMAQ"
total_cost
0.0004986
upstream_id
"07f37527-a15c-4e97-97d0-b7a875b161ac"
created_at
"2024-07-23T16:51:10.758359+00:00"
Tuesday, July 23rd, 2024 at 4:51:10 PM GMT+00:00
model
"meta-llama/llama-3.1-70b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
4586
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
102
tokens_completion
482
native_tokens_prompt
70
native_tokens_completion
484
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0004986
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
337
67.2108%
Dialogue to Total Word Ratio
Ratio: 52.82%, Deviation: 2.82%
neededClean
false
wordsTotal
337
wordsDialogue
178
33.6054%