Model

Llama 3.1 8B

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-23-01J3G81TY4FPJQ6QKE0JNAMTHJ

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly crouched in front of a dusty old map of the suburbs, a coffee-stained highlighter clutched in her hand. Bill hovered above her, a cigarette dangling from his lip. "Okay, so the party's scheduled for two PM. Parents will be busy pinning the tail on the donkey, kids will be distracted by the piƱata. We slip in through the kitchen window, grab as many gifts as we can carry, and get out before the cake is served." "Sounds like a solid plan," Bill said, exhaling a plume of smoke. "But what about the bouncy castle?" Molly asked, pointing at the map. "That's a definite pain to navigate." "Meh, it's just a bouncy castle. We can bounce around it." "Bouncing into the host's prized collection of antique teapots? No thanks." "Fine, we'll just have to make sure no one's in it when we make our move." "You're sweating, Bill," Molly said, holding out her hand for the cigarette. "Right, focus," Bill muttered, shoving the cigarette back into his mouth. "Okay, last thing to discuss. What's our exit strategy?" "You're thinking we're going to get caught and have to make a break for it?" Bill snorted. "Not on your life." "Fair enough. But just in case, I'm memorizing the nearest sewer entrance." Bill raised an eyebrow. "The sewer?" Molly shrugged. "Hey, we've watched our fair share of movies, Bill. It's a classic escape plan."
id"gen-345MHc7enCRwjjQIlEq0J3KiXBxj" 
total_cost0.00008
upstream_id"97ff781a-e29b-4799-8505-8d0664b92952" 
created_at"2024-07-23T16:51:05.679854+00:00" Tuesday, July 23rd, 2024 at 4:51:05 PM GMT+00:00
model"meta-llama/llama-3.1-8b-instruct" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Fireworks" 
latency1479
moderation_latency(null)
generation_time0
finish_reason"stop" 
tokens_prompt102
tokens_completion328
native_tokens_prompt70
native_tokens_completion330
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.00008

Evaluation details

Result Evaluator Details Meta Data
0.0007% Matches word count n/a
neededCleanfalse
words233
0.0000% Dialogue to Total Word Ratio
Ratio: 70.39%, Deviation: 19.61%
neededCleanfalse
wordsTotal233
wordsDialogue164
0.0004%