NC Bench
Model
Llama 3.1 8B
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-23-01J3G81TY4FPJQ6QKE0JNAMTHJ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched in front of a dusty old map of the suburbs, a coffee-stained highlighter clutched in her hand. Bill hovered above her, a cigarette dangling from his lip. "Okay, so the party's scheduled for two PM. Parents will be busy pinning the tail on the donkey, kids will be distracted by the piñata. We slip in through the kitchen window, grab as many gifts as we can carry, and get out before the cake is served." "Sounds like a solid plan," Bill said, exhaling a plume of smoke. "But what about the bouncy castle?" Molly asked, pointing at the map. "That's a definite pain to navigate." "Meh, it's just a bouncy castle. We can bounce around it." "Bouncing into the host's prized collection of antique teapots? No thanks." "Fine, we'll just have to make sure no one's in it when we make our move." "You're sweating, Bill," Molly said, holding out her hand for the cigarette. "Right, focus," Bill muttered, shoving the cigarette back into his mouth. "Okay, last thing to discuss. What's our exit strategy?" "You're thinking we're going to get caught and have to make a break for it?" Bill snorted. "Not on your life." "Fair enough. But just in case, I'm memorizing the nearest sewer entrance." Bill raised an eyebrow. "The sewer?" Molly shrugged. "Hey, we've watched our fair share of movies, Bill. It's a classic escape plan."
id
"gen-345MHc7enCRwjjQIlEq0J3KiXBxj"
total_cost
0.00008
upstream_id
"97ff781a-e29b-4799-8505-8d0664b92952"
created_at
"2024-07-23T16:51:05.679854+00:00"
Tuesday, July 23rd, 2024 at 4:51:05 PM GMT+00:00
model
"meta-llama/llama-3.1-8b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
1479
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
102
tokens_completion
328
native_tokens_prompt
70
native_tokens_completion
330
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00008
Evaluation details
Result | Evaluator | Details | Meta Data
0.0007% | Matches word count | n/a | neededClean: false, words: 233
0.0000% | Dialogue to Total Word Ratio | Ratio: 70.39%, Deviation: 19.61% | neededClean: false, wordsTotal: 233, wordsDialogue: 164
0.0004%
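The reported ratio and deviation follow from the word counts in the evaluation details: 164 dialogue words out of 233 total, compared against the 90% target in the scenario. The Python sketch below reproduces that arithmetic. It is only an illustration, not NC Bench's evaluator: the function names are hypothetical, and the counting rules (dialogue is any text between straight double quotes, words are whitespace-separated) are assumptions that may not match how the benchmark counts.

```python
import re


def count_words(text: str) -> int:
    # Assumption: a "word" is any whitespace-separated token.
    return len(text.split())


def dialogue_word_ratio(scene: str) -> tuple[int, int, float]:
    # Assumption: dialogue is whatever sits between straight double quotes.
    quoted = re.findall(r'"([^"]*)"', scene)
    words_dialogue = sum(count_words(q) for q in quoted)
    words_total = count_words(scene)
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_dialogue, words_total, ratio


def deviation_from_target(ratio: float, target: float = 0.90) -> float:
    # Absolute gap between the measured ratio and the 90% target.
    return abs(target - ratio)


# Counts taken from the evaluation details: wordsDialogue and wordsTotal.
ratio = 164 / 233
print(f"Ratio: {ratio:.2%}, Deviation: {deviation_from_target(ratio):.2%}")
# prints "Ratio: 70.39%, Deviation: 19.61%"
```

Running `dialogue_word_ratio` on the model response may not return exactly 164 and 233, since the benchmark's tokenization and quote handling are not documented on this page.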