NC Bench
Overview
About
Tests
Models
Model
Llama 3.1 405B
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-23-01J3G5028XP7DKD6WZKWVCYJ12
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her cat burglar suit, a sleek black jumpsuit complete with a fuzzy cat ear headband. Bill, her partner in crime, looked at her with a raised eyebrow. "Really, Molly? The cat ears?" "They're for good luck," Molly said, winking. "Besides, we're about to crash a children's birthday party. I want to blend in." Bill snorted. "You're going to blend in with a bunch of sugar-high six-year-olds? You're going to stick out like a sore thumb." Molly patted the bag of tools slung over her shoulder. "Hey, I've got all the essentials: cake-cutting wire, piƱata-picking locks, and an emergency stash of kid-friendly snacks in case we get caught." Bill shook his head. "You're enjoying this way too much." "Hey, someone's got to bring a little joy to the art of thievery," Molly said, grinning. "Besides, have you seen the haul we're looking at? Laser guns, talking teddy bears... this kid's parents must have spent a small fortune." Bill's eyes lit up. "Alright, alright. I'm in. But if I get puked on by a birthday boy with a stomach ache, I'm blaming you."
id
"gen-f4VW4n7E4nsVR1TEQmXAe518R3rz"
total_cost
0.001023
upstream_id
"afcf7b35-57e5-4bb7-ad7d-a05365c77abd"
created_at
"2024-07-23T15:57:51.212016+00:00"
Tuesday, July 23rd, 2024 at 3:57:51 PM GMT+00:00
model
"meta-llama/llama-3.1-405b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
10852
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
102
tokens_completion
269
native_tokens_prompt
70
native_tokens_completion
271
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.001023
Evaluation details
Result
Evaluator
Details
Meta Data
86.3803%
Matches word count
n/a
neededClean
false
words
189
0.0000%
Dialogue to Total Word Ratio
Ratio: 70.90%, Deviation: 20.90%
neededClean
false
wordsTotal
189
wordsDialogue
134
43.1902%