NC Bench
Overview
About
Tests
Models
Model
Hermes 3 405B
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-08-26-01J68DAFW7WKBFN18NH44NA316
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly narrowed her eyes, studying the colorful invitations strewn across the table. "A children's party, Bill. It's genius. The little tykes will be so distracted by the clowns and magicians, they won't even notice their presents disappearing." Bill scoffed, running a hand through his greasy hair. "You're serious? We're gonna rob a bunch of kids? That's a new low, even for us." "Oh, don't get all high and mighty on me now," Molly snorted. "We're thieves, Bill. This is what we do. Besides, those spoiled brats probably have enough toys to last them a lifetime." Bill sighed, shaking his head. "Fine, but we better have a solid plan. I don't want to end up on some 'America's Most Wanted' poster as the 'Birthday Bandits.'" "Don't worry," Molly smirked, tapping her temple. "I've got it all figured out. You'll go in as the balloon artist, and I'll be the face painter. We'll blend right in." "Balloon artist?" Bill raised an eyebrow. "I don't know the first thing about twisting balloons." "You'll learn," Molly said, waving a dismissive hand. "Just remember, the more complicated the balloon animal, the more impressed the parents will be. And the more distracted they are, the easier it'll be for us to make off with the loot."
id
"gen-SSazIFWE7v18EFYtV7IY3attYml5"
total_cost
0
upstream_id
"chatcmpl-4afbc255-4038-431f-9346-9303aa41b39c"
created_at
"2024-08-26T22:36:42.115488+00:00"
Monday, August 26th, 2024 at 10:36:42 PM GMT+00:00
model
"nousresearch/hermes-3-llama-3.1-405b"
app_id
182717
streamed
true
cancelled
false
provider_name
"Lambda"
latency
6672
moderation_latency
(null)
generation_time
14956
finish_reason
"stop"
tokens_prompt
67
tokens_completion
296
native_tokens_prompt
83
native_tokens_completion
297
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0
Evaluation details
Result
Evaluator
Details
Meta Data
90.4837%
Matches word count
n/a
neededClean
false
words
210
29.9836%
Dialogue to Total Word Ratio
Ratio: 79.52%, Deviation: 10.48%
neededClean
false
wordsTotal
210
wordsDialogue
167
60.2337%