NC Bench
Overview
About
Tests
Models
Model
Hermes 3 405B
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-08-26-01J68DFMMDFHTYG1N1MJTRC5KQ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill sat hunched over a table, studying a crudely drawn map of the party venue. "Okay, so here's the plan," Molly whispered conspiratorially. "You create a distraction by releasing a bunch of greased pigs into the party." Bill raised an eyebrow. "Greased pigs? Where am I supposed to get those?" Molly waved a dismissive hand. "Details, details. Figure it out. While the kids are chasing the pigs, I'll slip in and grab the presents." "Got it," Bill nodded. "But what if someone recognizes me?" "Wear a disguise, obviously," Molly rolled her eyes. "I dunno, maybe a clown suit or something." "A clown suit?" Bill looked horrified. "You know I hate clowns." "Well, it's either that or a princess dress," Molly smirked. Bill gulped. "Fine, I'll take the clown suit." "Excellent," Molly rubbed her hands together gleefully. "This is going to be the heist of the century!" "Just one question," Bill interjected. "What are we going to do with all those stolen presents?" Molly paused, then shrugged. "We'll figure that out later. For now, let's focus on the heist."
id
"gen-k9AT1koC0WKxBMVnvfuTsrL8IRbh"
total_cost
0
upstream_id
"chatcmpl-7bf219d9-054f-4e22-a278-637e2ec607ed"
created_at
"2024-08-26T22:39:30.598724+00:00"
Monday, August 26th, 2024 at 10:39:30 PM GMT+00:00
model
"nousresearch/hermes-3-llama-3.1-405b"
app_id
182717
streamed
true
cancelled
false
provider_name
"Lambda"
latency
4717
moderation_latency
(null)
generation_time
16712
finish_reason
"stop"
tokens_prompt
67
tokens_completion
261
native_tokens_prompt
83
native_tokens_completion
262
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0
Evaluation details
Result
Evaluator
Details
Meta Data
20.1897%
Matches word count
n/a
neededClean
false
words
180
0.0000%
Dialogue to Total Word Ratio
Ratio: 70.00%, Deviation: 20.00%
neededClean
false
wordsTotal
180
wordsDialogue
126
10.0948%