NC Bench
Overview
About
Tests
Models
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35SS9PGPPQ2RS7KP9RGXT4P
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: (whispering) Bill, do you think this is a good idea? Stealing presents from a kids' party, that's a new low, even for us. Bill: (confidently) Molly, just picture the look on those kids' faces when they see their presents are gone. It'll be priceless! Besides, it's not like they have security cameras here. *snickers* Molly: (hesitant) I don't know, Bill. What if they call the cops? Bill: (waving dismissively) Don't worry, we'll be in and out before they can even blow out the candles. Plus, we'll blend right in. *Adjusts balloon animal hat* Look at us, we're practically part of the entertainment. Molly: (sighs) Alright, but we're only taking the good stuff, no homemade gifts or clothes. Bill: (chuckling) Oh, you and your scruples. Fine, but if we see a particularly jingly-looking wrap, consider it a sign from the gift-stealing gods. The two thieves approach the presents stealthily, balloon animals and all. Molly: (whisper-yelling) Bill, I think one of the parents is looking our way. Bill: (whispering back) Molly, focus! We're here for the loot, not to start a conversation about the weather. *pauses* Nice dinosaur, by the way. They continue to inspected the presents, Molly distracting the adults with a mediocre balloon animal routine while Bill took inventory.
id
"gen-yw45eiK1dtGIKe9KiUKMCOIjjA7k"
total_cost
0.0048175
upstream_id
"5f8e9b7cnsF3D6z7"
created_at
"2024-07-19T15:29:45.49202+00:00"
Friday, July 19th, 2024 at 3:29:45 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
25630
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
321
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
Result
Evaluator
Details
Meta Data
81.2727%
Matches word count
n/a
neededClean
false
words
212
0.0000%
Dialogue to Total Word Ratio
Ratio: 8.49%, Deviation: 81.51%
neededClean
false
wordsTotal
212
wordsDialogue
18
40.6364%