NC Bench
Model: Goliath 120B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2024-07-19-01J35STVCPRXJB9DJH0J9K8E1D
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: Bill, I've got the perfect job for us. We're gonna hit a party, grab all the loot, and nobody's the wiser. Piece of cake, eh?
Bill: Really? What kind of party? You know I don't like mingling with high society types. I can't even pronounce half the cheeses they serve.
Molly: Relax, Bill, this is an entirely different kind of party. It's a... *kids* party.
Bill: You're kidding me, right? We're trained thieves, Molly, not glorified clowns. I'm not wearing one of those ridiculous wigs and honking a horn at a bunch of screaming brats.
Molly: Calm down, Bill. We're not crashing the party as entertainers. We're going in as... *ahem*... *parents*. We'll blend right in, take the presents, and leave without a trace.
Bill: Uhh, Molly, just because you can stuff your... ehm... assets into a frumpy skirt, doesn't mean you'll convince anyone you're a mom. And me? I look like I ate an entire daycare center for breakfast.
Molly: Trust me, Bill, I've got this figured out. I've already rented us some disguises. You'll see. We'll be mistaken for the hip, but slightly odd, uncle and auntie.
Bill: Alright, Molly, but you better be sure about this. I've seen the way those PTA moms look at each other. I'm pretty sure they've got their
id: "gen-JPr4Mtiv4eI2QhmhFG1nFQq2R6MA"
total_cost: 0.0048175
upstream_id: "5f8e9bacp2h6EczH"
created_at: "2024-07-19T15:30:34.986487+00:00" (Friday, July 19th, 2024 at 3:30:34 PM GMT+00:00)
model: "alpindale/goliath-120b"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Mancer"
latency: 24226
moderation_latency: (null)
generation_time: 0
finish_reason: "length"
tokens_prompt: 91
tokens_completion: 323
native_tokens_prompt: 114
native_tokens_completion: 400
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0048175
Evaluation details

Evaluator: Matches word count
Result: 0.0000%
Details: n/a
Meta Data: neededClean: false, words: 217

Evaluator: Dialogue to Total Word Ratio
Result: 0.0000%
Details: Ratio: 0.00%, Deviation: 70.00%
Meta Data: neededClean: false, wordsTotal: 217, wordsDialogue: 0
0.0000%
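
The "Dialogue to Total Word Ratio" figures above (wordsDialogue 0 out of wordsTotal 217, deviation 70.00% from the 70% target) are what one would expect if the evaluator only counts words inside quotation marks, since the response uses script-style "Molly:" / "Bill:" speaker labels and contains no quoted speech. The page does not describe the evaluator's actual rules, so the following is only a minimal sketch under that assumption; the function names `dialogue_ratio` and `deviation_from_target` and the two sample strings are hypothetical, not taken from NC Bench.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a piece of prose.

    Assumption: "dialogue" means text enclosed in double quotes
    (straight or curly). The real evaluator may use different rules.
    """
    total_words = len(text.split())
    # Capture every span between an opening and a closing double quote.
    quoted_spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio: float, target: float = 0.70) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Script-style turns contain no quotation marks, so no dialogue is detected.
script_style = (
    "Molly: Bill, I've got the perfect job for us. "
    "Bill: Really? What kind of party?"
)

# The same exchange written as quoted prose is counted as dialogue.
prose_style = (
    '"I\'ve got the perfect job for us," said Molly. '
    '"Really?" Bill asked.'
)

if __name__ == "__main__":
    for label, sample in (("script", script_style), ("prose", prose_style)):
        words_dialogue, words_total, ratio = dialogue_ratio(sample)
        print(label, words_dialogue, words_total,
              f"ratio={ratio:.2%}",
              f"deviation={deviation_from_target(ratio):.2%}")
```

Under this quote-based assumption, a response formatted as a script scores a ratio of 0% no matter how much of it is spoken, which is consistent with the 0.00% ratio reported for this run.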