NC Bench
Overview
About
Tests
Models
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2024-07-19-01J35T98MQRT3KW4VKX0HCFPYF
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill, two seasoned thieves, huddled in their dingy hideout, meticulously planning their most audacious heist yet: robbing a children's birthday party of its presents. The stakes were high, but so were the rewards. Elaborate invitations had been surreptitiously stolen, revealing the extravagant budget allocated for the soiree. "This is the big one, Bill," Molly, the mastermind, failed to hide her glee as she flaunted the purloined invite. "I've got us invites, but we'll need disguises. And you, my friend, will be my beloved husband." Bill, a burly man with a voice like gravel, scowled, "I don't wear glasses," he growled, glaring at his nerdy disguise. "It's for the greater good," Molly, in a khaki dress and floppy hat, cajoled. "Now, thevenue's an indoor playground. The kids will be distract ed, but the parents?" She whistled, "That's where the challenge lies." Their plan was simple: blend in as doting guests, and when the cake was served - anarchy. While the sugar-fueled young'uns attacked the icing, they'd make their move. Bill would distract the parents with a fake allergic reaction, and Molly would strike, her purse morphing into a bottomless sack. "Remember, we're peripheral," Molly emphasized, slipping a fake engagement ring on Bill's finger. "No one notices the couple who's here but not really
id
"gen-rGs0qP2akuAqNIv12ixAFpQOfkp3"
total_cost
0.0048175
upstream_id
"5f8e9d63EGGI8Aun"
created_at
"2024-07-19T15:38:23.057676+00:00"
Friday, July 19th, 2024 at 3:38:23 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
20009
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
319
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
Result
Evaluator
Details
Meta Data
68.1022%
Matches word count
n/a
neededClean
false
words
214
0.0000%
Dialogue to Total Word Ratio
Ratio: 29.91%, Deviation: 19.91%
neededClean
false
wordsTotal
214
wordsDialogue
64
34.0511%