NC Bench
Overview
About
Tests
Models
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35T2HSV46NV4B1FD3G0V87F
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Alright, Bill-o, that there's the target." Molly, the short and sassy one with a penchant for hats, pointed at a garishly decorated house at the end of the street. It was the epitome of suburban affluence, complete with a bouncy castle in the backyard and streams of screaming children pouring out its doors. Bill, a lanky chap with the demeanor of a mopey seagull, sighed. "A kid's party, Molly? Really? We're stooping this low?" "We're thieves, Bill, not charitable donors," she shot back. "Besides, I hear ol' man Smoothfingers is hired for this gig too. We get in there before him, we're in for a real treat." Bill shrugged, the prospect of besting their rival thief-for-hire, Smoothfingers, just enough to pique his interest. "Okay, so we're both of the same desperate mind. How do we crack this nut?" Molly grinned, her hazel eyes twinkling with mischief. "Well, my depressed seabird, it just so happens I have a plan." She motioned to a nearby tree, its branches almost touching the house's second floor. "You're going to go in through the window over there, silent as that refrigerator-burglar you're named after." "Wait, wait, wait," Bill interjected, "I've been practicing my ninja skills, I can—" "I think I'll let the air conditioning unit climbing and glass-cutting expert—" she gestured at Bill's
id
"gen-RidgfQq3AQwzX16My32ovOIEtzuu"
total_cost
0.0048175
upstream_id
"5f8e9c97cTs0F6ma"
created_at
"2024-07-19T15:34:44.062176+00:00"
Friday, July 19th, 2024 at 3:34:44 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
21003
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
320
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
Result
Evaluator
Details
Meta Data
6.0907%
Matches word count
n/a
neededClean
false
words
223
99.9393%
Dialogue to Total Word Ratio
Ratio: 51.57%, Deviation: 1.57%
neededClean
false
wordsTotal
223
wordsDialogue
115
53.0150%