NC Bench
Overview
About
Tests
Models
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2024-07-19-01J35SVJMT5N7DGX337N5CHRTW
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill, two seasoned thieves, squatted behind a thick hedge in the backyard of the bustling mansion. Inside, the children's party was in full swing. Colorful balloons bobbed cheerfully, and the sound of shrill laughter intermingled with clanging utensils. The pair's eyes, however, were fixated on the oversized piƱata brimming with stuffed animals and shiny gift bags piled high. "Piece of cake, just like we planned," Bill whispered, his hazel eyes glinting mischievously. "You distract the clown, and I'll go for the loot. We'll be in and out before they can say 'Scooby-Do'." He chuckled, patting the balloon-shaped helium tank strapped to his back. Molly raised a dubious eyebrow. "You sure this is the right gig, Bill? Stealing from kids?" she asked, tugging at her baseball cap emblazoned with an ironic unicorn. "Relax, Molly," Bill reassponded with a dismissive wave. "These kids won't miss a thing. Their parents have more money than they know what to do with. It's practically a public service." He paused, his grin turning sly. "Besides, think of the resale value on eBay. We'll be laughing all the way to the bank." Molly sighed, the air hissing between her fluorescent green braces. "Fine, but this better go according to plan, or we're donating our shares to charity," she warned, adjusting her clown costume.
id
"gen-NRTnC3RVyYgW0mRJdSKjlRLHOm5h"
total_cost
0.0048175
upstream_id
"5f8e9bc2jPESCt36"
created_at
"2024-07-19T15:31:00.457852+00:00"
Friday, July 19th, 2024 at 3:31:00 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
25908
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
317
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
Result
Evaluator
Details
Meta Data
20.1897%
Matches word count
n/a
neededClean
false
words
220
0.0000%
Dialogue to Total Word Ratio
Ratio: 43.64%, Deviation: 33.64%
neededClean
false
wordsTotal
220
wordsDialogue
96
10.0948%