NC Bench
Overview
About
Tests
Models
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2024-07-19-01J35SMCRBTBV6RTAARVV1AVXM
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly fidgeted with her lockpicking kit, shooting a sideways glance at Bill. He was lounging against the side of their van, consumed by a comic book featuring an anthropomorphic crime-fighting beaver. "Bill," she hissed, "Focus on the job, will ya?" He looked up, feigning innocence. "Focus on what? The fact that we're about to crash a kiddie party and make off with a pile of toys?" Molly rolled her eyes. "Exactly. We need to maintain our cover. You can't waltz in here dressed as a rcade bandit while I'm in a full-blown clown costume." Bill's shoulders shook with suppressed laughter. "Hey, I'll have you know, this is an authentic B-string villain costume." "Whatever, just keep it together, alright?" They parked the van near the dubious-looking bounce house, and Molly peered through her binoculars at the target: a pink-festooned mansion, host to "Princess Penelope's Perfect Pony Party." The invitation had fallen into their hands (or rather, their drunk friend's puppy's paws, but that's another story) and they couldn't resist the challenge. The party was in full swing. Squeals of delighted children mingled with the incessant bleating of plastic instruments. Molly shuddered. "This better be worth it," she muttered, slipping on her clown wig. At the entrance, Bill straightened his faux-fur collar, adopting a menacing (
id
"gen-Qfn5dHuLl8OYnvZaaf8CbNJGzXqm"
total_cost
0.0048175
upstream_id
"5f8e9ae6beoL1Tla"
created_at
"2024-07-19T15:27:03.246207+00:00"
Friday, July 19th, 2024 at 3:27:03 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
24088
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
315
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
Result
Evaluator
Details
Meta Data
35.0022%
Matches word count
n/a
neededClean
false
words
218
0.0000%
Dialogue to Total Word Ratio
Ratio: 37.16%, Deviation: 27.16%
neededClean
false
wordsTotal
218
wordsDialogue
81
17.5011%