NC Bench
Overview
About
Tests
Models
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35SYDVDVVCKAKHSM0C3ZAAM
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill, a mischievous, bumbling duo of thieves, sat hunched over a ratty map of the local children's entertainment palace. "Alright, Bill, I've got the blueprints here, but I'm not sure about this." Molly sighed, her eyebrows furrowing in worry. "Aw, c'mon, Molly, in and out, just like a hot knife through butter," Bill replied, his beady eyes glinting with plans of stolen loot. "It's a kids' party, for Pete's sake. What could possibly go wrong?" "I don't know, BILL, I've got a bad feeling about this," she moaned, chewing her fingernails. "There'll be children EVERYWHERE. And they're like little surveillance cameras with sticky fingers." Bill slapped the table, rattling their motel room's shoddy frame. "Look, Molly, we're professionals, alright? We've reformatted the drone to look like a giant balloon, so it'll blend right in with the décor." Molly cast him a skeptical look. "A BALLOON? Bill, you know that thing's as subtle as a bull in a china shop." "Hey, it has a stealth mode, alright? And we'll use it to distract the adults while we sneak in and plunder their measly presents. Besides, it's in T. rex form, the kids will LOVE it!" Bill waved his arms enthusiastically, nearly toppling their cheap suitcase filled with g
id
"gen-96OqSMzc4ruo0vGJGIl5XUfNK1v3"
total_cost
0.0048175
upstream_id
"5f8e9c19L5rwJlbV"
created_at
"2024-07-19T15:32:35.189065+00:00"
Friday, July 19th, 2024 at 3:32:35 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
26605
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
307
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
Result
Evaluator
Details
Meta Data
93.6496%
Matches word count
n/a
neededClean
false
words
209
0.5741%
Dialogue to Total Word Ratio
Ratio: 65.07%, Deviation: 15.07%
neededClean
false
wordsTotal
209
wordsDialogue
136
47.1119%