NC Bench
Model
Goliath 120B
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2024-07-19-01J35SV4438XY5MC65CMNYTVG8
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly and Bill were hunched over a crudely drawn map of the neighborhood park, their fingers hovering over the X that marked their next target. It was a children's birthday party, and the loot? All the presents, tastefully wrapped and begging to be liberated from their ribbons and used to line the pockets of the morally flexible duo. "So, Bill," Molly said, her eyes twinkling with mischief, "you've got your wwwaaaodore, right?" she drawled, waving her hand in front of her nose playfully. "Hey, I showered," he grumbled, scratching his disheveled hair. "It's hard to smell like a rose when we're camping in a van down by the river." Molly snorted. "That might actually be an upgrade from your normal scent," she quipped. Bill elbowed her in the ribs, laughter in his eyes. "oi! Watch it, or I'll tell the clown you're allergic to balloon animals." Molly widened her eyes in mock horror. "You wouldn't!" "Alright, alright," Bill said, waving his hands in surrender. "Let's focus on the mission at hand, Red Riding Hoodlum. We'll approach from opposite sides of the playground. You create a diversion, and I'll slip in and grab the loot sacks." Molly grinned. "It's like that, eh?" she purred, wagging her eyebrows suggestively. "Mature, Molly, real mature," Bill deadpanned. "Just shut up and put on your clow
id
"gen-ltN2Ya9UDEBBqxgvM6WnTrQ65zrz"
total_cost
0.0048175
upstream_id
"5f8e9bb46yDBxN0x"
created_at
"2024-07-19T15:30:48.058486+00:00"
Friday, July 19th, 2024 at 3:30:48 PM GMT+00:00
model
"alpindale/goliath-120b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mancer"
latency
28369
moderation_latency
(null)
generation_time
0
finish_reason
"length"
tokens_prompt
91
tokens_completion
319
native_tokens_prompt
114
native_tokens_completion
400
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0048175
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 14.3014% | Matches word count | n/a | neededClean: false; words: 221 |
| 0.0000% | Dialogue to Total Word Ratio | Ratio: 45.70%, Deviation: 35.70% | neededClean: false; wordsTotal: 221; wordsDialogue: 101 |
7.1507%
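The ratio and deviation reported above are consistent with dividing wordsDialogue by wordsTotal and comparing the result with the 10% target from the scenario. The sketch below illustrates that arithmetic. It is not the NC Bench evaluator: the function names are hypothetical, and it assumes that words are whitespace-separated and that dialogue is whatever sits between straight double quotes.

```python
import re


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (words_dialogue, words_total) for a scene.

    Assumption: dialogue is any text between straight double quotes,
    and a "word" is any whitespace-separated token.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return words_dialogue, words_total


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target_ratio: float = 0.10) -> dict:
    """Compute the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return {"ratio": ratio, "deviation": abs(ratio - target_ratio)}


# Reproduce the reported figures from the counts in the Meta Data column.
stats = ratio_and_deviation(words_dialogue=101, words_total=221)
print(f"Ratio: {stats['ratio']:.2%}, Deviation: {stats['deviation']:.2%}")
```

With the counts from this run (101 dialogue words out of 221), the sketch prints a ratio of 45.70% and a deviation of 35.70%, matching the Details column. How the evaluator converts that deviation into the 0.0000% result is not shown on the page, so it is left out here.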