NC Bench
Overview
About
Tests
Models
Model
Gemma 2 27B
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2024-07-19-01J35SKTW61DJEWZT4M42M6X4M
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The musty air of the abandoned warehouse hung heavy with the scent of damp cardboard and regret. A single bare bulb cast long, flickering shadows across the floor, revealing a haphazard assortment of repurposed playpens, deflated bouncy castles, and a half-eaten birthday cake, its frosting crusting ominously. Molly surveyed the scene with narrowed eyes. "You sure about this, Bill? This place creeps me out." Bill, perched precariously atop a wobbly stack of plastic building blocks, shrugged, his mask askew. "Relax, Molls. It's just a bunch of leftover party crap. Besides," he held up a crumpled flyer that boasted a glitter-encrusted unicorn, "it's a 'Princess Penelope's Enchanted Extravaganza'. Think of the loot. Diamond tiaras, pony plushies, maybe even a magical wand." Molly snorted. "You're delusional. We're talking six-year-olds, Bill. The only magic wand they'll have is the kind that dispenses sticky lollipops." He hopped down, landing with a thud, sending a shower of misplaced confetti raining down on his head. "Hey, you never know. There's gotta be something valuable in all this, right?" He rummaged through a discarded loot bag, pulling out a deflated balloon and a plastic tiara adorned with a single, sad-looking rhinestone. Molly rolled her eyes. "See? Garbage." She sighed. "But fine. Let's do this. If it's anything less than solid gold, though, we're out of here." Bill grinned, his eyes gleaming with the feverish excitement of a bored kid in a candy store. "This is gonna be epic. We’ll be like modern-day Robin Hoods, except instead of robbing the rich, we're relieving spoiled brats of their childish trinkets." He tugged on a deflated dragon costume left behind from a previous party, the fabric seeming to sigh in protest. "Right, so, the plan?" Molly pointed to a giant, intricately decorated cardboard castle in the corner. "We hit the treasure chest, obviously. Looks like it's stacked high with presents. You distract the guards - those sobbing clowns they hired for entertainment - while I nab the goods." "Perfect! I'll wear this clown nose I found. It's practically begging for a tearful meltdown." Bill held up a grotesquely oversized red nose that looked like it could house a family of squirrels. Molly winced, fighting the urge to laugh. "Don't make me regret this, Bill," she muttered. "And try not to scare the children. Remember, we're Robin Hoods, not the Boogeyman." Bill, already halfway into the dragon costume, gave her a thumbs up, his voice muffled by the feathered neck. "Leave it to me, Molls. I'll have those little rugrats convinced they're witnessing a real-life fire-breathing magical dragon." Molly shook her head and sighed, a smile playing on her lips. "This is going to be a disaster." "Disaster?" Bill yelled, stumbling around in the too-tight costume. "This, my dear Molly, is a masterpiece in the making!"
id
"gen-2er5gjOKFJi1XiF8cqEsttZ7lyds"
total_cost
0.00055512
upstream_id
"8a5bbab396d86423-SJC"
created_at
"2024-07-19T15:26:33.607611+00:00"
Friday, July 19th, 2024 at 3:26:33 PM GMT+00:00
model
"google/gemma-2-27b-it"
app_id
182717
streamed
false
cancelled
false
provider_name
"Together"
latency
12680
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
80
tokens_completion
697
native_tokens_prompt
75
native_tokens_completion
696
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00055512
Evaluation details
Result
Evaluator
Details
Meta Data
0.2141%
Matches word count
n/a
neededClean
false
words
472
76.2187%
Dialogue to Total Word Ratio
Ratio: 47.67%, Deviation: 2.33%
neededClean
false
wordsTotal
472
wordsDialogue
225
38.2164%