Model

Phi-3 Medium 128k

Test

Dialogue tags

Scenario

Write 200 words with 10% dialogue

Run ID

2024-07-19-01J35TEJHZBX2X4F4CHN7EGE2D

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly adjusted her wig - a ridiculously fluffy unicorn, the kind that might double as a pillow in a toddler's room, while Bill meticulously checked the intricate blueprint of the McMahon estate. Their target: the promised land of wrapping paper and cotton-stuffed bustles. A party for children, with the promised treasures nested in a sea of plastic tat. "You got the gummy bears?" Molly's voice was muffled by the vibrant hairpiece. "Gummy bears?" Bill raised an eyebrow and, with a chuckle, produced a bizarre assortment of candies from his coat pocket. "Why? For a post-heist snack?" The schematics unfolded before them, detailing balloons to tiptoe around and clowns to distract. The cocktail table, heavy with beverages of dubious appeal, promised a treacherous journey as the party unfolded. Yet Molly managed to giggle at the absurdity of the challenge. "Nah, my friend," Bill said, tucking the sweets safely away. "Gummy bears are for diversion. When we slyly slide smiles and gummies the kids’ way, they’ll be our tiny minions!" As Molly plotted amidst a pirate ship top hat and balloon animals, Bill donned his Robin Hood cardigan, whimsically armed with a party horn. She quipped with a smirk, “Sure beats a dog thief, don't you think?” as they prepared to embark on an epic journey of joy-filled larceny.
id: "gen-trFXtfSVIgz8AuLbQMzsrQxBwWkX"
total_cost: 0.000476
upstream_id: "cmpl-7f9cade9c6f048d3af46050216c1d620"
created_at: "2024-07-19T15:41:10.516623+00:00" (Friday, July 19th, 2024 at 3:41:10 PM GMT+00:00)
model: "microsoft/phi-3-medium-128k-instruct"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Azure"
latency: 13466
moderation_latency: (null)
generation_time: 0
finish_reason: "stop"
tokens_prompt: 82
tokens_completion: 292
native_tokens_prompt: 83
native_tokens_completion: 393
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.000476

Evaluation details

Result | Evaluator | Details | Meta Data
27.1658% | Matches word count | n/a | neededClean: false, words: 219
35.4505% | Dialogue to Total Word Ratio | Ratio: 20.09%, Deviation: 10.09% | neededClean: false, wordsTotal: 219, wordsDialogue: 44
31.3082%
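
The "Ratio" and "Deviation" figures reported for the dialogue evaluator can be reproduced from the word counts in its meta data. The following is a minimal sketch, assuming the ratio is dialogue words divided by total words and the deviation is the absolute distance from the 10% target stated in the scenario. The function name and the target_pct parameter are hypothetical and are not taken from the benchmark's code. How the evaluator counts words, and how it maps the deviation to the percentage in the Result column, is not stated in this record and is not reproduced here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int, target_pct: float = 10.0):
    """Return (ratio, deviation) in percent, rounded to two decimals.

    Assumption: ratio = dialogue words / total words, and deviation is the
    absolute difference between that ratio and the target percentage.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = 100.0 * words_dialogue / words_total
    deviation = abs(ratio - target_pct)
    return round(ratio, 2), round(deviation, 2)


# Counts taken from the evaluation meta data (wordsDialogue, wordsTotal).
ratio, deviation = dialogue_ratio(44, 219)
print(f"Ratio: {ratio}%, Deviation: {deviation}%")
```

With the counts above, the sketch yields the same two figures shown in the Details column, which suggests the deviation is measured in percentage points rather than relative to the target.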