Model

Phi-3.5 Mini 128k

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-08-26-01J68D0CG2C1YSXSVKNQ9AET9Z

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"Molly: Look at all these gifts, Bill. It's like a treasure trove for a child, but for us, it's the perfect job." Bill: "Indeed, Molly. If only we could charm the nanny away. With the gifts, we'd be unstoppable." Molly: "First things first, let's waltz past those 'monkey bars.' No one suspects creative acrobatics." Bill: "Oh, Molly, your long legs swing like a trapeze artist. Child's birthday party, the perfect smokescreen." Molly: "Once inside, we move like shadows. You, a ninja - silent as the approaching fog." Bill: "And you, Molly, the organizational genius. You'll navigate the chaos as if running a marathon of precision." Molly: "To each of those boxes, we'll assign value. Jewels for the parents, plushies for the babies." Bill: "Jewels? No, wait. Let’s count the legos instead. Fascinating little minions, assembling fortresses." Molly: "Spectators are distracted by the piñata. Your trick will work. Pull out the yardstick pretense, I’ll snatch the pretty confections." Bill: "With all this bounty, we'll be rich beyond belief, or at least beyond the grasp of kid-ruled countries." Molly: "Afterward, let's ensure no trace remains. Dog walkers, babysitters, police - none the wiser to our shenanigans." Bill: "I'll rendezvous in the back alley. We’ll scarf down some candy bars—sweet rewards for bold stripes."
id"gen-j8uaJYCZV1lDNOySwg4YbarQKLVB" 
total_cost0.0000527
upstream_id"chat-4a3ec0f103364e318a9a7576044c7eeb" 
created_at"2024-08-26T22:31:00.381553+00:00" Monday, August 26th, 2024 at 10:31:00 PM GMT+00:00
model"microsoft/phi-3.5-mini-128k-instruct" 
app_id182717
streamedtrue
cancelledfalse
provider_name"Azure" 
latency152
moderation_latency(null)
generation_time10887
finish_reason"stop" 
tokens_prompt82
tokens_completion338
native_tokens_prompt84
native_tokens_completion443
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.0000527

Evaluation details

Result Evaluator Details Meta Data
86.3803% Matches word count n/a
neededCleanfalse
words211
94.8855% Dialogue to Total Word Ratio
Ratio: 94.79%, Deviation: 4.79%
neededCleanfalse
wordsTotal211
wordsDialogue200
90.6329%