Model

Phi-3 Mini 128k

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35TD1TWNERDKJWHWGAF4QG4

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly: "Alright, this is it, Bill. We swipe gifts from the healthiest alarm system—a kid's party." Bill: "Relax, we’ve got this. No one—babies, kids, or parents—hear us outside if we chill near the patio.” Molly: "Ha, genius. So, emergency exit plan? There’s just a roundabout and it goes uphill…” Bill: "Nice breakdown. Let's use that. Perfect car chase material. And our getaway? What?" Molly: "There’s the ’82 Skylark in the garage—cheap insurance. No rush lights, plus a baby can't spot it." Bill: "Perfect. Can you bring a dubstep playlist? I need some adrenaline." Molly: "Playlist ready. Oh! And those kidney-shaped cups? Unmarked containers will make them look like our own. Smart.” Bill: "That, plus we’ll place our jargon in napkins as apology notes. 'Sorry kids, we lost the tickets'... delayed poetic justice.” (They review their plan with a mix of dread and excitement. Molly, with a sly smile: "Last thing, carry that awkward silence, will you?" Bill: "It's all about that. Keeps you alive through the upgrades. Alright, this is insane but... absolutely hilarious.”) Molly: "To healthier alternative medicine or the thrill of the heist, here we stand.” Bill: "And laughs, that’s our best piece of stolen jewelry, right? Let's roll."
id"gen-Zr9Bg8bFv02aath0sdZ0wE8icYYt" 
total_cost0.0000511
upstream_id"cmpl-965db1a2ffdc43848ceec8c4f35b7eac" 
created_at"2024-07-19T15:40:11.941472+00:00" Friday, July 19th, 2024 at 3:40:11 PM GMT+00:00
model"microsoft/phi-3-mini-128k-instruct" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Azure" 
latency4783
moderation_latency(null)
generation_time0
finish_reason"stop" 
tokens_prompt82
tokens_completion323
native_tokens_prompt83
native_tokens_completion428
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.0000511

Evaluation details

Result Evaluator Details Meta Data
98.7124% Matches word count n/a
neededCleanfalse
words206
99.0727% Dialogue to Total Word Ratio
Ratio: 86.89%, Deviation: 3.11%
neededCleanfalse
wordsTotal206
wordsDialogue179
98.8925%