Model

Phi-3 Mini 128k

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35SXGKE6BY36MAC016J6HR3

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
“Molly, you remember that party down at the park? Last year's cake was a disaster,” Bill casually leaned against a tree. "Don't remind me," Molly sighed, inspecting the gift wrap with meticulous care. “It's gonna be wild. We've got to stick to our game plan." Bill checked his watch, “Nightfall and we’re in. Remember, it's all about distraction." "Right. How about we rig the games to cost us those flamingos—extravagant, naughty, but unusable chaos.” The twinkle in Molly's eye didn’t wane as she said, “Ah, and our hands will be free for the magic of wrapping paper and festive garb shop." Bill chuckled, “Make sure you've got the ‘map’ to our escape, puff up your coat as if avalanches are behind you." "And throw ‘Balloon Pop Sensation,'" Molly suggested. "Party's distracted, we slip away on 'Pop! Pop! Bam!'” Bill leaned in, whispering, “And timing’s key. Half past seven, they'll think the Border Collies escaped. Ol' Buck and Maggie will suspect each other till dusk.” "We make the most daring of pilferers in their dreams, Bill." “Planning to be the jewels' guard instead of their gas" he mused, grinning as they glided into the shadows.
id: "gen-CkC1MGmXRHLPMNZknsvbGJrytmPL"
total_cost: 0.0000457
upstream_id: "cmpl-15617d60cef94d82af7d5a73e83bd38c"
created_at: "2024-07-19T15:31:42.224482+00:00" (Friday, July 19th, 2024 at 3:31:42 PM GMT+00:00)
model: "microsoft/phi-3-mini-128k-instruct"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Azure"
latency: 4214
moderation_latency: (null)
generation_time: 0
finish_reason: "stop"
tokens_prompt: 82
tokens_completion: 288
native_tokens_prompt: 83
native_tokens_completion: 374
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0000457

Evaluation details

Result: 99.3769%
Evaluator: Matches word count
Details: n/a
Meta Data:
neededClean: false
words: 195

Result: 3.3018%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 76.41%, Deviation: 13.59%
Meta Data:
neededClean: false
wordsTotal: 195
wordsDialogue: 149
51.3394%
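
The ratio and deviation reported above can be reproduced from the word counts: 149 dialogue words out of 195 total gives 76.41%, which is 13.59 percentage points short of the 90% the scenario asks for. The following is a minimal sketch of that arithmetic, not the benchmark's actual evaluator, whose implementation is not shown here. The function names, the quote-toggling heuristic, and the whitespace-based word count are all assumptions made for illustration, and the sketch does not attempt to reproduce how the 3.3018% or 99.3769% result scores are derived.

```python
# Hypothetical helper names; the real evaluator's logic is not published in this log.
QUOTE_CHARS = '"\u201c\u201d'  # straight and curly double quotes


def split_dialogue(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words).

    Assumption: every double-quote character toggles between narration
    and dialogue, and words are whitespace-separated tokens.
    """
    inside = False
    dialogue_parts: list[str] = []
    narration_parts: list[str] = []
    current: list[str] = []
    for ch in text:
        if ch in QUOTE_CHARS:
            # Close the current segment and flip the state.
            (dialogue_parts if inside else narration_parts).append("".join(current))
            current = []
            inside = not inside
        else:
            current.append(ch)
    (dialogue_parts if inside else narration_parts).append("".join(current))

    dialogue = sum(len(part.split()) for part in dialogue_parts)
    narration = sum(len(part.split()) for part in narration_parts)
    return dialogue, dialogue + narration


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.90) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(target - ratio)


if __name__ == "__main__":
    # Word counts taken from the evaluation meta data in this log.
    ratio, deviation = ratio_and_deviation(149, 195)
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

Running the sketch on the logged counts prints `Ratio: 76.41%, Deviation: 13.59%`, matching the details line. The closing figure of 51.3394% is consistent with the mean of the two evaluator results (99.3769% and 3.3018%), although the log itself does not label it.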