NC Bench
Overview
About
Tests
Models
Model
Phi-3.5 Mini 128k
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-08-26-01J68DESACBT9ZAAFQ16Z39VG6
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: "So, Bill, we literally walk into a kid's birthday, right?" Bill: "Yes, Molly. It's all about timing. Everyone will have their eyes glued to Mr. Fuzzy Rabbit." Molly: "Mr. Fuzzy Rabbit, huh? I've got a fairy tale villain for you. Legend has it he's got treasures." Bill: "Potential gold, not just dollar bills; we're talking shiny stuff here…plush Tigers, limited-edition action figures." Molly: (smirking) "You're on! Just make sure we're invisible. Color-blind, silent, and silent-footed. No sticky fingers in this operation." Bill: "Understood. And gadgets - The Echo Caretaker will keep us under wraps. No eardrums will recognize our subtle taps." Molly: "Better rolls with a hacked iPod; we sync it with our grabbers. Hacked to perfection - discreet yet deadly." Bill: (chuckles) "You're full of tricks, Molly. So, when do we strike?" Molly: "When Santa gets here. None of those cute little elves eyeing us. Blend in until he's done his pirate tour." Bill: "I suppose fairy-tale time doesn’t count, does it? People don't take chances with ghosts." Molly: "Santa’s a bit of a ghost, Bill. And at midnight’s strike, our myths will turn into riches." Bill: "Invisible, silent, silent-footed, with phones in our hands. There's only one way to go, shall we raise the flag of adventure?" Molly: "Raise it, partner. Here's to stealing the crown jewels of childhood." Bill: "To stealth, and to hoards gathered from invisible hands." Molly: "Let's do this, Team Rabbit Wonderland." Bill: "Team Rabbit. Out." (They share a nod and proceed into the party, their mission poised on the precipice of comedy and chaos.)
id
"gen-UiorVmhVzi9Edo86QtUX6Fs4U8yM"
total_cost
0.0000625
upstream_id
"chat-7af14516dca64b9297d214942a06871e"
created_at
"2024-08-26T22:38:54.833445+00:00"
Monday, August 26th, 2024 at 10:38:54 PM GMT+00:00
model
"microsoft/phi-3.5-mini-128k-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Azure"
latency
158
moderation_latency
(null)
generation_time
13389
finish_reason
"stop"
tokens_prompt
82
tokens_completion
413
native_tokens_prompt
84
native_tokens_completion
541
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0000625
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
265
97.5654%
Dialogue to Total Word Ratio
Ratio: 86.04%, Deviation: 3.96%
neededClean
false
wordsTotal
265
wordsDialogue
228
48.7827%