NC Bench
Overview
About
Tests
Models
Model
Phi-3.5 Mini 128k
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2024-08-26-01J68DCEPTB95NE4R9VG32V3VT
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: "Alright, Bill, we've got one shot at this. No room for mistakes. Remember, timing is everything." Bill: "Yeah, I remember. Dinner's over, clown's on his phone, and presents are like pudding in a highchair – oh, so valuable." Molly: "Exactly. I've scoped out the place. See these gaps under the balcony door? Perfect for a gloved hand." Bill: "Mm, like filing cabinet drawer openings. But we'll need to be light on our feet. Not my style." Molly: "What's a rat's weakness but dizziness? If we hit the dance floor and pretend we're hot chicks, we'll avoid the party noses." Bill: "Hot chicks... never been good at the wiggle dance routine. I fumble more than a fish out of water." Molly: "It's showtime, Bill. Let's charm the fish. Can you mimic a catwalk nail-grab, or do I need to hire another dance pro?" Bill: "Honestly, my moves look more like a bull in a china shop. I could trip a
id
"gen-qjhBCNBhYhhoF2dAJLs7qh46t44H"
total_cost
0.000039
upstream_id
"chat-89eceed223d2400d8a6fd14941e431a2"
created_at
"2024-08-26T22:37:33.387852+00:00"
Monday, August 26th, 2024 at 10:37:33 PM GMT+00:00
model
"microsoft/phi-3.5-mini-128k-instruct"
app_id
182717
streamed
true
cancelled
false
provider_name
"Azure"
latency
143
moderation_latency
(null)
generation_time
8407
finish_reason
"content_filter"
tokens_prompt
82
tokens_completion
235
native_tokens_prompt
84
native_tokens_completion
306
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.000039
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
160
0.0000%
Dialogue to Total Word Ratio
Ratio: 95.00%, Deviation: 25.00%
neededClean
false
wordsTotal
160
wordsDialogue
152
0.0000%