NC Bench
Overview
About
Tests
Models
Model
Mistral Medium
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35WQ56QD79TNBCN83GQQB8B
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: Bill, you sure this is a good idea? Bill: Come on, Molly, it's a children's party. What's the worst that could happen? Molly: I don't know, we get caught by a bunch of six-year-olds? Bill: Hey, they may be small, but they're not stupid. We need to be strategic about this. Molly: Alright, so what's the plan? Bill: We wait until the presents are all unwrapped and the kids are distracted by cake, then we make our move. Molly: And how are we going to carry all the presents out without being noticed? Bill: We'll stuff them in a giant teddy bear, of course. Molly: A teddy bear? Are you serious? Bill: Dead serious. It's the perfect disguise. Who would suspect a teddy bear of being a getaway vehicle? Molly: I guess that could work. But what if the kids want to keep their presents? Bill: We'll tell them we're taking them to be donated to a children's hospital. They'll be none the wiser. Molly: Alright, let's do this. But if we get caught, I'm blaming you. Bill: Deal. Now let's go find that teddy bear.
id
"gen-ooBZfppmuM0eCdpQO7UcVawet79a"
total_cost
0.0029106
upstream_id
"b917c51f2acf4a26b021cacc53e4fe90"
created_at
"2024-07-19T16:20:44.080953+00:00"
Friday, July 19th, 2024 at 4:20:44 PM GMT+00:00
model
"mistralai/mistral-medium"
app_id
182717
streamed
false
cancelled
false
provider_name
"Mistral"
latency
8564
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
68
tokens_completion
265
native_tokens_prompt
85
native_tokens_completion
331
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0029106
Evaluation details
Result
Evaluator
Details
Meta Data
86.3803%
Matches word count
n/a
neededClean
false
words
189
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 90.00%
neededClean
false
wordsTotal
189
wordsDialogue
0
43.1902%