NC Bench
Model: Mistral Medium
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Run ID: 2024-07-19-01J35WQCMDXH3T78JNPEHEJZTQ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: You sure this is a good idea, Bill? Bill: Of course, it is! We'll be in and out before anyone even notices. Besides, who's going to suspect a couple of thieves at a kid's party? Molly: I don't know, Bill. Kids are pretty perceptive these days. Plus, have you seen the size of those bouncy castles? I don't think either of us would fit! Bill: Molly, you worry too much. Just stick to the plan. We'll pose as caterers, slip in unnoticed, and snatch up all the presents. Easy peasy. Molly: But what if someone recognizes us? I mean, we're not exactly the most inconspicuous pair. Bill: Relax, Molly. We'll wear disguises. Maybe a clown nose or two. Molly: Clowns? Seriously, Bill? That's your solution? Bill: Hey, it worked for those guys in that one movie, didn't it? Molly: You mean the movie where the clowns ended up getting caught? Bill: Well...when you put it that way, maybe we should come up with a different plan. Molly: Maybe we should just stick to robbing banks, Bill. At least they don't have bouncy castles. Bill: Yeah, but where's the fun in that? Besides, think of all the delighted kids we'll be making when they open up their empty presents on Christmas morning. Molly: Bill, I think that's the exact opposite of delight. But, alright, let's give it a shot. Just no clown noses, okay?
id: "gen-B34RPC5Fno9JEGImcWUv0R6v1H9r"
total_cost: 0.0035505
upstream_id: "269d6d6c5ccf427e84c994111326271e"
created_at: "2024-07-19T16:20:54.449481+00:00" (Friday, July 19th, 2024 at 4:20:54 PM GMT+00:00)
model: "mistralai/mistral-medium"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Mistral"
latency: 11326
moderation_latency: (null)
generation_time: 0
finish_reason: "stop"
tokens_prompt: 68
tokens_completion: 336
native_tokens_prompt: 85
native_tokens_completion: 410
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0035505
Evaluation details

Result: 0.0002%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false; words: 234

Result: 0.0000%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 0.00%, Deviation: 90.00%
Meta Data: neededClean: false; wordsTotal: 234; wordsDialogue: 0
0.0001%
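The dialogue evaluator's figures (wordsDialogue 0 out of wordsTotal 234, ratio 0.00%, deviation 90.00%) would be consistent with a check that only counts words inside quotation marks, since the response uses script-style "Molly:" / "Bill:" speaker labels and contains no quotation marks. The page does not show the evaluator's actual logic, so the following is only a minimal sketch under that assumption. The function name `dialogue_stats`, the whitespace word split, and the quote-matching rule are all hypothetical and are not taken from NC Bench.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.90) -> dict:
    """Estimate how much of a text is quoted dialogue.

    Assumptions (not confirmed by the source): words are whitespace-separated
    tokens, and dialogue is any text enclosed in straight or curly double quotes.
    """
    words_total = len(text.split())

    # Each match is a tuple: (straight-quoted span, curly-quoted span).
    quoted_spans = re.findall(r'"([^"]*)"|“([^”]*)”', text)
    words_dialogue = sum(
        len((straight or curly).split()) for straight, curly in quoted_spans
    )

    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        # Absolute distance from the requested share of dialogue.
        "deviation": abs(target_ratio - ratio),
    }


# Script-style lines have no quotation marks, so no dialogue words are counted.
script_style = "Molly: You sure this is a good idea, Bill? Bill: Of course, it is!"
print(dialogue_stats(script_style))

# The same kind of exchange written with quotation marks is counted as dialogue.
quoted_style = '"You sure this is a good idea, Bill?" Molly asked.'
print(dialogue_stats(quoted_style))
```

Under this assumed rule, a response written entirely as speaker-labelled lines scores a ratio of 0 and a deviation of 0.90 from the 90% target, which matches the shape of the result reported above.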