NC Bench
Overview
About
Tests
Models
Model
Gemini Flash 1.5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35T8C2B3PSG582WY8MYK0P8
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Alright, Bill,” Molly said, pulling out a meticulously drawn diagram from her purse. “This is the layout of the party. You see this?” She jabbed a finger at a red dot. “The piñata. It’s where they’ll be storing all the presents. One guy, a Mr. Snuggles, is in charge of it. He’s apparently quite fond of a large, yellow rubber chicken.” Bill squinted at the diagram. “Wait, a rubber chicken? I love rubber chickens. We should steal that, too. It’ll be just like in that old heist movie with the monkey and the banana.” Molly slapped a hand to her forehead. “Bill, we have a target, not a sideshow. Now, the plan is this: I’ll act as a ‘volunteer’ clown. I’ll distract Mr. Snuggles with a balloon animal. Meanwhile, you’ll sneak into the piñata, steal the presents, and-" “But why the clown? I could be a pirate, you know,” Bill interrupted, tugging on the end of his bushy mustache. "A pirate is much cooler and wouldn't have to wear that itchy makeup." "Because clowns are terrifying," Molly said, deadpan. “And trust me, the kids will be too busy running away to even notice a pirate.”
id
"gen-KPUAgk5a7wsR1YHMA6wBsil3Ir8z"
total_cost
0.00023175
created_at
"2024-07-19T15:37:36.54578+00:00"
Friday, July 19th, 2024 at 3:37:36 PM GMT+00:00
model
"google/gemini-flash-1.5"
app_id
182717
streamed
false
cancelled
false
provider_name
"Google"
latency
2746
moderation_latency
(null)
generation_time
0
finish_reason
"STOP"
tokens_prompt
68
tokens_completion
283
native_tokens_prompt
69
native_tokens_completion
286
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00023175
Evaluation details
Result
Evaluator
Details
Meta Data
99.3769%
Matches word count
n/a
neededClean
false
words
195
8.2755%
Dialogue to Total Word Ratio
Ratio: 77.44%, Deviation: 12.56%
neededClean
false
wordsTotal
195
wordsDialogue
151
53.8262%