Model

Gemini Flash 1.5

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35T8C2B3PSG582WY8MYK0P8

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
“Alright, Bill,” Molly said, pulling out a meticulously drawn diagram from her purse. “This is the layout of the party. You see this?” She jabbed a finger at a red dot. “The piñata. It’s where they’ll be storing all the presents. One guy, a Mr. Snuggles, is in charge of it. He’s apparently quite fond of a large, yellow rubber chicken.” Bill squinted at the diagram. “Wait, a rubber chicken? I love rubber chickens. We should steal that, too. It’ll be just like in that old heist movie with the monkey and the banana.” Molly slapped a hand to her forehead. “Bill, we have a target, not a sideshow. Now, the plan is this: I’ll act as a ‘volunteer’ clown. I’ll distract Mr. Snuggles with a balloon animal. Meanwhile, you’ll sneak into the piñata, steal the presents, and-" “But why the clown? I could be a pirate, you know,” Bill interrupted, tugging on the end of his bushy mustache. "A pirate is much cooler and wouldn't have to wear that itchy makeup." "Because clowns are terrifying," Molly said, deadpan. “And trust me, the kids will be too busy running away to even notice a pirate.”
id: "gen-KPUAgk5a7wsR1YHMA6wBsil3Ir8z"
total_cost: 0.00023175
created_at: "2024-07-19T15:37:36.54578+00:00" (Friday, July 19th, 2024 at 3:37:36 PM GMT+00:00)
model: "google/gemini-flash-1.5"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Google"
latency: 2746
moderation_latency: (null)
generation_time: 0
finish_reason: "STOP"
tokens_prompt: 68
tokens_completion: 283
native_tokens_prompt: 69
native_tokens_completion: 286
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.00023175

Evaluation details

Evaluator: Matches word count
Result: 99.3769%
Details: n/a
Meta Data: neededClean: false, words: 195

Evaluator: Dialogue to Total Word Ratio
Result: 8.2755%
Details: Ratio: 77.44%, Deviation: 12.56%
Meta Data: neededClean: false, wordsTotal: 195, wordsDialogue: 151
53.8262%
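
The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be checked with a little arithmetic. The sketch below is only an illustration under stated assumptions: that words are whitespace-separated tokens, that dialogue is any text between double quotes (straight or curly), and that the deviation is the absolute distance in percentage points from the 90% target named in the scenario. The function names are hypothetical, and the benchmark's actual tokenizer and its mapping from deviation to the Result percentage are not documented here, so neither is reproduced.

```python
import re

# Assumption: dialogue is whatever sits between a pair of double quotes,
# straight (") or curly (\u201c \u201d). The benchmark's real rule is unknown.
DIALOGUE_SPAN = re.compile(r'["\u201c]([^"\u201c\u201d]*)["\u201d]')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (hypothetical tokenization)."""
    return len(text.split())


def ratio_from_counts(words_dialogue: int, words_total: int,
                      target_pct: float = 90.0) -> tuple[float, float]:
    """Return (ratio in %, absolute deviation from target in points)."""
    if words_total == 0:
        return 0.0, round(target_pct, 2)
    ratio_pct = 100.0 * words_dialogue / words_total
    return round(ratio_pct, 2), round(abs(target_pct - ratio_pct), 2)


def dialogue_stats(scene: str, target_pct: float = 90.0) -> dict:
    """Estimate the dialogue share of a scene from its quoted spans."""
    words_total = count_words(scene)
    words_dialogue = sum(count_words(span)
                         for span in DIALOGUE_SPAN.findall(scene))
    ratio_pct, deviation = ratio_from_counts(words_dialogue, words_total,
                                             target_pct)
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio_pct,
        "deviation": deviation,
    }


# Counts taken from the Meta Data above: 151 dialogue words out of 195.
print(ratio_from_counts(151, 195))  # (77.44, 12.56)
```

Running `dialogue_stats` on the model response itself may not return exactly 195 and 151, since the response mixes straight and curly quotes and the evaluator's counting rules may differ from the assumptions above.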