Model

Gemini Flash 1.5

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35SZHRYKXV7A32V8QT7HHVJ

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"This is insane, Bill. A children's party?" Molly said, eyes wide. "They're loaded! Think of all the presents. This is like hitting a toy store on Black Friday." Bill grinned, practically drooling. "But the police! There'll be a hundred parents there, all high on cake and punch." "Perfect. We blend right in! I'll be Mr. Smiley, the nice uncle who brings his own clown nose. You can be… the balloon lady." Molly rolled her eyes. "You're awful." "But I'm a genius! They'll be so busy trying to keep their kids from eating their own fingers, they'll never notice us." "And how exactly do you plan on getting past a room full of screaming, sugared-up five-year-olds?" Bill shrugged. "We'll bring bubble gum. They'll be too busy trying to blow bubbles to notice us." "You're a terrible person." "But a very successful one." Bill winked. "So what do you say? Are you in?"
id"gen-xTEfK4equKw2h263uGnsIX8mRQeL" 
total_cost0.000198
created_at"2024-07-19T15:32:46.864374+00:00" Friday, July 19th, 2024 at 3:32:46 PM GMT+00:00
model"google/gemini-flash-1.5" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Google" 
latency2130
moderation_latency(null)
generation_time0
finish_reason"STOP" 
tokens_prompt68
tokens_completion211
native_tokens_prompt69
native_tokens_completion241
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.000198

Evaluation details

Result Evaluator Details Meta Data
0.0000% Matches word count n/a
neededCleanfalse
words154
99.9998% Dialogue to Total Word Ratio
Ratio: 89.61%, Deviation: 0.39%
neededCleanfalse
wordsTotal154
wordsDialogue138
49.9999%