NC Bench
Overview
About
Tests
Models
Model
Gemini Flash 1.5
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35T54MKKD41A9W5ZR5APP49
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Don’t you think we’re a little old for this?” Molly asked, her blue eyes sparkling with mischief, even though she was staring at the security camera feed of a brightly decorated party room. Bill, perched on a stack of old phone books, adjusted his glasses and snorted. “Old? We’re seasoned professionals, Molly. We’re beyond ‘old.’” He tapped the screen, zooming in on a chaotic scene of shrieking children and a frantic-looking woman in a clown costume. “Besides, who wouldn’t want to pull off a heist at a kids’ party?” Molly, a twinkle in her eyes, grinned. “You’re right. It’s practically a public service. Those parents probably wouldn’t know how to deal with those presents anyway. All that plastic, all those flashing lights.” “Exactly! We’ll be doing them a favor.” Bill winked. “Now, let’s get to work.” He picked up a pair of bolt cutters. “The first step? We’ll need to disable the clown. She seems to be the only one with any authority here.”
id
"gen-FdGjQbrRMytEQedv8iOOSJrfJ1lD"
total_cost
0.000195
created_at
"2024-07-19T15:35:50.263775+00:00"
Friday, July 19th, 2024 at 3:35:50 PM GMT+00:00
model
"google/gemini-flash-1.5"
app_id
182717
streamed
false
cancelled
false
provider_name
"Google"
latency
2361
moderation_latency
(null)
generation_time
0
finish_reason
"STOP"
tokens_prompt
68
tokens_completion
228
native_tokens_prompt
69
native_tokens_completion
237
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.000195
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
165
98.7730%
Dialogue to Total Word Ratio
Ratio: 53.33%, Deviation: 3.33%
neededClean
false
wordsTotal
165
wordsDialogue
88
49.3865%