NC Bench
Overview
About
Tests
Models
Model
Gemini Pro 1.5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35T87YX0GJ2CA6GRWFMETJC
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"A bouncy castle, Bill. A bouncy castle!" Molly whisper-shouted, waving her phone in his face. Bill rubbed his eyes. "At four in the morning, it better be a life-changing bouncy castle." "Five thousand dollars worth of presents, Bill! Five thousand! This kid, Max, his parents are loaded." Bill grabbed the phone, squinting at a picture of a grinning child surrounded by mountains of wrapped gifts. "They spend five grand on a four-year-old? What's he getting, a pony?" "Who cares? We hit it at naptime, grab and dash." Bill grimaced. "You realize I'll have to wear a clown suit again, right?" Molly grinned, "What, scared of looking silly?" "I'm a thief with dignity, Molly. Clowns don't do dignity." "Fine, you distract with balloon animals. I'll be the one stuffing my pockets with pony-shaped loot."
id
"gen-VYrPYrRko9UpoIXdO4Rt750koHm9"
total_cost
0.00177
created_at
"2024-07-19T15:37:35.241664+00:00"
Friday, July 19th, 2024 at 3:37:35 PM GMT+00:00
model
"google/gemini-pro-1.5"
app_id
182717
streamed
false
cancelled
false
provider_name
"Google"
latency
5675
moderation_latency
(null)
generation_time
0
finish_reason
"STOP"
tokens_prompt
68
tokens_completion
196
native_tokens_prompt
69
native_tokens_completion
213
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00177
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
138
0.3812%
Dialogue to Total Word Ratio
Ratio: 74.64%, Deviation: 15.36%
neededClean
false
wordsTotal
138
wordsDialogue
103
0.1906%