NC Bench
Model: Gemini Pro 1.5
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Run ID: 2024-07-19-01J35SY69QJQ3GBVXWYGARAEY0
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Molly," Bill hissed, his face nearly pressed against the tinted windows of the party bus, "unicorns. Unicorns, I tell ya!" Molly snatched the binoculars from him. "Get a grip, Bill. It’s a kid's birthday party, not a mythical creature convention." Bill rubbed his hands together. “Exactly! Think of the resale value. Parents go nuts for that handcrafted, organic, ethically-sourced unicorn fluff.” Molly rolled her eyes. “You’re delusional. We’re here for the presents, remember? Designer clothes they'll outgrow in a week, limited-edition action figures their parents will probably try to sell in twenty years—" "But Molly," Bill cut in, his voice thick with mock hurt, "what about the magic?" Molly sighed. "Fine," she conceded, "We grab the unicorn paraphernalia. But if I get stuck riding off with a sparkly horn sticking out the window, I’m blaming you."
id: "gen-0MzKqZzM3aSOv0rBdqW6efZYhGu6"
total_cost: 0.0017325
created_at: "2024-07-19T15:32:05.509385+00:00" (Friday, July 19th, 2024 at 3:32:05 PM GMT+00:00)
model: "google/gemini-pro-1.5"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Google"
latency: 5262
moderation_latency: (null)
generation_time: 0
finish_reason: "STOP"
tokens_prompt: 68
tokens_completion: 211
native_tokens_prompt: 69
native_tokens_completion: 208
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0017325
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 138
0.0000% | Dialogue to Total Word Ratio | Ratio: 69.57%, Deviation: 20.43% | neededClean: false; wordsTotal: 138; wordsDialogue: 96
0.0000%
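
The reported ratio and deviation follow from the metadata: 96 dialogue words out of 138 total is 69.57%, which is 20.43 percentage points short of the 90% target. The Python sketch below shows one way such a check could be computed. It is not NC Bench's actual evaluator: the function names, the quote-matching regex, and the whitespace-based word count are all assumptions made for illustration, so it is not guaranteed to reproduce the 96 and 138 counts from the response text.

```python
import re

# Hypothetical pattern: text between double quotes, straight or curly.
# The real evaluator's quote handling is not documented in this page.
QUOTED = re.compile(r'["\u201c]([^"\u201c\u201d]*)["\u201d]')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed definition of 'word')."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Fraction of words that fall inside double-quoted spans."""
    total = count_words(text)
    if total == 0:
        return 0.0
    spoken = sum(count_words(span) for span in QUOTED.findall(text))
    return spoken / total


def deviation(ratio: float, target: float = 0.90) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(target - ratio)


# Arithmetic using the counts reported in the Meta Data column.
reported_ratio = 96 / 138
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation(reported_ratio):.2%}")
# prints "Ratio: 69.57%, Deviation: 20.43%"
```

The same word-count helper also illustrates the first evaluator row: the response was counted at 138 words against a requested 200.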