NC Bench
Overview
About
Tests
Models
Model
Gemini Flash 1.5
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35T0W8D0JPQMPJNY464T97V
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“This is insane, Bill,” Molly hissed, clutching a ludicrously oversized plastic unicorn. “We’re stealing from kids.” Bill, already halfway into the party’s bouncy house, winked back at her. “Relax, they’ll get more. Besides, I’ve got my eye on the piñata. Full of candy, Molly. Candy!” Molly rolled her eyes. “You’re a terrible influence.” “I bring out the best in you,” Bill smirked, holding up a hand with a fistful of shiny, new toy cars. “Look, they’re practically begging to be taken.” Molly sighed, her gaze falling on a giggling mass of children in brightly colored birthday outfits. “I still think we should stick to banks, Bill.” “Banks are boring. Besides, where’s the fun in stealing from boring old men with bad haircuts?” Bill guffawed. “Now, where did I put that bag of balloons? I have a plan.”
id
"gen-7dx1zpLJnIwbsAsDs9vI9KHDAbjk"
total_cost
0.0001755
created_at
"2024-07-19T15:33:30.352002+00:00"
Friday, July 19th, 2024 at 3:33:30 PM GMT+00:00
model
"google/gemini-flash-1.5"
app_id
182717
streamed
false
cancelled
false
provider_name
"Google"
latency
2032
moderation_latency
(null)
generation_time
0
finish_reason
"STOP"
tokens_prompt
68
tokens_completion
207
native_tokens_prompt
69
native_tokens_completion
211
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0001755
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
137
0.0000%
Dialogue to Total Word Ratio
Ratio: 59.12%, Deviation: 30.88%
neededClean
false
wordsTotal
137
wordsDialogue
81
0.0000%