NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=0)
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-08-06-01J4MXVFN1WCR43G2R51APCZZ7
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind a row of hedges, peering at the chaos of balloons and children in the backyard. "Bill, are you sure this is a good idea?" Molly whispered, eyeing the bouncy castle. "Of course, Molly. Who'd suspect a heist at a kid's party?" Bill grinned, adjusting his oversized sunglasses. "But what if we get caught? I don't want to end up in a clown car chase," Molly said, glancing nervously at the face-painted entertainer. "Relax, Molly. We blend in, grab the loot, and we're out. Easy peasy," Bill assured, patting the duffel bag. "Blend in? Bill, we're not exactly dressed for a party," Molly pointed out, gesturing to their all-black attire. "Details, details. Just act like you belong," Bill replied, shrugging. "Okay, but if anyone asks, we're the balloon animal specialists," Molly suggested, trying to sound confident. "Balloon animals? I can barely tie my shoelaces," Bill chuckled. "Then let's hope no one asks for a giraffe," Molly said, rolling her eyes. "Alright, on three. One, two—" "Wait, Bill! What if they have a guard dog?" Molly interrupted, suddenly panicked. "Then we offer it a squeaky toy and hope for the best," Bill winked. "You're impossible," Molly sighed, shaking her head. "That's why you love me," Bill smirked, ready to make their move.
id
"gen-HDChjPT0HO09uJgx5SrJF8dIVacq"
total_cost
0.0033675
upstream_id
"chatcmpl-9tND287Z2sFN4tWubLjz09l6JtDkI"
created_at
"2024-08-06T22:44:47.813783+00:00"
Tuesday, August 6th, 2024 at 10:44:47 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3594
moderation_latency
143
generation_time
0
finish_reason
"stop"
tokens_prompt
68
tokens_completion
325
native_tokens_prompt
67
native_tokens_completion
320
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0033675
Evaluation details
Result
Evaluator
Details
Meta Data
43.3783%
Matches word count
n/a
neededClean
false
words
217
58.9630%
Dialogue to Total Word Ratio
Ratio: 58.53%, Deviation: 8.53%
neededClean
false
wordsTotal
217
wordsDialogue
127
51.1707%