NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=1)
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2024-08-06-01J4MXT45BNQJQEDQDDNKC79TQ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled in the shadows of a garden shed, their eyes gleaming with mischief. Just beyond the hedges, the shrieks of festive chaos spilled from the backyard. Croquet hoops lay abandoned, and balloons bobbed merrily along the fence. Bill, adjusting his ski mask to rest fashionably askew, whispered, "Hope you brought your best party face." Molly rolled her eyes, tucking a stray hair back into her baseball cap. "The mission's simple. Smile, swipe, and skedaddle." “Remember the cake distraction?” Molly nodded, jangling a key ring that wasn't hers. “I’m cake, you’re piñata. Don't mix it up this time.” Bill stifled a snort, his thoughts drifting to their last escapade involving an unfortunate tripping over a dog piñata. “Just stick to the plan, Molly.” “Right. Grab, giggle, go,” she said, grinning wildly. The moment was right. Molly left a deliberate smudge of icing on her nose, perfecting her disguise as an innocent party-goer. Bill shuffled beside her, muttering under his breath. “After you, Picasso,” he quipped, nodding towards the chaos. With a synchronized step, they infiltrated the giggles and streamers, seamlessly blending with the crowd. Each present was a small masterpiece waiting to be liberated into their comically oversized backpack. Victory danced before them like brightly colored wrapping paper.
id
"gen-aLnUBHkJBTYYWE2ZCQ5sO9FFqmGU"
total_cost
0.0032275
upstream_id
"chatcmpl-9tNCJD4l8LfeTerugmI7Ylzi9rvX1"
created_at
"2024-08-06T22:44:03.375449+00:00"
Tuesday, August 6th, 2024 at 10:44:03 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3675
moderation_latency
188
generation_time
0
finish_reason
"stop"
tokens_prompt
68
tokens_completion
306
native_tokens_prompt
67
native_tokens_completion
306
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0032275
Evaluation details
Result
Evaluator
Details
Meta Data
86.3803%
Matches word count
n/a
neededClean
false
words
211
45.3304%
Dialogue to Total Word Ratio
Ratio: 19.43%, Deviation: 9.43%
neededClean
false
wordsTotal
211
wordsDialogue
41
65.8554%