NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=1)
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-08-06-01J4MXRTKGRZPRZ3GZEWMCP4VC
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly peeked through the bushes at the colorful chaos of the birthday party on the lawn. Balloons bobbed like pastel jellyfish, and kids ran wild, noses already smudged with cake frosting. “Bill, are you sure about this?” she whispered, adjusting her neon wig. “Sure as a clown at a circus,” Bill replied, smoothing his own oversized, polka-dotted tie. “Think about it, Molly. Who’d suspect clowns at a kid’s party?” “The kids, maybe?” Molly rolled her eyes. “Nah, they’re too busy being sugar-fueled hooligans. Focus, Molly. The presents are under the gazebo,” Bill grinned, pointing his squirting lapel flower. “But breaking into a kid's party?” Molly shook her head, astounded. “This is a new low.” “Or a new high, if you like bouncy castles,” Bill chuckled. “We nab the loot, mingle a bit. Easy peasy.” “It’s not ‘easy peasy’ if Timmy starts a balloon sword fight again,” she retorted. “Just watch out for low-flying sugar zealots,” Bill said with a shrug. Molly snorted. “Alright, let’s do it. But if things go south, we’ll out-juggle them in a heartbeat.” “Perfect. Who’d be suspicious of two clumsy clowns?” Bill said as he added a final swipe of greasepaint.
id
"gen-9eohaQQSJtL6eMjkXx09t9YZV1m9"
total_cost
0.0031475
upstream_id
"chatcmpl-9tNBd1XCs0V1jcnHBFg7PvtGGZBOQ"
created_at
"2024-08-06T22:43:20.90432+00:00"
Tuesday, August 6th, 2024 at 10:43:20 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3781
moderation_latency
171
generation_time
0
finish_reason
"stop"
tokens_prompt
68
tokens_completion
302
native_tokens_prompt
67
native_tokens_completion
298
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0031475
Evaluation details
Result
Evaluator
Details
Meta Data
99.9840%
Matches word count
n/a
neededClean
false
words
198
65.2854%
Dialogue to Total Word Ratio
Ratio: 58.08%, Deviation: 8.08%
neededClean
false
wordsTotal
198
wordsDialogue
115
82.6347%