NC Bench
Overview
About
Tests
Models
Model
GPT-4o, May 13th (temp=1)
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35T6Z4TK7ESDTPDV0R5F07P
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind the brightly painted fence, eying the clown at the center of the garden. Molly whispered, "Are you sure about this, Bill? Stealing from a kids' party seems low." Bill smirked, "Where's your sense of adventure, Molly? Anyway, look at those gifts. Must be at least a dozen." "Those balloon animals don't exactly scream 'Fort Knox' security." "Exactly," Bill said, adjusting his ski mask with exaggerated care. "It's a piece of cake. Speaking of which, they probably have cake." "Chocolate or vanilla?" "Does it matter?" Bill laughed. "Stick to the plan: you distract with a juggling act, I'll grab the goods." Molly frowned. "I don't know how to juggle." "Then do balloon animals." "I can barely blow up a balloon without passing out." Bill rolled his eyes. "Fine, just do a dance or something. Anything to keep them busy." "How about I pretend to be a lost kid?" "You want to blend in while wearing a ski mask?" Molly shrugged. "I'm small; I could pass for a ten-year-old." "Yeah, a ten-year-old with a five-o'clock shadow." "If you think this is such a great idea, why don't you do the distraction?" "Because Molly, I don't do performances. I'm the brains." "And what am I, Bill?" "The indispensable sidekick. Now, ready?" "No, but when has that ever stopped us?" Bill grinned. "Let's make this the heist of the century. Ready, set, go!"
id
"gen-2IFm9Chw83sJ6W30jAyEWguzK4AQ"
total_cost
0.005285
upstream_id
"chatcmpl-9mjx2euqxF80RUhXrQsHB0RjEHazj"
created_at
"2024-07-19T15:36:57.840864+00:00"
Friday, July 19th, 2024 at 3:36:57 PM GMT+00:00
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
9665
moderation_latency
218
generation_time
0
finish_reason
"stop"
tokens_prompt
68
tokens_completion
337
native_tokens_prompt
67
native_tokens_completion
330
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.005285
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
239
0.0000%
Dialogue to Total Word Ratio
Ratio: 82.01%, Deviation: 32.01%
neededClean
false
wordsTotal
239
wordsDialogue
196
0.0000%