NC Bench
Overview
About
Tests
Models
Model
GPT-4o, May 13th (temp=0)
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35SH370FX955DGYHCSH8498
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind the bushes, peering through the gaps at the colorful chaos of the children's party. "Bill, are you sure this is a good idea?" Molly whispered, her eyes darting around nervously. "Of course, Molly. Who'd suspect two adults at a kids' party?" Bill grinned, adjusting his clown wig. "But what if they recognize us?" Molly asked, tugging at her oversized bowtie. "Relax. We look like party entertainers. No one's going to think twice," Bill assured her, pulling out a map of the backyard. "Okay, so what's the plan?" Molly leaned in closer, her curiosity piqued. "Simple. We blend in, do a few tricks, and while everyone's distracted, we grab the presents," Bill explained, pointing to the pile of brightly wrapped gifts. "And if someone catches us?" Molly's voice wavered slightly. "We improvise. Remember, we're professionals," Bill said, giving her a confident nod. "Right. Professionals," Molly repeated, trying to muster some confidence. "Ready?" Bill asked, standing up and adjusting his red nose. "As I'll ever be," Molly replied, taking a deep breath. "Let's do this," Bill said, leading the way towards the party, their ridiculous costumes blending perfectly with the festive atmosphere.
id
"gen-vXeEt4ppz8IWgimQQTXzbcdxdOi2"
total_cost
0.004385
upstream_id
"chatcmpl-9mjlTZICc3LNs3s4GTWeW6KUIN5xx"
created_at
"2024-07-19T15:25:01.036871+00:00"
Friday, July 19th, 2024 at 3:25:01 PM GMT+00:00
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
9745
moderation_latency
199
generation_time
0
finish_reason
"stop"
tokens_prompt
68
tokens_completion
279
native_tokens_prompt
67
native_tokens_completion
270
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.004385
Evaluation details
Result
Evaluator
Details
Meta Data
99.3769%
Matches word count
n/a
neededClean
false
words
195
44.4805%
Dialogue to Total Word Ratio
Ratio: 40.51%, Deviation: 9.49%
neededClean
false
wordsTotal
195
wordsDialogue
79
71.9287%