NC Bench
Model: GPT-4o, Aug. 6th (temp=1)
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Run ID: 2024-08-06-01J4MXSEDEMV3217EEKK5DZDDV
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled behind a bouncy castle, their eyes gleaming with mischief. “Alright, Bill, what's the plan?” Molly whispered, glancing at the rainbow swirl of balloons above them. “Simple, Molly. We slip in through the clown's entrance,” Bill replied, grinning. “Clowns always have a backdoor policy.” “Backdoor policy? You mean where the pies go in and the dignity comes out?” she chuckled. “Precisely. Now, we head for the gift table,” Bill continued, adjusting his bright red nose. “Easy pickings.” “And what about the kids?” Molly asked, peering over at the sugar-charged tornado of little guests. “Distraction, my dear Molly,” Bill said with a wink. “I’ll start a conga line.” “A conga line?” She raised an eyebrow. “At a kid’s party?” “Trust me, nothing distracts like a line of dancing toddlers,” Bill declared confidently. “I hope you’re right,” Molly mumbled. “I’ve never seen a toddler conga trained.” “Stick with me, kid.” Bill grinned, slipping a balloon sword into his belt. “We’re almost in.” “Alright, but if I get pied, you’re on cleanup duty,” Molly warned. “Deal,” Bill laughed, as they crept closer to the colorful chaos. “Now, onto the grand heist.” “Presents beware,” Molly muttered, shaking her head.
id: "gen-hN1KArcIq3CSJEvqSnUZq2IWLTFn"
total_cost: 0.0032275
upstream_id: "chatcmpl-9tNBx8Mh8naWBJqRyoqvUbc9PB6uV"
created_at: "2024-08-06T22:43:41.312737+00:00" (Tuesday, August 6th, 2024 at 10:43:41 PM GMT+00:00)
model: "openai/gpt-4o-2024-08-06"
app_id: 182717
streamed: false
cancelled: false
provider_name: "OpenAI"
latency: 3859
moderation_latency: 186
generation_time: 0
finish_reason: "stop"
tokens_prompt: 68
tokens_completion: 302
native_tokens_prompt: 67
native_tokens_completion: 306
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0032275
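The reported total_cost can be sanity-checked against the native token counts. The sketch below is only an illustration: the per-million-token prices (2.50 USD for prompt tokens, 10.00 USD for completion tokens) are an assumption that is not stated anywhere on this page, and `estimate_cost` is a hypothetical helper, not part of any provider API.

```python
from decimal import Decimal

# ASSUMPTION: USD prices per one million tokens; not taken from this page.
PROMPT_PRICE_PER_M = Decimal("2.50")
COMPLETION_PRICE_PER_M = Decimal("10.00")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Hypothetical helper: cost in USD for the given token counts."""
    prompt_cost = prompt_tokens * PROMPT_PRICE_PER_M / ONE_MILLION
    completion_cost = completion_tokens * COMPLETION_PRICE_PER_M / ONE_MILLION
    return prompt_cost + completion_cost


# native_tokens_prompt and native_tokens_completion from the run metadata
cost = estimate_cost(67, 306)
print(cost)
```

Under these assumed prices, the native token counts (67 and 306) reproduce the listed total_cost, which suggests billing is based on the native counts rather than tokens_prompt and tokens_completion.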
Evaluation details
Result: 99.9990%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean = false, words = 199

Result: 89.4467%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 55.78%, Deviation: 5.78%
Meta Data: neededClean = false, wordsTotal = 199, wordsDialogue = 111
94.7229%
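The ratio and deviation reported by the dialogue evaluator follow directly from the word counts in its metadata (111 dialogue words out of 199, against the 50% target in the scenario). The sketch below reproduces that arithmetic. The function names are hypothetical, and the quote-based word counter is only one plausible way to count dialogue words; the page does not describe how NC Bench counts them or how it turns the deviation into the 89.4467% score.

```python
import re

# Matches text inside curly or straight double quotes.
# ASSUMPTION: dialogue is whatever sits between double quotes.
QUOTED = re.compile(r'“([^”]*)”|"([^"]*)"')


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Hypothetical counter: words that appear inside quoted spans."""
    return sum(
        count_words(m.group(1) or m.group(2) or "")
        for m in QUOTED.finditer(text)
    )


def ratio_report(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio %, absolute deviation from target %), rounded to 2 places."""
    ratio = words_dialogue / words_total
    return round(ratio * 100, 2), round(abs(ratio - target) * 100, 2)


def mean_score(scores):
    """Plain arithmetic mean of evaluator scores."""
    return sum(scores) / len(scores)


# Counts taken from the evaluator metadata above.
print(ratio_report(111, 199))

# A short excerpt from the model response, used only to exercise the counter.
sample = "Bill grinned. “We’re almost in.” Molly sighed."
print(dialogue_words(sample), count_words(sample))

# Mean of the two evaluator results shown above.
print(mean_score([99.9990, 89.4467]))
```

With the listed counts, `ratio_report` gives 55.78 and 5.78, matching the Details column. The final figure on the page, 94.7229%, is numerically consistent with the mean of the two evaluator results, although the page does not state that this is how it is computed.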