NC Bench
Model: GPT-4o, Aug. 6th (temp=1)
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Run ID: 2024-08-06-01J4MXRVPMZCW2C7ER17ASC46T
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind the inflatable dinosaur, peering out at the chaos unfolding on the other side of the garden fence. Balloons bobbed in the breeze, and children sprinted past, faces painted and sticky with candy residue. The prize, a mountain of brightly wrapped presents, sat temptingly beneath a large banner that read "Happy Birthday, Timmy!" Molly nudged Bill, her eyes alight with mischief. "Distract with balloon animals?" Bill snorted softly. "You think I'm gonna twist a decent giraffe?" "Fair point," Molly admitted, scanning the scene. "Face paint, then?" "Smudge their tiger stripes and send 'em roaring elsewhere?" "Precisely," she winked. "And the cake's the backup plan." Bill's eyes widened with understanding. "Right, a frosting stampede." "Exactly. Once they're elbows-deep in sugar..." Molly gestured to the pile of gifts as though she were unveiling a priceless artifact. Bill nodded, stifling a laugh. "And we waltz right out, undetected." "Simple yet elegant," Molly asserted. "Let's not underestimate the little devils," Bill cautioned, casting a glance at a kid who'd just body-slammed another in pursuit of a rogue balloon. "Relax," Molly assured, a grin playing at her lips. "We'll be legends in the under-five circuit." With synchronized stealth, they vanished into the vibrant chaos of birthday pandemonium.
id: "gen-lR338jmvO3cD6qOwrD26OTwadTXJ"
total_cost: 0.0032075
upstream_id: "chatcmpl-9tNBedF13MwSBtpvQFEls95kSL1AK"
created_at: "2024-08-06T22:43:22.061828+00:00" (Tuesday, August 6th, 2024 at 10:43:22 PM GMT+00:00)
model: "openai/gpt-4o-2024-08-06"
app_id: 182717
streamed: false
cancelled: false
provider_name: "OpenAI"
latency: 3811
moderation_latency: 156
generation_time: 0
finish_reason: "stop"
tokens_prompt: 68
tokens_completion: 311
native_tokens_prompt: 67
native_tokens_completion: 304
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0032075
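The total_cost value can be reproduced from the native token counts. The sketch below is a sanity check, not the provider's billing logic: it assumes rates of $2.50 per million prompt tokens and $10.00 per million completion tokens, which are not stated on this page, and `estimate_cost` is a hypothetical helper name.

```python
# Assumed per-token prices in USD (not given on this page).
PROMPT_PRICE_PER_MILLION = 2.50
COMPLETION_PRICE_PER_MILLION = 10.00


def estimate_cost(native_prompt_tokens: int, native_completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    prompt_cost = native_prompt_tokens * PROMPT_PRICE_PER_MILLION / 1_000_000
    completion_cost = native_completion_tokens * COMPLETION_PRICE_PER_MILLION / 1_000_000
    return prompt_cost + completion_cost


# Values taken from native_tokens_prompt and native_tokens_completion.
cost = estimate_cost(67, 304)
print(f"{cost:.7f}")
```

Under these assumed rates the estimate agrees with the reported total_cost and usage figures, which suggests billing is based on the native token counts (67 and 304) rather than the normalized ones (68 and 311).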
Evaluation details
| Result | Evaluator | Details | Meta Data |
|---|---|---|---|
| 93.6496% | Matches word count | n/a | neededClean: false; words: 209 |
| 0.0000% | Dialogue to Total Word Ratio | Ratio: 33.97%, Deviation: 23.97% | neededClean: false; wordsTotal: 209; wordsDialogue: 71 |
46.8248%
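The dialogue evaluator's figures follow from the two word counts it reports: 71 dialogue words out of 209 total gives a ratio of 33.97%, which is 23.97 percentage points above the 10% target. The sketch below shows one way such a measurement could be made. It is not NC Bench's implementation: treating words as whitespace-separated tokens and dialogue as text between straight double quotes are both assumptions, and `dialogue_stats` is a hypothetical name. The formula that turns a deviation into a percentage score is not shown on this page, so it is left out.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.10) -> dict:
    """Count total and quoted words, then compare the ratio to a target."""
    # Assumption: a "word" is any whitespace-separated token.
    words_total = len(text.split())
    # Assumption: dialogue is whatever sits between straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


sample = 'Molly grinned. "Grab the presents," she said. "Now!"'
stats = dialogue_stats(sample)
print(stats)

# Reproducing the reported percentages from the counts on this page.
reported_ratio = round(71 / 209 * 100, 2)
reported_deviation = round(reported_ratio - 10.0, 2)
print(reported_ratio, reported_deviation)
```

The final 46.8248% also equals the mean of the two evaluator results (93.6496% and 0.0000%), so it appears to be an unweighted average, although the page does not say so.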