Model

Claude 3.0 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35SSD42E3KJPXHFM30GP17K

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"Alright, Bill, this is it," Molly whispered, her eyes sparkling with mischief. "The biggest score of our careers – a children's party." Bill scratched his head. "You sure about this, Molly? Seems a bit low-stakes, if you ask me." "Low-stakes?" Molly scoffed. "Have you seen the guest list? Half of those brats come from obscenely wealthy families." "Fair point," Bill conceded. "But won't it be crawling with parents and security?" "That's where you come in, my friend," Molly said, slapping him on the back. "You'll be our distraction – dress up as a clown and keep them entertained while I sneak in and nab the loot." Bill grimaced. "A clown? Really, Molly? You know how I feel about those things." "Oh, don't be such a baby," Molly chided. "Just think of all the toys, gadgets, and cold hard cash we'll be walking away with." Bill sighed heavily. "Alright, alright. But if I end up with nightmares about killer clowns, you're paying for my therapy." Molly grinned. "Deal. Now, let's go make some kids' dreams come true – by crushing them mercilessly and stealing their presents."
id"gen-QZnBMV0jpfNZZnrrOXvfNzEAWFb8" 
total_cost0.00498
upstream_id"msg_014qLquYFfaf27gGofr8wgvq" 
created_at"2024-07-19T15:29:30.481836+00:00" Friday, July 19th, 2024 at 3:29:30 PM GMT+00:00
model"anthropic/claude-3-sonnet:beta" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Anthropic" 
latency7147
moderation_latency(null)
generation_time0
finish_reason"end_turn" 
tokens_prompt68
tokens_completion263
native_tokens_prompt75
native_tokens_completion317
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.00498

Evaluation details

Result Evaluator Details Meta Data
51.9255% Matches word count n/a
neededCleanfalse
words184
80.2605% Dialogue to Total Word Ratio
Ratio: 83.15%, Deviation: 6.85%
neededCleanfalse
wordsTotal184
wordsDialogue153
66.0930%