Model

Claude 3.0 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2024-07-19-01J35T0STTJW9Q0H6B6WSKPJSF

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"Listen up, Bill," Molly whispered, her eyes gleaming with mischief. "This caper is gonna be a piece of cake." Bill scratched his head, a skeptical frown etching his features. "A children's party? Aren't we sinking a little low here, Molly?" Molly waved a dismissive hand. "Come on, where's your sense of adventure? Think of the loot! Those little brats are bound to score some primo gifts." "I dunno," Bill muttered, shifting uneasily. "Stealing from kids just feels wrong." "Wrong?" Molly scoffed. "It's an art form, my friend. Besides, those spoiled brats won't even notice a few missing trinkets." Bill sighed, resigned to his partner's audacious plans. "Alright, alright. But if we get caught, I'm blaming you." Molly grinned, slapping Bill on the back. "That's the spirit! Now, here's the plan. We'll sneak in during the chaos of the piñata smash..." Their hushed voices trailed off as they huddled together, plotting the perfect heist amidst the laughter and joy of the unsuspecting children.
id: "gen-lWHuS7weHQ5tGefxS8iejYzXqxoM"
total_cost: 0.004575
upstream_id: "msg_018etJhUN4VX2jbkkYpEJLtT"
created_at: "2024-07-19T15:33:31.995829+00:00" (Friday, July 19th, 2024 at 3:33:31 PM GMT+00:00)
model: "anthropic/claude-3-sonnet:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 6287
moderation_latency: (null)
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 243
native_tokens_prompt: 75
native_tokens_completion: 290
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.004575

Evaluation details

Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 162
38.6152% | Dialogue to Total Word Ratio | Ratio: 59.88%, Deviation: 9.88% | neededClean: false, wordsTotal: 162, wordsDialogue: 97
19.3076%
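
The reported figures can be cross-checked with a little arithmetic: 97 dialogue words out of 162 total gives the 59.88% ratio, which sits 9.88 points from the 50% target, and 19.3076% equals the mean of the two evaluator results. The sketch below is illustrative only. The helper names `dialogue_stats` and `deviation_from_target` are hypothetical, the rule "a dialogue word is any whitespace-separated token inside double quotes" is an assumption, and the benchmark's real tokenisation and the function that maps a deviation to the 38.6152% result are not stated in this record.

```python
import re


def dialogue_stats(text: str) -> dict:
    """Count total words and words inside double-quoted spans.

    Assumption: words are whitespace-separated tokens and dialogue is
    whatever sits between a pair of double quotes. The benchmark's own
    counting rules may differ.
    """
    # Normalise curly quotes so both quote styles are handled the same way.
    text = text.replace("\u201c", '"').replace("\u201d", '"')
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_total = len(text.split())
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
    }


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the observed ratio and the target ratio."""
    return abs(ratio - target)


# Small made-up sample, not taken from the run above.
sample = '"Grab the bag," Molly said. Bill nodded. "On it."'
stats = dialogue_stats(sample)

# Cross-check of the figures reported in the evaluation details.
reported_ratio = 97 / 162
reported_ratio_pct = round(reported_ratio * 100, 2)
reported_deviation_pct = round(deviation_from_target(reported_ratio) * 100, 2)
mean_of_results = round((0.0 + 38.6152) / 2, 4)

print(stats)
print(reported_ratio_pct, reported_deviation_pct, mean_of_results)
```

Running the same counter over the model response above may not reproduce 162 and 97 exactly, since punctuation handling and any cleaning step (see neededClean) are unknown here.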