Model

Claude 3.0 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35T4NAMVMQXJMEPE05SJ9P3

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"Are you sure about this, Molly? Robbing a kid's party seems a bit low, even for us," Bill said, his brow furrowed with concern. Molly waved her hand dismissively. "Relax, Bill. It's the perfect score! Those fancy parents will be too busy sipping their overpriced wines to notice a thing." "But what if we get caught? Can you imagine the headlines? 'Dastardly Duo Steals Joy from Innocent Children'? Our reputations would be ruined!" "You worry too much," Molly chuckled. "We'll be in and out before they even cut the cake. Just think of all those delightful toys and gadgets waiting for us!" Bill rubbed the back of his neck. "I don't know, Molly. It just feels wrong, you know?" "Wrong?" Molly scoffed. "It's not like we're stealing from the kids directly. We're just... redistributing their wealth." "Redistributing their wealth?" Bill repeated incredulously. "Is that what we're calling it now?" "Look, if you're too chicken to go through with it, I'll do it myself," Molly snapped, crossing her arms defiantly. Bill sighed heavily. "Alright, alright. But if we get caught, I'm blaming you entirely." Molly grinned mischievously. "That's the spirit! Now, let's go make some kids cry."
id: "gen-QKJx6vMJCHSaRnT3wqwqbOFfglQ2"
total_cost: 0.0054
upstream_id: "msg_01S5oCYAiqMLLSn7sAQsxTyS"
created_at: "2024-07-19T15:35:40.458872+00:00" (Friday, July 19th, 2024 at 3:35:40 PM GMT+00:00)
model: "anthropic/claude-3-sonnet:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 7140
moderation_latency: (null)
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 292
native_tokens_prompt: 75
native_tokens_completion: 345
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.0054

Evaluation details

Result | Evaluator | Details | Meta Data
99.7443% | Matches word count | n/a | neededClean: false, words: 196
45.9926% | Dialogue to Total Word Ratio | Ratio: 80.61%, Deviation: 9.39% | neededClean: false, wordsTotal: 196, wordsDialogue: 158
72.8685%
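
The ratio and deviation figures above can be reproduced from the reported word counts. The following is a minimal sketch, not the benchmark's actual evaluator: the function names `count_words` and `ratio_and_deviation` are hypothetical, and it assumes that words are whitespace-separated tokens, that dialogue is any text between double quotation marks, and that deviation is the absolute gap from the 90% target in percentage points. The formula that converts a deviation into the Result percentage is not shown on this page, so it is not modelled here.

```python
import re


def count_words(text: str) -> tuple[int, int]:
    """Return (total words, words inside double quotes).

    Assumption: a word is any whitespace-separated token, and dialogue is
    whatever sits between a pair of double quotation marks.
    """
    # Treat curly double quotes the same as straight ones.
    normalised = text.replace("\u201c", '"').replace("\u201d", '"')
    words_total = len(normalised.split())
    spoken_spans = re.findall(r'"([^"]*)"', normalised)
    words_dialogue = sum(len(span.split()) for span in spoken_spans)
    return words_total, words_dialogue


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.90) -> tuple[float, float]:
    """Return (dialogue ratio, deviation from target), both in percent."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return round(ratio * 100, 2), round(abs(target - ratio) * 100, 2)


if __name__ == "__main__":
    # Counts reported in the Meta Data column: wordsDialogue 158, wordsTotal 196.
    print(ratio_and_deviation(158, 196))
```

With the reported counts, 158 / 196 gives a ratio of 80.61% and a deviation of 9.39 points from the 90% target, matching the Details column. The final 72.8685% is numerically consistent with the mean of the two Result values, although the page does not state how it is derived.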