Model

Claude 3.5 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 10% dialogue

Run ID

2024-07-19-01J35SPW7VGGJ5VCR6ZZ9XQXTZ

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly and Bill huddled in their dimly lit apartment, surrounded by blueprints and party supply catalogs. The air was thick with tension and the smell of cheap coffee. "You sure about this, Molly?" Bill asked, his voice trembling slightly. Molly rolled her eyes. "It's foolproof. Kids, cake, and a mountain of presents. What could go wrong?" Bill nervously twirled a party blower between his fingers. "I don't know. Something about stealing from children feels... wrong." "Grow a spine, Bill," Molly snapped. "We're not stealing from children. We're stealing from their parents." She spread out a crudely drawn map of the party venue on the coffee table, knocking over empty energy drink cans in the process. Bill leaned in, squinting at the scribbles. "So, we enter through the bounce house?" he asked incredulously. Molly nodded, a mischievous glint in her eye. "Exactly. No one will suspect a thing. We'll blend right in with the clowns." Bill sighed, resigned to his fate. As Molly continued outlining their ridiculous plan, he couldn't help but wonder how his life had come to this – plotting to rob a bunch of sugar-high six-year-olds.
id"gen-FM2xmickBemTt1fOEd5XtFvJcyvS" 
total_cost0.00477
upstream_id"msg_014MnRLqo6CCFg1KTCPAXgqS" 
created_at"2024-07-19T15:28:13.196759+00:00" Friday, July 19th, 2024 at 3:28:13 PM GMT+00:00
model"anthropic/claude-3.5-sonnet:beta" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Anthropic" 
latency12709
moderation_latency(null)
generation_time0
finish_reason"end_turn" 
tokens_prompt68
tokens_completion261
native_tokens_prompt75
native_tokens_completion303
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.00477

Evaluation details

Result Evaluator Details Meta Data
90.4837% Matches word count n/a
neededCleanfalse
words190
0.0000% Dialogue to Total Word Ratio
Ratio: 33.16%, Deviation: 23.16%
neededCleanfalse
wordsTotal190
wordsDialogue63
45.2419%