Model

Claude 3 Haiku

Test

Dialogue tags

Scenario

Write 200 words with 90% dialogue

Run ID

2024-07-19-01J35T0G825YGQQ8HNTZSYR1A7

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"Alright, Molly, here's the plan," Bill said, rubbing his hands together. "We're gonna sneak into that kids' party, grab all the presents, and be out of there before anyone even notices." Molly raised an eyebrow. "Presents, huh? Isn't that a bit... low-brow for us?" Bill scoffed. "Low-brow? Are you kidding? Those parents are loaded. Just imagine all the cash and valuables hidden in those wrapping papers!" "I don't know, Bill. Stealing from a bunch of kids doesn't exactly scream 'professional thief' to me." "Come on, Molly. It's the perfect heist! No one's gonna suspect us, and the payoff will be huge. We need this, you know we do." Molly sighed, conceding defeat. "Alright, fine. But we have to get in and out quickly, no messing around. And I get to pick the next job, deal?" Bill grinned. "Deal. Now, here's what we're gonna do..."
id"gen-wvgdSS1rFwDipqktut7DyssFPbcu" 
total_cost0.000335
upstream_id"msg_01KWyRKfwRgN1KXhRXb1Cuuv" 
created_at"2024-07-19T15:33:18.596618+00:00" Friday, July 19th, 2024 at 3:33:18 PM GMT+00:00
model"anthropic/claude-3-haiku:beta" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Anthropic" 
latency2703
moderation_latency(null)
generation_time0
finish_reason"end_turn" 
tokens_prompt68
tokens_completion215
native_tokens_prompt75
native_tokens_completion253
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.000335

Evaluation details

Result Evaluator Details Meta Data
0.0000% Matches word count n/a
neededCleanfalse
words146
99.7063% Dialogue to Total Word Ratio
Ratio: 87.67%, Deviation: 2.33%
neededCleanfalse
wordsTotal146
wordsDialogue128
49.8532%