Model

Claude 2.0

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2024-07-19-01J35T8FE0TXDHY1SQKSCRXTXJ

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Here is a 198 word scene with 50% dialogue between Molly and Bill planning a heist: Molly peeked around the corner, nudging Bill as a stream of children ran past them into the room full of balloons and presents. "There must be at least twenty kids in there. We'll never get away with it with that many potential witnesses." Bill chuckled. "Come on, Moll. They're just little kids. They'll be too busy with cake and games to notice us sneaking presents out the back door." "I don't know, Bill. This seems too risky. Maybe we should try somewhere easier for our first heist, like that old lady's jewellery shop." "Don't chicken out on me now! This is perfect - all those shiny new toys just waiting to be swiped. We'll be in and out in five minutes tops." He rubbed his hands together gleefully. Molly sighed. "Okay, but we better wear disguises so no one recognizes us. I'm not going back to jail over some stupid kiddie presents." "Deal. Now let's go get ourselves some loot!" Bill said as he pulled on a fake mustache and glasses. Molly rolled her eyes but followed him towards the chaos within.
id"gen-fVCgit3Cw0iEV4PpE76xo116ikMN" 
total_cost0.007264
upstream_id"msg_01PthSW9EpV86iMBgCQ5368E" 
created_at"2024-07-19T15:37:54.408119+00:00" Friday, July 19th, 2024 at 3:37:54 PM GMT+00:00
model"anthropic/claude-2.0:beta" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Anthropic" 
latency17206
moderation_latency(null)
generation_time0
finish_reason"end_turn" 
tokens_prompt68
tokens_completion257
native_tokens_prompt74
native_tokens_completion278
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.007264

Evaluation details

Result Evaluator Details Meta Data
27.1658% Matches word count n/a
neededCleantrue
words181
0.0000% Dialogue to Total Word Ratio
Ratio: 69.61%, Deviation: 19.61%
neededCleantrue
wordsTotal181
wordsDialogue126
13.5829%