Model

Claude 3.0 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2024-07-19-01J35SSMW4P73K4NHE7MNJD7DW

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly surveyed the bouncy castle and candy-colored streamers with a grimace. "This is too much, even for a couple of hardened thieves like us." Bill shrugged, unfazed. "Hey, you wanted a challenge. Besides, what kid doesn't love a clown robbery?" "Quiet, you'll blow our cover!" Molly hissed. "And lose the red nose, would ya?" "But it's part of my charm." Bill batted his lashes, grinning mischievously. "So, what's the plan?" Molly rolled her eyes. "We wait for the cake-cutting, then make our move. You distract the parents with your, uh, 'charm,' while I nab the loot." "Gotcha." Bill saluted. "Operation Sticky Fingers is a go." As the first notes of "Happy Birthday" rang out, Molly crept towards the gift table, her nimble fingers working their magic. Just then, a high-pitched shriek pierced the air. "Look, Mommy! A clown!" Bill waved cheerily, his mouth stuffed with pilfered cupcakes. "Who wants a magic trick?" Molly groaned, her sack bulging with ill-gotten gains. "I'm getting too old for this."
id"gen-6maXoJgag3wbPAw7BNi9552Ernr9" 
total_cost0.004815
upstream_id"msg_019LCApAo9MhSmatFwkSgNGM" 
created_at"2024-07-19T15:29:38.78292+00:00" Friday, July 19th, 2024 at 3:29:38 PM GMT+00:00
model"anthropic/claude-3-sonnet:beta" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Anthropic" 
latency7102
moderation_latency(null)
generation_time0
finish_reason"end_turn" 
tokens_prompt68
tokens_completion260
native_tokens_prompt75
native_tokens_completion306
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.004815

Evaluation details

Result Evaluator Details Meta Data
0.0304% Matches word count n/a
neededCleanfalse
words170
88.7160% Dialogue to Total Word Ratio
Ratio: 55.88%, Deviation: 5.88%
neededCleanfalse
wordsTotal170
wordsDialogue95
44.3732%