Model

Claude 3.0 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 10% dialogue

Run ID

2024-07-19-01J35T9KHYEBAHPKK7S0BMBCZB

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The dim alleyway cast flickering shadows over Molly and Bill's furtive figures, huddled together like co-conspirators in a noir thriller. Molly's eyes glinted with mischief as she leaned in, her voice a hushed whisper, "Are you ready for the score of a lifetime, Bill?" Bill's brow furrowed, his gaze darting around nervously. "A children's party? Molly, that's a new low, even for us." "Oh, come on," she scoffed, adjusting her black beanie. "It's the perfect target – a room full of unsuspecting rugrats, drowning in a sea of shiny loot." "But the wailing," Bill shuddered. "The tears, the tantrums…" Molly waved a dismissive hand. "Piece of cake. We'll be in and out before the first whine erupts. Besides, what's a few traumatized kiddies compared to the haul we'll rake in?" Bill hesitated, then a sly grin crept across his face. "Well, when you put it like that… Lead the way, partner." With a conspiratorial nod, they slipped into the shadows, ready to crash the party and make off with their ill-gotten gains – a heist that would undoubtedly go down in infamy, or at least provide a good chuckle at the next thieves' convention.
id: "gen-NnBChmeMpqjKAX6GY6rmdy7Mt05O"
total_cost: 0.005205
upstream_id: "msg_016Bn5gKo5V84wVnwxjN5g2Q"
created_at: "2024-07-19T15:38:21.502372+00:00" (Friday, July 19th, 2024 at 3:38:21 PM GMT+00:00)
model: "anthropic/claude-3-sonnet:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 7301
moderation_latency: null
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 285
native_tokens_prompt: 75
native_tokens_completion: 332
num_media_prompt: null
num_media_completion: null
origin: "https://ncbench.com/"
usage: 0.005205

Evaluation details

Result | Evaluator | Details | Meta Data
97.6276% | Matches word count | n/a | neededClean: false, words: 193
0.0000% | Dialogue to Total Word Ratio | Ratio: 45.08%, Deviation: 35.08% | neededClean: false, wordsTotal: 193, wordsDialogue: 87
48.8138%
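
The dialogue figures above can be checked by hand: 87 dialogue words out of 193 total gives the reported ratio, and subtracting the 10% target gives the reported deviation. The sketch below shows one way such a check might be computed. The function name `dialogue_stats`, the whitespace-based word count, and the rule that dialogue is any text between straight double quotes are all assumptions for illustration; the evaluator's actual counting rules are not shown on this page and may differ.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.10) -> dict:
    """Count total words and words inside straight double quotes (hypothetical helper)."""
    # Assumption: a "word" is any whitespace-separated token.
    words_total = len(text.split())
    # Assumption: dialogue is whatever sits between pairs of straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "words_total": words_total,
        "words_dialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


# Reproduce the reported figures from the reported counts (87 of 193 words).
reported_ratio = 87 / 193
print(f"Ratio: {reported_ratio:.2%}")                  # Ratio: 45.08%
print(f"Deviation: {abs(reported_ratio - 0.10):.2%}")  # Deviation: 35.08%

# The unlabeled final figure equals the mean of the two evaluator results.
print(f"{(97.6276 + 0.0) / 2:.4f}%")                   # 48.8138%
```

Calling `dialogue_stats` on the model response would return counts under these assumed rules, which need not match the 193 and 87 reported here. How the 97.6276% word-count score is derived from 193 words against a 200-word target is not stated on the page, so it is not reproduced.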