Model

Claude 3.5 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2024-07-19-01J35SKPW1QBAAG0XKQ7Z062H9

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly and Bill huddled behind a dumpster, their eyes fixed on the brightly decorated house across the street. Children's laughter and music drifted through the air. "You sure about this, Molly? Stealing from kids seems low, even for us," Bill whispered. Molly rolled her eyes. "Grow a spine, Bill. Those brats have too much stuff anyway. We're doing them a favor." "If you say so," Bill mumbled, fidgeting with his ski mask. "Alright, here's the plan. I'll distract the parents while you sneak in through the back door." "And then what?" "Then you grab as many presents as you can carry and meet me at the getaway car." Bill nodded nervously. "What if I get caught?" "Don't get caught," Molly hissed. "Just act natural. Pretend you're somebody's uncle or something." "Whose uncle? I don't even know these kids!" "Figure it out, Bill! This isn't rocket science." Bill took a deep breath. "Okay, okay. I'm ready." "Good. Now let's go ruin some children's birthday party." With a mischievous grin, Molly stood up and sauntered towards the house, leaving Bill to mutter a quiet prayer before following her lead.
id"gen-qWoLQIB3Q1pTiO5Q0OfTNxYrSKn3" 
total_cost0.00489
upstream_id"msg_015pADpnkKe3J9oHJBvBG6H6" 
created_at"2024-07-19T15:26:31.055443+00:00" Friday, July 19th, 2024 at 3:26:31 PM GMT+00:00
model"anthropic/claude-3.5-sonnet:beta" 
app_id182717
streamedfalse
cancelledfalse
provider_name"Anthropic" 
latency14341
moderation_latency(null)
generation_time0
finish_reason"end_turn" 
tokens_prompt68
tokens_completion266
native_tokens_prompt75
native_tokens_completion311
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.00489

Evaluation details

Result Evaluator Details Meta Data
75.1556% Matches word count n/a
neededCleanfalse
words187
17.4230% Dialogue to Total Word Ratio
Ratio: 61.50%, Deviation: 11.50%
neededCleanfalse
wordsTotal187
wordsDialogue115
46.2893%