Run: 2024-07-19-01J35T3AFSTMQM794Z0P1GVBK6

Run Details

Model:: Claude 3.5 Sonnet
Test:: Dialogue tags
Scenario:: Write 200 words with 50% dialogue
Duration:: 17.7s
Cost:: $0.004950
Provider:: Anthropic

Evaluation summary

Score	Evaluator
9.61%	Matches word count
20.31%	Dialogue to Total Word Ratio
14.9614%	Overall
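
The figures in this summary can be cross-checked with a little arithmetic. The sketch below recomputes the dialogue ratio and its deviation from the word counts reported in the evaluation details (178 total words, 109 dialogue words) and the 50% target from the scenario. It also assumes that Overall is the unweighted mean of the two evaluator scores. That assumption is not stated anywhere in the report; it is only consistent with the displayed numbers. How each evaluator maps its measurement to a percentage score (for example, how an 11.24% deviation becomes 20.31%) is not documented here, so that step is not reproduced.

```python
# Values copied from this run's report.
words_total = 178
words_dialogue = 109
target_ratio_pct = 50.0  # from the scenario: "50% dialogue"

# Share of words that are dialogue, and its distance from the target.
ratio_pct = 100.0 * words_dialogue / words_total
deviation_pct = abs(ratio_pct - target_ratio_pct)

# Assumption (not stated in the report): Overall is the plain mean of the
# evaluator scores. The inputs here are the rounded, displayed scores.
evaluator_scores_pct = [9.61, 20.31]
overall_pct = sum(evaluator_scores_pct) / len(evaluator_scores_pct)

print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")
print(f"Overall (mean of displayed scores): {overall_pct:.2f}%")
```

The mean of the displayed scores comes to 14.96%, which agrees with the reported 14.9614% to two decimal places; the extra digits in the report presumably come from unrounded evaluator scores.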

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly paced the dimly lit garage, her eyes darting between Bill and the blueprints spread across a workbench. "So, we're really doing this? Stealing from kids?"

Bill shrugged, adjusting his ski mask. "Hey, times are tough. You got a better idea?"

"I don't know, maybe rob a bank like normal criminals?" Molly sighed, running a hand through her hair.

"Too risky. This is foolproof," Bill insisted, tapping the blueprints. "We slip in during the magician act, grab the loot, and vamoose."

Molly raised an eyebrow. "Vamoose? Who says that anymore?"

"Focus, Molly. You remember the plan?"

She nodded reluctantly. "Yeah, yeah. I distract the clown while you raid the gift table."

Bill grinned. "That's my girl. And remember, if anyone asks-"

"We're the backup entertainment," Molly finished, rolling her eyes. "This is ridiculous."

"It's genius," Bill corrected. "Now, let's go make some kids cry."

Molly grabbed her props and followed Bill to the van, muttering, "I can't believe I let you talk me into this."

"Trust me," Bill said, starting the engine. "It'll be a piece of cake."

Evaluation details

Result	Evaluator	Details	Meta Data
9.61%	Matches word count	n/a	neededClean: false, words: 178
20.31%	Dialogue to Total Word Ratio	Ratio: 61.24%, Deviation: 11.24%	neededClean: false, wordsTotal: 178, wordsDialogue: 109

14.9614%

id	"gen-ZzVzsO99jN7EFqC1ffJ1Wqnq9F78"
total_cost	0.00495
upstream_id	"msg_0184ArWEwCKusZapMD6FEWyu"
created_at	"2024-07-19T15:35:05.878205+00:00" 7/19/2024, 3:35:05 PM
model	"anthropic/claude-3.5-sonnet:beta"
app_id	182717
streamed	false
cancelled	false
provider_name	"Anthropic"
latency	17561
moderation_latency	(null)
generation_time	0
finish_reason	"end_turn"
tokens_prompt	68
tokens_completion	265
native_tokens_prompt	75
native_tokens_completion	315
num_media_prompt	(null)
num_media_completion	(null)
origin	"https://ncbench.com/"
usage	0.00495
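
The reported cost can be reconstructed from the native token counts above. The sketch below assumes list prices of $3 per million input tokens and $15 per million output tokens for this model. Those prices do not appear in the report and are an assumption here, although they do reproduce the reported `total_cost` exactly.

```python
# Assumed prices in USD per million tokens (not part of the report).
PRICE_PER_MTOK_INPUT_USD = 3
PRICE_PER_MTOK_OUTPUT_USD = 15

# Native token counts copied from the run metadata.
native_tokens_prompt = 75
native_tokens_completion = 315

# Work in integer micro-dollars to avoid floating-point drift.
cost_micro_usd = (
    native_tokens_prompt * PRICE_PER_MTOK_INPUT_USD
    + native_tokens_completion * PRICE_PER_MTOK_OUTPUT_USD
)
cost_usd = cost_micro_usd / 1_000_000

print(f"Estimated cost: ${cost_usd:.6f}")
```

Under these assumed prices the estimate is $0.004950, the same as the `total_cost` and `usage` values. The match holds only with the native token counts (75 and 315), not with `tokens_prompt` and `tokens_completion` (68 and 265), which suggests billing follows the provider's own tokenizer.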