Model

Claude 3.5 Sonnet

Test

Dialogue tags

Scenario

Write 200 words with 50% dialogue

Run ID

2024-07-19-01J35T9N33M4PGGY6TGBBS3QK8

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly peered through binoculars at the bustling backyard party. "I count at least twenty wrapped boxes, Bill. This is gonna be a goldmine." Bill fidgeted nervously beside her. "I don't know, Mol. Stealing from kids feels wrong." "Wrong? We're professional thieves. Everything we do is wrong." "Yeah, but... it's little Timmy's birthday." Molly lowered the binoculars and glared at her partner. "How do you know the kid's name?" "I, uh, might've read the invitation." "You what? Where'd you get an invitation?" Bill shrugged sheepishly. "I know his mom." Molly groaned. "Great. Just great. Next, you'll tell me you brought a present." Bill's silence was deafening. "You didn't." "It's a really cool remote-controlled car," Bill mumbled. Molly pinched the bridge of her nose. "Okay, new plan. We go in, you give the kid his stupid car, we eat some cake, and then we leave. With empty hands and full stomachs." "Really? Thanks, Mol! You're the best partner ever." "Yeah, yeah. Just remember, next time we're robbing a retirement home. No excuses."
id: "gen-uTc2zSveb2G6mBM63C97WLOdnNfh"
total_cost: 0.004845
upstream_id: "msg_01QdbR1c4AEQY16LuumWuybP"
created_at: "2024-07-19T15:38:31.758575+00:00" (Friday, July 19th, 2024 at 3:38:31 PM GMT+00:00)
model: "anthropic/claude-3.5-sonnet:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 15981
moderation_latency: (null)
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 264
native_tokens_prompt: 75
native_tokens_completion: 308
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.004845

Evaluation details

Result: 0.0848%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 171

Result: 0.0000%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 76.02%, Deviation: 26.02%
Meta Data: neededClean: false, wordsTotal: 171, wordsDialogue: 130
0.0424%
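
The ratio and deviation reported above follow from the recorded counts (130 dialogue words out of 171 total, against the 50% target in the scenario). The Python sketch below reproduces that arithmetic. The function names, the whitespace tokenizer, and the rule that dialogue means text inside straight double quotes are all assumptions made for illustration; the benchmark's actual counting and scoring code is not shown on this page, so counts taken from raw text may differ from the recorded ones.

```python
import re


def count_words(text: str) -> int:
    # Assumption: a word is any whitespace-delimited token.
    return len(text.split())


def dialogue_word_count(text: str) -> int:
    # Assumption: dialogue is whatever sits between straight double quotes.
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    # Returns the dialogue share and its absolute distance from the target.
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation metadata above.
ratio, deviation = dialogue_ratio(130, 171)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 76.02%, Deviation: 26.02%"
```

The counting helpers are kept separate from the ratio calculation so that a different tokenizer or quote convention can be swapped in without touching the arithmetic.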