NC Bench
Overview
About
Tests
Models
Model
Claude 3.5 Sonnet
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35T3AFSTMQM794Z0P1GVBK6
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the dimly lit garage, her eyes darting between Bill and the blueprints spread across a workbench. "So, we're really doing this? Stealing from kids?" Bill shrugged, adjusting his ski mask. "Hey, times are tough. You got a better idea?" "I don't know, maybe rob a bank like normal criminals?" Molly sighed, running a hand through her hair. "Too risky. This is foolproof," Bill insisted, tapping the blueprints. "We slip in during the magician act, grab the loot, and vamoose." Molly raised an eyebrow. "Vamoose? Who says that anymore?" "Focus, Molly. You remember the plan?" She nodded reluctantly. "Yeah, yeah. I distract the clown while you raid the gift table." Bill grinned. "That's my girl. And remember, if anyone asks-" "We're the backup entertainment," Molly finished, rolling her eyes. "This is ridiculous." "It's genius," Bill corrected. "Now, let's go make some kids cry." Molly grabbed her props and followed Bill to the van, muttering, "I can't believe I let you talk me into this." "Trust me," Bill said, starting the engine. "It'll be a piece of cake."
id
"gen-ZzVzsO99jN7EFqC1ffJ1Wqnq9F78"
total_cost
0.00495
upstream_id
"msg_0184ArWEwCKusZapMD6FEWyu"
created_at
"2024-07-19T15:35:05.878205+00:00"
Friday, July 19th, 2024 at 3:35:05 PM GMT+00:00
model
"anthropic/claude-3.5-sonnet:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
17561
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
265
native_tokens_prompt
75
native_tokens_completion
315
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00495
Evaluation details
Result
Evaluator
Details
Meta Data
9.6081%
Matches word count
n/a
neededClean
false
words
178
20.3148%
Dialogue to Total Word Ratio
Ratio: 61.24%, Deviation: 11.24%
neededClean
false
wordsTotal
178
wordsDialogue
109
14.9614%