NC Bench
Model
Claude 3.0 Sonnet
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35SSD42E3KJPXHFM30GP17K
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
AI
"Alright, Bill, this is it," Molly whispered, her eyes sparkling with mischief. "The biggest score of our careers – a children's party." Bill scratched his head. "You sure about this, Molly? Seems a bit low-stakes, if you ask me." "Low-stakes?" Molly scoffed. "Have you seen the guest list? Half of those brats come from obscenely wealthy families." "Fair point," Bill conceded. "But won't it be crawling with parents and security?" "That's where you come in, my friend," Molly said, slapping him on the back. "You'll be our distraction – dress up as a clown and keep them entertained while I sneak in and nab the loot." Bill grimaced. "A clown? Really, Molly? You know how I feel about those things." "Oh, don't be such a baby," Molly chided. "Just think of all the toys, gadgets, and cold hard cash we'll be walking away with." Bill sighed heavily. "Alright, alright. But if I end up with nightmares about killer clowns, you're paying for my therapy." Molly grinned. "Deal. Now, let's go make some kids' dreams come true – by crushing them mercilessly and stealing their presents."
Meta
id
"gen-QZnBMV0jpfNZZnrrOXvfNzEAWFb8"
total_cost
0.00498
upstream_id
"msg_014qLquYFfaf27gGofr8wgvq"
created_at
"2024-07-19T15:29:30.481836+00:00"
Friday, July 19th, 2024 at 3:29:30 PM GMT+00:00
model
"anthropic/claude-3-sonnet:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
7147
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
263
native_tokens_prompt
75
native_tokens_completion
317
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00498
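The reported total_cost can be reproduced from the native token counts. The sketch below is a sanity check, not the billing logic of the provider: the per-token rates (3 USD per million input tokens, 15 USD per million output tokens) are an assumption that does not appear on this page, and the function name estimate_cost is hypothetical.

```python
# Assumed list prices in USD per million tokens (not stated on this page).
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00


def estimate_cost(native_prompt_tokens: int, native_completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    input_cost = native_prompt_tokens * INPUT_USD_PER_MTOK / 1_000_000
    output_cost = native_completion_tokens * OUTPUT_USD_PER_MTOK / 1_000_000
    return input_cost + output_cost


# native_tokens_prompt = 75, native_tokens_completion = 317 (from the fields above)
cost = estimate_cost(75, 317)
print(f"{cost:.5f}")  # prints 0.00498
```

Under these assumed rates the estimate matches the total_cost and usage fields, which suggests the cost is billed on the native token counts (75 and 317) rather than the normalized ones (68 and 263).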
Evaluation details
Result | Evaluator | Details | Meta Data
51.9255% | Matches word count | n/a | neededClean: false; words: 184
80.2605% | Dialogue to Total Word Ratio | Ratio: 83.15%, Deviation: 6.85% | neededClean: false; wordsTotal: 184; wordsDialogue: 153
66.0930%
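The dialogue figures can be checked with a few lines of arithmetic. The sketch below is one plausible way to measure a dialogue-to-total word ratio, not NC Bench's actual evaluator: it assumes dialogue is any text between straight double quotes and that words are whitespace-separated, and the function name dialogue_ratio is hypothetical. How each evaluator converts a deviation into a percentage score is not shown on this page and is not reproduced here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for straight-quoted dialogue."""
    total = len(text.split())
    # Assumption: everything between a pair of straight double quotes is dialogue.
    spoken = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(segment.split()) for segment in spoken)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


# Reproduce the reported details from the page's own counts.
words_dialogue, words_total, target_pct = 153, 184, 90
ratio_pct = 100 * words_dialogue / words_total
deviation_pct = abs(target_pct - ratio_pct)
print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")
# prints Ratio: 83.15%, Deviation: 6.85%

# The final figure on the page equals the mean of the two evaluator scores.
mean_score = (51.9255 + 80.2605) / 2
print(f"{mean_score:.4f}%")  # prints 66.0930%
```

For example, dialogue_ratio('"Ready, Bill?" Molly asked. "Always," he said.') counts 3 dialogue words out of 7. The counts this sketch produces on the full response may differ from the reported 153 and 184 if the real evaluator tokenizes words or handles quotes differently.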