NC Bench
Overview
About
Tests
Models
Model
Claude 3.0 Sonnet
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35T4NAMVMQXJMEPE05SJ9P3
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Are you sure about this, Molly? Robbing a kid's party seems a bit low, even for us," Bill said, his brow furrowed with concern. Molly waved her hand dismissively. "Relax, Bill. It's the perfect score! Those fancy parents will be too busy sipping their overpriced wines to notice a thing." "But what if we get caught? Can you imagine the headlines? 'Dastardly Duo Steals Joy from Innocent Children'? Our reputations would be ruined!" "You worry too much," Molly chuckled. "We'll be in and out before they even cut the cake. Just think of all those delightful toys and gadgets waiting for us!" Bill rubbed the back of his neck. "I don't know, Molly. It just feels wrong, you know?" "Wrong?" Molly scoffed. "It's not like we're stealing from the kids directly. We're just... redistributing their wealth." "Redistributing their wealth?" Bill repeated incredulously. "Is that what we're calling it now?" "Look, if you're too chicken to go through with it, I'll do it myself," Molly snapped, crossing her arms defiantly. Bill sighed heavily. "Alright, alright. But if we get caught, I'm blaming you entirely." Molly grinned mischievously. "That's the spirit! Now, let's go make some kids cry."
id
"gen-QKJx6vMJCHSaRnT3wqwqbOFfglQ2"
total_cost
0.0054
upstream_id
"msg_01S5oCYAiqMLLSn7sAQsxTyS"
created_at
"2024-07-19T15:35:40.458872+00:00"
Friday, July 19th, 2024 at 3:35:40 PM GMT+00:00
model
"anthropic/claude-3-sonnet:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
7140
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
292
native_tokens_prompt
75
native_tokens_completion
345
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0054
Evaluation details
Result
Evaluator
Details
Meta Data
99.7443%
Matches word count
n/a
neededClean
false
words
196
45.9926%
Dialogue to Total Word Ratio
Ratio: 80.61%, Deviation: 9.39%
neededClean
false
wordsTotal
196
wordsDialogue
158
72.8685%