NC Bench
Model: Claude 3.5 Sonnet
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Run ID: 2024-07-19-01J35SPW7VGGJ5VCR6ZZ9XQXTZ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled in their dimly lit apartment, surrounded by blueprints and party supply catalogs. The air was thick with tension and the smell of cheap coffee. "You sure about this, Molly?" Bill asked, his voice trembling slightly. Molly rolled her eyes. "It's foolproof. Kids, cake, and a mountain of presents. What could go wrong?" Bill nervously twirled a party blower between his fingers. "I don't know. Something about stealing from children feels... wrong." "Grow a spine, Bill," Molly snapped. "We're not stealing from children. We're stealing from their parents." She spread out a crudely drawn map of the party venue on the coffee table, knocking over empty energy drink cans in the process. Bill leaned in, squinting at the scribbles. "So, we enter through the bounce house?" he asked incredulously. Molly nodded, a mischievous glint in her eye. "Exactly. No one will suspect a thing. We'll blend right in with the clowns." Bill sighed, resigned to his fate. As Molly continued outlining their ridiculous plan, he couldn't help but wonder how his life had come to this – plotting to rob a bunch of sugar-high six-year-olds.
id: "gen-FM2xmickBemTt1fOEd5XtFvJcyvS"
total_cost: 0.00477
upstream_id: "msg_014MnRLqo6CCFg1KTCPAXgqS"
created_at: "2024-07-19T15:28:13.196759+00:00" (Friday, July 19th, 2024 at 3:28:13 PM GMT+00:00)
model: "anthropic/claude-3.5-sonnet:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 12709
moderation_latency: (null)
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 261
native_tokens_prompt: 75
native_tokens_completion: 303
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.00477
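As a sanity check on the reported total_cost, the figure can be reproduced from the native token counts. This is a minimal sketch that assumes Claude 3.5 Sonnet list prices of $3 per million input tokens and $15 per million output tokens; those rates, and the names `estimate_cost`, `INPUT_USD_PER_MTOK`, and `OUTPUT_USD_PER_MTOK`, are not taken from this page.

```python
# Assumed list prices in USD per million tokens (not stated on this page).
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00


def estimate_cost(native_prompt_tokens: int, native_completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    input_cost = native_prompt_tokens * INPUT_USD_PER_MTOK / 1_000_000
    output_cost = native_completion_tokens * OUTPUT_USD_PER_MTOK / 1_000_000
    return input_cost + output_cost


# native_tokens_prompt = 75, native_tokens_completion = 303 (from the metadata above)
cost = estimate_cost(75, 303)
print(f"{cost:.5f}")  # prints 0.00477
```

Under those assumed rates, the native counts (75 and 303) reproduce the reported 0.00477, which suggests that billing follows the native token counts rather than the normalized tokens_prompt and tokens_completion values.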
Evaluation details
Result | Evaluator | Details | Meta Data
90.4837% | Matches word count | n/a | neededClean: false, words: 190
0.0000% | Dialogue to Total Word Ratio | Ratio: 33.16%, Deviation: 23.16% | neededClean: false, wordsTotal: 190, wordsDialogue: 63
45.2419%
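The ratio and deviation reported by the Dialogue to Total Word Ratio evaluator follow directly from the two word counts: 63 / 190 is about 33.16%, which is 23.16 percentage points away from the 10% target in the scenario. The sketch below shows one way such a check could be computed. It is an assumption, not NC Bench's actual evaluator: the function names are hypothetical, dialogue is taken to be any text between straight double quotes, and words are split on whitespace.

```python
import re


def ratio_and_deviation(words_dialogue: int, words_total: int, target_pct: float):
    """Return (dialogue ratio in %, absolute deviation from the target in points)."""
    ratio = 100.0 * words_dialogue / words_total if words_total else 0.0
    return ratio, abs(ratio - target_pct)


def dialogue_stats(text: str, target_pct: float) -> dict:
    """Count total and quoted words in a scene (assumed counting rules)."""
    # Assumption: dialogue is whatever sits between straight double quotes.
    spoken = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(s.split()) for s in spoken)
    words_total = len(text.split())
    ratio, deviation = ratio_and_deviation(words_dialogue, words_total, target_pct)
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio_pct": ratio,
        "deviation_pct": deviation,
    }


# Reproduce the reported figures from the counts in the table above.
ratio, deviation = ratio_and_deviation(63, 190, target_pct=10.0)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints Ratio: 33.16%, Deviation: 23.16%
```

Calling `dialogue_stats(scene_text, 10.0)` on a scene gives the same fields from raw text, though the counts may differ from the evaluator's if it tokenizes words or detects quotes differently.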