NC Bench
Model: Claude 3.5 Sonnet
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Run ID: 2024-07-19-01J35SS4PQJ4BDR2DST1V604Q6
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly and Bill huddled behind a dumpster, eyeing the colorful balloons tied to the mailbox across the street. The sound of children's laughter drifted through the air. "You sure about this, Molly?" Bill whispered, fidgeting with his ski mask. "Of course I'm sure. It's foolproof," Molly replied, adjusting her own mask. "Kids are distracted, parents are drunk on punch. Easy pickings." Bill frowned. "But stealing from children? Seems a bit... low." "Low? It's brilliant! Think about it – brand new toys, still in the boxes. We'll make a fortune!" "I don't know... What if we get caught?" Molly rolled her eyes. "By who? A bunch of sugar-high six-year-olds?" "Good point," Bill conceded. "So what's the plan?" "Simple. We sneak in through the back, grab the gifts, and make our escape before they even cut the cake." "And if someone spots us?" "We'll say we're the entertainment. Now come on, let's go before all the good stuff is opened." As they crept towards the house, Bill muttered, "I can't believe I'm doing this. My mother would be so disappointed." "Less talking, more stealing," Molly hissed, leading the way.
id: "gen-ZC5OdBvgJoJYVM3tZPlJCuh0kw68"
total_cost: 0.00498
upstream_id: "msg_01SwqWJg7nxr9B66wtaFaPYY"
created_at: "2024-07-19T15:29:31.939463+00:00" (Friday, July 19th, 2024 at 3:29:31 PM GMT+00:00)
model: "anthropic/claude-3.5-sonnet:beta"
app_id: 182717
streamed: false
cancelled: false
provider_name: "Anthropic"
latency: 17230
moderation_latency: (null)
generation_time: 0
finish_reason: "end_turn"
tokens_prompt: 68
tokens_completion: 269
native_tokens_prompt: 75
native_tokens_completion: 317
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.00498
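The reported total_cost can be sanity-checked against the native token counts. The sketch below assumes per-token prices of $3 per million input tokens and $15 per million output tokens for Claude 3.5 Sonnet; those prices and the function name `estimate_cost` are assumptions for illustration and do not appear in the run record.

```python
from decimal import Decimal

# Assumed list prices in USD per million tokens (not stated in the run record).
INPUT_PRICE_PER_MTOK = Decimal("3.00")
OUTPUT_PRICE_PER_MTOK = Decimal("15.00")


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    million = Decimal(1_000_000)
    return (
        prompt_tokens * INPUT_PRICE_PER_MTOK
        + completion_tokens * OUTPUT_PRICE_PER_MTOK
    ) / million


# native_tokens_prompt and native_tokens_completion from the metadata above
native_cost = estimate_cost(75, 317)
# tokens_prompt and tokens_completion, for comparison
normalized_cost = estimate_cost(68, 269)

print(native_cost, normalized_cost)
```

Under these assumed prices, the native counts (75 and 317) give 0.00498, the same value as total_cost and usage, while the tokens_prompt and tokens_completion counts (68 and 269) give a lower figure. That suggests, but does not prove, that billing follows the native token counts.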
Evaluation details

Result: 86.3803%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false; words: 189

Result: 0.0052%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 67.72%, Deviation: 17.72%
Meta Data: neededClean: false; wordsTotal: 189; wordsDialogue: 128
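The ratio and deviation reported by the Dialogue to Total Word Ratio evaluator follow directly from the two word counts: 128 / 189 is about 67.72%, which is 17.72 percentage points away from the 50% target. The sketch below reproduces that arithmetic and shows one possible way to count dialogue words, by treating text inside straight double quotes as dialogue. The counting rule and all function names are assumptions; the benchmark's actual tokenization and scoring formula are not shown on this page.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated words in the whole text."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words that sit inside straight double quotes (assumed rule)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in quoted_spans)


def ratio_and_deviation(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counting on a single sentence taken from the model response
sample = '"You sure about this, Molly?" Bill whispered, fidgeting with his ski mask.'
print(count_dialogue_words(sample), count_words(sample))

# Figures reported by the evaluator: wordsDialogue = 128, wordsTotal = 189
ratio, deviation = ratio_and_deviation(128, 189)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

How the 17.72% deviation is mapped to the 0.0052% result is not stated, so the sketch stops at the ratio and deviation.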
43.1928%
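The closing figure of 43.1928% is consistent with the unweighted mean of the two evaluator results, 86.3803% and 0.0052%. The short check below illustrates this; that the benchmark aggregates results this way is an inference from the numbers, not something the page states, and `overall_score` is a hypothetical name.

```python
def overall_score(results: list[float]) -> float:
    """Unweighted mean of per-evaluator results (assumed aggregation rule)."""
    return sum(results) / len(results)


# Per-evaluator results, in percent, as listed in the evaluation details
evaluator_results = [86.3803, 0.0052]
score = overall_score(evaluator_results)
print(f"{score:.5f}%")
```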