NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-19-01J35SKPW1QBAAG0XKQ7Z062H9
Run Details
Model:
Claude 3.5 Sonnet
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
14.4s
Cost:
$0.004890
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
75.16%
Matches word count
(details)
17.42%
Dialogue to Total Word Ratio
(details)
46.2893%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled behind a dumpster, their eyes fixed on the brightly decorated house across the street. Children's laughter and music drifted through the air. "You sure about this, Molly? Stealing from kids seems low, even for us," Bill whispered. Molly rolled her eyes. "Grow a spine, Bill. Those brats have too much stuff anyway. We're doing them a favor." "If you say so," Bill mumbled, fidgeting with his ski mask. "Alright, here's the plan. I'll distract the parents while you sneak in through the back door." "And then what?" "Then you grab as many presents as you can carry and meet me at the getaway car." Bill nodded nervously. "What if I get caught?" "Don't get caught," Molly hissed. "Just act natural. Pretend you're somebody's uncle or something." "Whose uncle? I don't even know these kids!" "Figure it out, Bill! This isn't rocket science." Bill took a deep breath. "Okay, okay. I'm ready." "Good. Now let's go ruin some children's birthday party." With a mischievous grin, Molly stood up and sauntered towards the house, leaving Bill to mutter a quiet prayer before following her lead.
id
"gen-qWoLQIB3Q1pTiO5Q0OfTNxYrSKn3"
total_cost
0.00489
upstream_id
"msg_015pADpnkKe3J9oHJBvBG6H6"
created_at
"2024-07-19T15:26:31.055443+00:00"
7/19/2024, 3:26:31 PM
model
"anthropic/claude-3.5-sonnet:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
14341
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
266
native_tokens_prompt
75
native_tokens_completion
311
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00489
Evaluation details
Result
Evaluator
Details
Meta Data
75.16%
Matches word count
n/a
neededClean
false
words
187
17.42%
Dialogue to Total Word Ratio
Ratio: 61.50%, Deviation: 11.50%
neededClean
false
wordsTotal
187
wordsDialogue
115
46.2893%