NC Bench
Overview
About
Tests
Models
Model
Claude 3 Haiku
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2024-07-19-01J35TE05WTY53HW5XWCM4VBFK
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned in, the mischievous glint in her eyes betraying her excitement. "Alright, Bill, here's the plan. We're gonna hit that kids' party and clean them out of every last present." Bill scratched his chin, a frown creasing his brow. "I don't know, Molly. Stealing from a bunch of kids? That seems low, even for us." "Low?" Molly scoffed. "This is the big score we've been waiting for. Do you have any idea how much loot those parents are gonna be bringing?" "But those presents are for the kids. Can't we just, I don't know, pull off a different job instead?" Bill shifted uncomfortably. Molly rolled her eyes. "Bill, you big softie. This is our chance to make it big. Think of the Caribbean beaches, the fancy champagne, the adoring fans. All we gotta do is snatch a few toys." Bill sighed, his resolve crumbling. "Alright, fine. But I'm not happy about it." "That's the spirit!" Molly clapped him on the back. "Now, here's what we're gonna do..."
id
"gen-rQqiTlqU3HSzFPYEjGVc5hCC6XAs"
total_cost
0.00036
upstream_id
"msg_01K5Zuk9X2Nac5dVcYjTmxug"
created_at
"2024-07-19T15:40:41.402827+00:00"
Friday, July 19th, 2024 at 3:40:41 PM GMT+00:00
model
"anthropic/claude-3-haiku:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
3078
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
239
native_tokens_prompt
75
native_tokens_completion
273
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00036
Evaluation details
Result
Evaluator
Details
Meta Data
0.0028%
Matches word count
n/a
neededClean
false
words
168
1.3494%
Dialogue to Total Word Ratio
Ratio: 75.60%, Deviation: 14.40%
neededClean
false
wordsTotal
168
wordsDialogue
127
0.6761%