NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-19-01J35T0Q1WWJBKPYRQMDA287PQ
Run Details
Model:
Claude 3 Haiku
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
2.3s
Cost:
$0.000386
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
30.1380%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the dimly lit apartment, her brow furrowed in concentration. "So, Bill, you sure this kids' party heist is going to work? I mean, it's not exactly our usual high-stakes target." Bill leaned back in his chair, a mischievous grin spreading across his face. "Trust me, Mol, this is gonna be a cakewalk. Those little ankle-biters are gonna be too busy playing pass the parcel to even notice us slipping in and out." "I don't know, Bill," Molly said, chewing on her lip. "What if the parents catch us? I don't fancy spending the next few years behind bars, you know." "Relax, will ya?" Bill said, waving a dismissive hand. "I've got it all figured out. We'll go in, grab the goods, and be back at the safe house before anyone's the wiser." Molly sighed, but there was a glint of excitement in her eyes. "Alright, fine. But if this goes south, I'm blaming you." Bill chuckled, already reaching for the blueprints of the party venue. "That's the spirit, Mol. Now, let's get to work and make some kids cry on their special day."
id
"gen-5XV37cyTAwFyFAeEwIazyx6HKHW3"
total_cost
0.00038625
upstream_id
"msg_01HvcnG54MjUZG6du1s7Eajq"
created_at
"2024-07-19T15:33:25.063659+00:00"
7/19/2024, 3:33:25 PM
model
"anthropic/claude-3-haiku:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
2202
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
262
native_tokens_prompt
75
native_tokens_completion
294
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00038625
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
185
0.00%
Dialogue to Total Word Ratio
Ratio: 68.45%, Deviation: 18.45%
neededClean
false
wordsTotal
187
wordsDialogue
128
30.1380%