NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-23-01J3G7ZRSCSTBFVPE70Q4M41VM
Run Details
Model:
Llama 3.1 8B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1.3s
Cost:
$0.000071
Provider:
Fireworks
Evaluation summary
Score
Evaluator
Details Link
60.65%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The sweet scent of freshly baked bread wafted through the air as the early morning crowd began to trickle in to the Sunshine Bakery. Behind the counter, a florid-faced woman with a wild mane of curly hair chatted with a tall, lanky man, his eyes scanning the display case. "You can't believe the trouble we've had with her,"
the woman said
, her voice rising in indignation. "Pampering those precious customers of hers, making sure they're all happy and content. I swear, it's like she thinks she's running the town or something." The man nodded sympathetically, his eyes lingering on the rows of golden brown scones. "I know what you mean. But you have to admit, she's done a good job of making Sweet Smiles the go-to bakery in town. People come from all over for those famous lemon bars." The woman snorted. "Famous? Please. Anyone can throw some tartar and sugar on a tray and call it a lemon bar. I've been baking for years, and my products are far superior." The man chuckled, his eyes crinkling at the corners. "I think you might be biased, Mary. But I do know one thing – the real challenge will come when word gets out she's moving to the old Miller place on Main Street. Then the real fireworks will start."
id
"gen-oT2AXzqRxJm4ZJHk7ZwFDJjA8ST2"
total_cost
0.0000706
upstream_id
"36005fcd-9e21-450d-8a67-b3b073e0e242"
created_at
"2024-07-23T16:49:57.605786+00:00"
7/23/2024, 4:49:57 PM
model
"meta-llama/llama-3.1-8b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
1142
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
103
tokens_completion
280
native_tokens_prompt
71
native_tokens_completion
282
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0000706
Evaluation details
Result
Evaluator
Details
Meta Data
60.65%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"the woman said"