NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-08-06-01J4MXS6ZMVQS46X6SJ5683XA8
Run Details
Model:
GPT-4o, Aug. 6th (temp=0)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
3.6s
Cost:
$0.002790
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The aroma of freshly baked bread mingled with the sweet scent of cinnamon as the bell above the door jingled softly. "Morning! I see you're up early again." "Couldn't resist the call of your blueberry scones. They're practically famous around here." "Flattery will get you everywhere. But tell me, what brings you in today besides my baking?" "Well, I heard some interesting chatter at the post office. Seems like Mrs. Hargrove's cat went missing again." "Again? That cat has more adventures than anyone in this town." "True, but this time, it was last seen near the old mill. You know, where the lights have been flickering at night." "Strange. You think there's more to it than just a wandering feline?" "Could be. Remember the rumors about the mill being haunted?" "Ghosts and cats, what a combination. But seriously, do you think we should check it out?" "Why not? A little mystery might spice up our quiet lives. Besides, I have a feeling there's more to this than meets the eye." "Alright, but only if you promise to bring some of those scones. Ghost hunting on an empty stomach is a terrible idea." "Deal. Let's meet at dusk. Who knows what we'll uncover?"
id
"gen-LIiwHJOvk6JOcr6YXYAX0jjMyT7u"
total_cost
0.00279
upstream_id
"chatcmpl-9tNBpX6dgOkCyLMwzHXHX0lUhcfK5"
created_at
"2024-08-06T22:43:33.140584+00:00"
8/6/2024, 10:43:33 PM
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3314
moderation_latency
197
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
270
native_tokens_prompt
68
native_tokens_completion
262
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00279
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)