NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=0)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-08-06-01J4MXS6ZMVQS46X6SJ5683XA8
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The aroma of freshly baked bread mingled with the sweet scent of cinnamon as the bell above the door jingled softly. "Morning! I see you're up early again." "Couldn't resist the call of your blueberry scones. They're practically famous around here." "Flattery will get you everywhere. But tell me, what brings you in today besides my baking?" "Well, I heard some interesting chatter at the post office. Seems like Mrs. Hargrove's cat went missing again." "Again? That cat has more adventures than anyone in this town." "True, but this time, it was last seen near the old mill. You know, where the lights have been flickering at night." "Strange. You think there's more to it than just a wandering feline?" "Could be. Remember the rumors about the mill being haunted?" "Ghosts and cats, what a combination. But seriously, do you think we should check it out?" "Why not? A little mystery might spice up our quiet lives. Besides, I have a feeling there's more to this than meets the eye." "Alright, but only if you promise to bring some of those scones. Ghost hunting on an empty stomach is a terrible idea." "Deal. Let's meet at dusk. Who knows what we'll uncover?"
id
"gen-LIiwHJOvk6JOcr6YXYAX0jjMyT7u"
total_cost
0.00279
upstream_id
"chatcmpl-9tNBpX6dgOkCyLMwzHXHX0lUhcfK5"
created_at
"2024-08-06T22:43:33.140584+00:00"
Tuesday, August 6th, 2024 at 10:43:33 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3314
moderation_latency
197
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
270
native_tokens_prompt
68
native_tokens_completion
262
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00279
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)