NC Bench
Overview
About
Tests
Models
Model
GPT-4o, May 13th (temp=1)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-07-14-01J2SKDYN2D7KD17N4V619DYB1
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Did you hear about Mrs. Thornton's missing cat?" "Not again! Wasn't that the fourth time this month?" "Fifth, actually. I think there's something fishy going on." "Well, if it isn't Buster, she’s lost her marbles. How's she taking it?" "Surprisingly well. I found her in here nibbling on a scone as if nothing happened." "Maybe those lavender scones have mystical powers." "Or she's just used to his Houdini antics by now. Have you tried the new cinnamon rolls?" "Not yet, but they smell divine. Anyway, do you think someone’s taking Buster?" "Could be. I saw a mysterious car parked near her house last night." "You should tell Constable Martin. He might actually do his job for once." "Maybe. But what if it's just a coincidence?" "In Lostwood Grove? Coincidences are as rare as a chocolate tea kettle." "Touché. Alright, I’ll drop by the station later. Speaking of mysteries, did you solve your donut dilemma?" "I did, actually! It was Margaret switching the labels in a sleep-deprived daze." "Hilarious. I bet she felt terrible." "She was mortified, bless her heart. Offered to bake an entire new batch to make up for it." "Well, isn't that just the cherry on top of today's drama?" "Indeed it is. Let's hope Buster turns up soon."
id
"gen-95rAxkVXWXYOCBZcCz9ZlVJZRQgn"
total_cost
0.00484
upstream_id
"chatcmpl-9l1LwGFzxnvPncoKU5ttfZ3eWq9TI"
created_at
"2024-07-14T21:47:29.005317+00:00"
Sunday, July 14th, 2024 at 9:47:29 PM GMT+00:00
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
4621
moderation_latency
192
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
302
native_tokens_prompt
68
native_tokens_completion
300
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00484
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)