NC Bench
Overview
About
Tests
Models
Model
GPT-4o, May 13th (temp=0)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-07-14-01J2SKDYN19VS15YG563J7MP65
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Did you hear about Mrs. Thompson's cat?" "Her cat? No, what happened?" "Gone missing. She swears it was stolen." "Stolen? In this town? Who would do such a thing?" "That's the mystery, isn't it? She found a note on her porch." "A note? What did it say?" "Something cryptic. 'Finders keepers, losers weepers.'" "That's chilling. Poor Mrs. Thompson. She must be beside herself." "She is. And you know what else? The note was written on the back of a bakery receipt." "From here? Our bakery?" "Yes, from here. Dated last Tuesday." "Last Tuesday... that was the day of the pie contest. The place was packed." "Exactly. Anyone could have been here. But who would leave such a clue?" "Maybe someone who wants to be caught. Or someone who thinks they won't be." "Either way, it's unsettling. Mrs. Thompson's been asking around, trying to see if anyone remembers anything unusual." "Well, I did see Mr. Jenkins acting a bit odd that day. He was lingering near the counter, looking nervous." "Mr. Jenkins? The librarian? He doesn't seem the type." "People can surprise you. Maybe we should tell Mrs. Thompson." "Agreed. Let's finish our coffee and head over. She needs all the help she can get."
id
"gen-zRq1mCqFS9ljsnG2iBnxzXT8e6YD"
total_cost
0.00472
upstream_id
"chatcmpl-9l1Lwfw3Bpv5sBXrugbtmY3cFGq0I"
created_at
"2024-07-14T21:47:29.842697+00:00"
Sunday, July 14th, 2024 at 9:47:29 PM GMT+00:00
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
5545
moderation_latency
191
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
293
native_tokens_prompt
68
native_tokens_completion
292
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00472
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)