NC Bench
Model
GPT-4o, Aug. 6th (temp=1)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-08-06-01J4MXVXJFDC5ETRBY445NXBTN
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
AI
The gentle chime of the bell rang out as the bakery door swung open, letting in a brisk autumn breeze. The sweet aroma of cinnamon and vanilla enveloped the room.
“This place smells heavenly, doesn't it?”
“It's my daily sanctuary. Have you tried their pecan streusel muffins? Practically a spiritual experience.”
“Considering how often you come here, I'm surprised there's not a pastry named after you yet.”
“Speaking of which, do you see anything odd about that gentleman?”
“The one by the display case? He’s just admiring the éclairs, I suppose.”
“He’s been standing there for ages. No one takes that long to choose a pastry.”
“Perhaps he’s overwhelmed by choice.”
“Or perhaps he’s pretending to be. Look at his shoes—exquisite and polished. Not the usual fare for these parts.”
“Interesting observation. You think he's not from around here?”
“Certainly not. Watch as he checks the time again. Looks to me like he's waiting for someone.”
“Should we invite him over? Maybe he’s lost.”
“I'd rather we wait and see. This could turn into something—”
“An unexpected adventure?”
“Something like that. Now, don’t look too eager; we don’t want him to suspect we’re onto him.”
“We’re in a bakery full of people. How suspicious can two people sharing the last lemon scone really be?”
Meta
id
"gen-fVZEAYIWfK4r93ErSfBqaoIBJWqz"
total_cost
0.003
upstream_id
"chatcmpl-9tNDG2YLuIRKJTkaTO1UFsoa5a2LH"
created_at
"2024-08-06T22:45:01.686341+00:00"
Tuesday, August 6th, 2024 at 10:45:01 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3147
moderation_latency
198
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
291
native_tokens_prompt
68
native_tokens_completion
283
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.003
Evaluation details
Result
100.0000%
Evaluator
Count dialogue tags
Details
n/a
Meta Data
neededClean
false
dialogueTags
(empty)
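The page does not show how the "Count dialogue tags" evaluator works, so the following is only a rough sketch of one way such a check could be written, not NC Bench's actual implementation. It assumes that a dialogue tag is a speech verb sitting directly next to a quoted span (for example `she said` after a closing quote), and the verb list, the function name `count_dialogue_tags`, and the regular expressions are all hypothetical choices made for illustration.

```python
import re

# Hypothetical list of speech verbs; a real evaluator may use a different set.
SPEECH_VERBS = ("said", "asked", "replied", "answered", "whispered",
                "murmured", "shouted", "exclaimed")
VERBS = "|".join(SPEECH_VERBS)

# A quoted span: straight or curly opening quote up to the next closing quote.
QUOTED = re.compile(r'["\u201c][^"\u201c\u201d]*["\u201d]')

# Quoted spans are replaced by a NUL marker so that only narration remains.
MARK = "\x00"

# Tag after a quote: <quote> she said / <quote> said Mary
AFTER = re.compile(rf"\x00\s*(?:\w+\s+){{0,2}}(?P<verb>{VERBS})\b", re.IGNORECASE)
# Tag before a quote: Mary said, <quote>
BEFORE = re.compile(rf"\b(?P<verb>{VERBS})\b[^\x00.!?]{{0,20}}[,:]\s*\x00",
                    re.IGNORECASE)


def count_dialogue_tags(text: str) -> int:
    """Count speech verbs adjacent to quoted spans (heuristic, assumption-based)."""
    narration = QUOTED.sub(MARK, text)
    # Deduplicate by verb position so a tag between two quotes counts once.
    hits = set()
    for pattern in (AFTER, BEFORE):
        for match in pattern.finditer(narration):
            hits.add(match.start("verb"))
    return len(hits)


unattributed = "“This place smells heavenly, doesn't it?” “It's my daily sanctuary.”"
attributed = "“Hello,” she said. “Who are you?” asked Tom. He replied, “Nobody.”"

print(count_dialogue_tags(unattributed))  # 0
print(count_dialogue_tags(attributed))    # 3
```

Under this heuristic the response above would yield zero tags, which is consistent with the empty `dialogueTags` field, though how the evaluator maps a count to the reported percentage is not stated on the page.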