NC Bench
Model
GPT-4o, May 13th (temp=1)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-07-14-01J2SKDYN2GE21J11C2SFM8GGV
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"What do you think of the new pastry chef?" "She seems nice enough, but there's something off about her. Can't quite put my finger on it." "I noticed she locks the kitchen door when she’s baking. Why would she need to do that?" "That’s strange. Maybe she's just protective of her recipes." "Or maybe she’s hiding something. Remember how secretive she was about her last job?" "Yeah, she only said it was 'complicated.' Really doesn't add up, does it?" "Not at all. And did you see her reaction when the mayor came in this morning?" "I did. She froze like a deer caught in headlights. Suspicious, don't you think?" "Definitely. And what about the missing ingredients?" "I had the same thought. Flour and sugar don’t just disappear on their own." "You think she’s involved?" "Could be. She’s the only one with access to the kitchen besides Mary." "Speaking of Mary, she’s been awfully quiet since hiring her. Haven’t seen her around much." "Maybe it’s time we had a little chat with our new pastry chef, get to the bottom of all this." "Agreed. But let’s be careful. We don't know what we're walking into." "Good point. We'll approach it casually... over a cup of coffee and a slice of pie." "Sounds deliciously dangerous."
id
"gen-eIZktgXHNhIx44AOStOeL2Y0e333"
total_cost
0.004555
upstream_id
"chatcmpl-9l1LwW6xcJyIkQUVB8G7Z1WCi38p4"
created_at
"2024-07-14T21:47:28.606274+00:00"
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
4279
moderation_latency
213
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
285
native_tokens_prompt
68
native_tokens_completion
281
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.004555
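The reported cost can be sanity-checked against the native token counts. The sketch below is an illustration only: the per-token rates (USD 5 per million prompt tokens, USD 15 per million completion tokens) are assumptions that do not appear in this report, and `estimate_cost` is a hypothetical helper, not part of NC Bench or any provider API.

```python
# Assumed USD rates per token; these values are NOT stated in the report.
PROMPT_RATE = 5.00 / 1_000_000
COMPLETION_RATE = 15.00 / 1_000_000


def estimate_cost(native_prompt_tokens: int, native_completion_tokens: int) -> float:
    """Estimate the request cost from native token counts (hypothetical helper)."""
    return (
        native_prompt_tokens * PROMPT_RATE
        + native_completion_tokens * COMPLETION_RATE
    )


# native_tokens_prompt and native_tokens_completion as listed in the report
cost = estimate_cost(68, 281)
print(f"{cost:.6f}")
```

Under these assumed rates, the estimate agrees with the `total_cost` and `usage` values above to six decimal places, which suggests billing follows the native token counts rather than the normalized `tokens_prompt` and `tokens_completion` fields.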
Evaluation details
Result
100.0000%
Evaluator
Count dialogue tags
Details
n/a
Meta Data
neededClean
false
dialogueTags
(empty)
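The report does not show how the "Count dialogue tags" evaluator works. As a rough sketch of one way such a check might be done, the code below strips every quoted span and then looks for speech verbs in whatever narration remains. The verb list, the function name `find_dialogue_tags`, and the pass condition are all assumptions made for illustration, not the benchmark's actual implementation.

```python
import re

# Hypothetical verb list; the real evaluator's list is not shown in the report.
SPEECH_VERBS = ("said", "asked", "replied", "answered", "whispered", "muttered", "shouted")

# Matches straight-quoted and curly-quoted dialogue spans.
QUOTED = re.compile(r'"[^"]*"|\u201c[^\u201d]*\u201d')

# Matches any of the speech verbs as a whole word, case-insensitively.
TAG = re.compile(r"\b(?:" + "|".join(SPEECH_VERBS) + r")\b", re.IGNORECASE)


def find_dialogue_tags(text: str) -> list[str]:
    """Return speech verbs found outside quoted dialogue (assumed heuristic)."""
    narration = QUOTED.sub(" ", text)  # drop the dialogue itself
    return TAG.findall(narration)


attributed = '"Hello," said Mary. "How are you?" he asked.'
unattributed = '"What do you think of the new pastry chef?" "She seems nice enough."'

print(find_dialogue_tags(attributed))
print(find_dialogue_tags(unattributed))
```

Removing the quoted spans first keeps a verb spoken inside dialogue (for example, a character saying "I said no") from being counted as a tag. A response like the one in this run, which consists entirely of quoted lines, would yield an empty list under this heuristic, in line with the empty `dialogueTags` field above.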