NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-14-01J2SKDYN2GE21J11C2SFM8GGV
Run Details
Model:
GPT-4o, May 13th (temp=1)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
5.3s
Cost:
$0.004555
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"What do you think of the new pastry chef?" "She seems nice enough, but there's something off about her. Can't quite put my finger on it." "I noticed she locks the kitchen door when she’s baking. Why would she need to do that?" "That’s strange. Maybe she's just protective of her recipes." "Or maybe she’s hiding something. Remember how secretive she was about her last job?" "Yeah, she only said it was 'complicated.' Really doesn't add up, does it?" "Not at all. And did you see her reaction when the mayor came in this morning?" "I did. She froze like a deer caught in headlights. Suspicious, don't you think?" "Definitely. And what about the missing ingredients?" "I had the same thought. Flour and sugar don’t just disappear on their own." "You think she’s involved?" "Could be. She’s the only one with access to the kitchen besides Mary." "Speaking of Mary, she’s been awfully quiet since hiring her. Haven’t seen her around much." "Maybe it’s time we had a little chat with our new pastry chef, get to the bottom of all this." "Agreed. But let’s be careful. We don't know what we're walking into." "Good point. We'll approach it casually... over a cup of coffee and a slice of pie." "Sounds deliciously dangerous."
id
"gen-eIZktgXHNhIx44AOStOeL2Y0e333"
total_cost
0.004555
upstream_id
"chatcmpl-9l1LwW6xcJyIkQUVB8G7Z1WCi38p4"
created_at
"2024-07-14T21:47:28.606274+00:00"
7/14/2024, 9:47:28 PM
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
4279
moderation_latency
213
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
285
native_tokens_prompt
68
native_tokens_completion
281
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.004555
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)