NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-14-01J2SKDYN20CMRZ45DPYPRNDNF
Run Details
Model:
GPT-4o, May 13th (temp=1)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
5.2s
Cost:
$0.004885
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Looks like Mrs. Tindle's at it again, huh?" "Her lemon scones or her gossiping?" "Both, if you ask me. Heard she spotted Mr. Finch sneaking out of the back door at midnight." "Finch? The librarian? No way. Midnight’s way past his bedtime." "Maybe he was returning some overdue books!" "I doubt it. Can you keep an eye on the counter? I need more cinnamon rolls." "Sure thing. Here’s the tray of fresh muffins, by the way." "You’re a lifesaver. So, what else did Mrs. Tindle say?" "She insists she saw him carrying something wrapped in a blanket." "That’s peculiar. Wasn’t there a break-in at the historical society last week?" "That’s right. Old maps went missing—Mrs. Tindle would be the first to know if they found anything." "Interesting. Do you think Finch could be involved?" "He’s always struck me as an oddball, but a thief? Hard to say." "Well, I suppose we’ll just have to keep an eye on him. Here, taste this lemon curd—new recipe." "Oh, that’s divine. If Finch did steal those maps, maybe he’s on the run with them." "In Willow Creek? Where would he hide?" "A good mystery, isn’t it? Thanks for the lemon curd. We should get Mrs. Tindle her scone before she starts a rumor about us!"
id
"gen-GgnHEckMCLfF0k73Z2ZndQ6yu63G"
total_cost
0.004885
upstream_id
"chatcmpl-9l1LwBFTMl4r80LT3xvmwI9UGpm4N"
created_at
"2024-07-14T21:47:28.495387+00:00"
7/14/2024, 9:47:28 PM
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
4134
moderation_latency
206
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
301
native_tokens_prompt
68
native_tokens_completion
303
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.004885
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)