NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-14-01J2SKDYN2G9MDXPFXCRR6JF3R
Run Details
Model:
GPT-4o, May 13th (temp=0)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
6.0s
Cost:
$0.005200
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Did you hear about Mrs. Thompson's cat?" "Her cat? No, what happened?" "Gone missing. Just vanished. She swears it was stolen." "Stolen? In this town? Who would steal a cat?" "That's what I said. But you know Mrs. Thompson, always jumping to conclusions." "True. Remember when she thought the mailman was a spy?" "How could I forget? Poor Mr. Jenkins. He still avoids her house." "Well, I hope she finds her cat. It's been a rough year for her." "Yeah, first the garden gnomes, now this. It's like she's cursed." "Speaking of curses, did you see the new sign outside the antique shop?" "No, what does it say?" "Closed for investigation. Apparently, some of the items were stolen." "Stolen? In this town? That's the second time today I've heard that word." "Strange, isn't it? First the cat, now the antiques. Makes you wonder." "Wonder what?" "If there's a connection. Maybe someone new in town?" "Or someone old with new intentions." "Like who?" "Well, I did see Mr. Grayson lurking around the bakery last night." "Mr. Grayson? But he's harmless. Just a bit eccentric." "Maybe. Or maybe he's hiding something." "Guess we'll have to keep our eyes open. This town's getting interesting." "Too interesting for my taste. I just want my morning muffin in peace." "Good luck with that. Seems like peace is the one thing we can't find."
id
"gen-0ooRQgCYA1kyEU1mV03dcZIR4XnD"
total_cost
0.0052
upstream_id
"chatcmpl-9l1Lw2IDw6waKA66Xo0YX002tkBPM"
created_at
"2024-07-14T21:47:29.270065+00:00"
7/14/2024, 9:47:29 PM
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
4938
moderation_latency
220
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
332
native_tokens_prompt
68
native_tokens_completion
324
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0052
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)