NC Bench
Run: 2024-07-14-01J2SKDYN1CS76XS88VQNDN18H
Run Details
Model: GPT-4o, May 13th (temp=0)
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 4.1s
Cost: $0.004750
Provider: OpenAI
Evaluation summary
Score: 100.00%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Did you hear about Mrs. Thompson's missing cat?" "Missing? I thought she just found it yesterday." "That's what she said, but now it's gone again. She swears someone took it." "Who would steal a cat in this town? Everyone knows everyone." "Exactly. That's what makes it so strange. And you know what else? I saw Mr. Jenkins lurking around her house last night." "Mr. Jenkins? The retired librarian? He wouldn't hurt a fly." "Maybe not, but he has been acting odd lately. Always mumbling to himself and carrying that old briefcase everywhere." "That briefcase is full of crossword puzzles, nothing sinister." "Still, it's peculiar. And then there's the matter of the broken window at the general store." "Broken window? When did that happen?" "Last night, around the same time Mrs. Thompson's cat went missing. Coincidence? I think not." "You're starting to sound like one of those detective novels you love so much." "Maybe, but something's not right. And I intend to find out what." "Well, if anyone can solve this mystery, it's you. Just be careful. This town has more secrets than it lets on." "Don't worry. I've got my eyes and ears open. Now, how about a cinnamon roll to fuel my investigation?" "Make it two. I have a feeling we're going to need all the energy we can get."
id: "gen-5TvzarGTfZTM5n74gwByOrRIsCx9"
total_cost: 0.00475
upstream_id: "chatcmpl-9l1Lw6mJiTP3w1qJATSfDo5lWWQau"
created_at: "2024-07-14T21:47:27.40998+00:00"
model: "openai/gpt-4o-2024-05-13"
app_id: 182717
streamed: false
cancelled: false
provider_name: "OpenAI"
latency: 3241
moderation_latency: 213
generation_time: 0
finish_reason: "stop"
tokens_prompt: 69
tokens_completion: 300
native_tokens_prompt: 68
native_tokens_completion: 294
num_media_prompt: (null)
num_media_completion: (null)
origin: "https://ncbench.com/"
usage: 0.00475
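The reported total_cost of 0.00475 can be reproduced from the native token counts (68 prompt, 294 completion) if one assumes per-token pricing of $5 per million input tokens and $15 per million output tokens. Those prices are an assumption here, since the run record does not state them, and the function name below is hypothetical. A minimal sketch of the arithmetic:

```python
# Assumed prices for openai/gpt-4o-2024-05-13 in USD per 1M tokens.
# These are NOT part of the run record; they are an assumption for illustration.
PRICE_PER_M_INPUT = 5.00
PRICE_PER_M_OUTPUT = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one completion from its token counts."""
    total_micro_usd = (
        prompt_tokens * PRICE_PER_M_INPUT
        + completion_tokens * PRICE_PER_M_OUTPUT
    )
    return total_micro_usd / 1_000_000


# Native token counts taken from the run metadata.
cost = estimate_cost(prompt_tokens=68, completion_tokens=294)
print(f"{cost:.6f}")
```

Under these assumed prices the result agrees with the recorded cost when the native counts are used, rather than the normalized tokens_prompt and tokens_completion values (69 and 300).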
Evaluation details
Result: 100.00%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
neededClean: false
dialogueTags: (empty)
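The page does not show how the "Count dialogue tags" evaluator works. As a rough illustration only, one way to detect dialogue tags is to strip every quoted span and then look for speech verbs in whatever narration remains. The verb list, the regular expressions, and the function name below are all hypothetical and not taken from NC Bench:

```python
import re

# Hypothetical list of speech verbs; the real evaluator's rules are unknown.
SPEECH_VERB = re.compile(
    r"\b(?:said|asked|replied|whispered|shouted|muttered|answered|exclaimed)\b",
    re.IGNORECASE,
)

# Matches a span of dialogue in straight or curly double quotes.
QUOTED = re.compile(r'"[^"]*"|“[^”]*”')


def find_dialogue_tags(text: str) -> list[str]:
    """Return speech verbs found outside quoted dialogue."""
    # Remove the spoken parts so that verbs inside speech are not counted.
    narration = QUOTED.sub(" ", text)
    return SPEECH_VERB.findall(narration)


attributed = '"Hi," she said. "Bye."'
unattributed = '"That\'s what she said, but now it\'s gone again." "Who took it?"'

print(find_dialogue_tags(attributed))
print(find_dialogue_tags(unattributed))
```

Removing the quoted spans first matters for a response like the one in this run, where the word "said" appears inside a character's speech and should not count as an attribution. An empty result would correspond to the empty dialogueTags field above.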