NC Bench
Model
GPT-4o, May 13th (temp=0)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-07-14-01J2SKDYN1MWVG2V92TZF0NFGB
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Did you hear about Mrs. Thompson's missing cat?"
"Missing? I thought she just found it in the attic last week."
"That was last week. This time, it's really gone. She found a note."
"A note? What kind of note?"
"One of those ransom notes, made from cut-out magazine letters. It was taped to her front door."
"You're kidding. In our town? What did it say?"
"Something like, 'If you want to see Whiskers again, bring a dozen of your famous apple pies to the old mill by midnight.'"
"Apple pies? That's oddly specific. Do you think it's a prank?"
"Could be, but Mrs. Thompson is taking it seriously. She's been baking all morning."
"Poor woman. She must be worried sick. Who would do such a thing?"
"That's what I intend to find out. I have a hunch it might be someone local. Someone who knows about her pies."
"Well, if you need any help, you know where to find me. I can’t stand the thought of Whiskers in danger."
"Thanks. I might take you up on that. For now, let's keep our ears open. Someone's bound to slip up."
"Agreed. And in the meantime, how about a couple of those blueberry scones? I could use a distraction."
"Make it two. We might need the energy."
id
"gen-qvLoyfjcAqLfvKs9Ye7eyYw6CE87"
total_cost
0.004675
upstream_id
"chatcmpl-9l1LwhKhrv3V3BMJL9jxxR3qItc0P"
created_at
"2024-07-14T21:47:27.66789+00:00"
Sunday, July 14th, 2024 at 9:47:27 PM GMT+00:00
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3501
moderation_latency
205
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
292
native_tokens_prompt
68
native_tokens_completion
289
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.004675
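The reported cost can be reproduced from the native token counts listed above. The sketch below is a sanity check, not the provider's billing logic: the per-token prices are an assumption (the commonly quoted list prices for gpt-4o-2024-05-13 at the time, USD 5 per million prompt tokens and USD 15 per million completion tokens) and do not appear in this report. The function and constant names are hypothetical.

```python
# Assumed list prices in USD per 1,000,000 tokens (not stated in the report).
PROMPT_PRICE_PER_M = 5
COMPLETION_PRICE_PER_M = 15


def estimate_cost(native_prompt_tokens: int, native_completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    # Sum in whole "dollars per million" units first to avoid float drift,
    # then scale down to dollars once at the end.
    total_per_million = (
        native_prompt_tokens * PROMPT_PRICE_PER_M
        + native_completion_tokens * COMPLETION_PRICE_PER_M
    )
    return total_per_million / 1_000_000


# native_tokens_prompt = 68, native_tokens_completion = 289 (from the meta data)
cost = estimate_cost(68, 289)
print(f"{cost:.6f}")  # 0.004675
```

Under these assumed prices the result matches total_cost and usage exactly, which suggests the charge is based on the native token counts (68 and 289) rather than the normalized ones (69 and 292).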
Evaluation details
Result: 100.0000%
Evaluator: Count dialogue tags
Details: n/a
Meta Data: neededClean = false; dialogueTags = (empty)
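The report does not show how the "Count dialogue tags" evaluator works. As a rough illustration only, the sketch below finds attributions such as `said Mary` or `she replied` that directly follow a closing quotation mark. The verb list, the regular expression, and the function name are all assumptions made for this example, and the heuristic will miss multi-word names and can misfire on an utterance that begins with a speech verb.

```python
import re

# Hypothetical list of speech verbs; a real evaluator may use a different one.
SPEECH_VERBS = ("said", "asked", "replied", "answered", "whispered", "shouted", "muttered")

_VERBS = "|".join(SPEECH_VERBS)

# A quotation mark, optional whitespace, an optional single word (a name or
# pronoun), then a speech verb: matches '" said Mary' and '" she replied'.
TAG_PATTERN = re.compile(rf'["”]\s*(?:\w+\s+)?(?:{_VERBS})\b', re.IGNORECASE)


def find_dialogue_tags(text: str) -> list[str]:
    """Return the dialogue tags found in text (an empty list means unattributed)."""
    return [match.group(0).lstrip('"”').strip() for match in TAG_PATTERN.finditer(text)]


unattributed = '"A note? What kind of note?" "One of those ransom notes."'
attributed = '"Hello," said Mary. "Hi," she replied.'

print(find_dialogue_tags(unattributed))  # []
print(find_dialogue_tags(attributed))    # ['said', 'she replied']
```

An empty list for a response would correspond to the `dialogueTags = (empty)` result recorded for this run.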