NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-07-14-01J2SKDYN1VZ0B7F9DB0EE9KR0
Run Details
Model:
GPT-4o, May 13th (temp=0)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
5.8s
Cost:
$0.004750
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Did you hear about Mrs. Thompson's missing cat?" "Missing? I thought she just let it roam around." "Not this time. She swears someone took it." "Who would steal a cat in this town? Everyone knows everyone." "That's what makes it so strange. She found a note on her porch." "A note? What did it say?" "Something like, 'If you want to see Whiskers again, meet me at the old mill at midnight.'" "Sounds like a prank. Kids, maybe?" "Maybe, but Mrs. Thompson is convinced it's serious. She came in here this morning, all flustered." "Poor woman. She's had a rough year already." "Tell me about it. First her husband, now this." "Do you think we should go to the mill tonight? Just to see?" "Are you suggesting we play detective?" "Why not? It's not like we have anything better to do." "True. But what if it's dangerous?" "Dangerous? In Pinewood? The most dangerous thing here is Mrs. Jenkins' fruitcake." "Alright, you’ve convinced me. Midnight at the mill it is." "Great. And if we solve this, maybe we can start our own detective agency." "Let's not get ahead of ourselves. First, let's find Whiskers." "Deal. Now, how about a cinnamon roll to fuel our investigation?" "Make it two."
id
"gen-mswKYahMy90cX0JkNh1FZ8XK7DUt"
total_cost
0.00475
upstream_id
"chatcmpl-9l1Lw5iPjGc4j6ZPc4LPozYe2kV1Y"
created_at
"2024-07-14T21:47:29.125408+00:00"
7/14/2024, 9:47:29 PM
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
4713
moderation_latency
432
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
296
native_tokens_prompt
68
native_tokens_completion
294
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00475
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)