NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=0)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-08-06-01J4MXY3SJ91CSRTV1W5DYDN3Q
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The aroma of freshly baked bread mingled with the sweet scent of cinnamon as the bell above the door jingled. "Morning! You're here early today." "Couldn't resist the call of your blueberry scones. They’re practically famous around town now." "Flattery will get you a free coffee. How's the investigation going?" "Slow. You'd think in a town this size, secrets would be hard to keep." "People have a way of hiding things in plain sight. Like that time Mrs. Thompson swore she lost her cat, only to find it napping in her laundry basket." "True, but this isn't a missing cat. It's a missing necklace, and a valuable one at that." "Do you think it was stolen?" "Possibly. Or misplaced. But the owner insists it was taken." "Have you talked to everyone who was at the garden party?" "Yes, and everyone has an alibi. It's like the necklace vanished into thin air." "Maybe it’s hiding in plain sight, like Mrs. Thompson’s cat." "Perhaps. I might need to look at things from a different angle." "Well, if you need a fresh perspective, you know where to find me. I’m always here, with scones and secrets." "I might just take you up on that. Thanks for the coffee."
id
"gen-cR6mfIeIXZMy09BL4iqSODN9489u"
total_cost
0.00288
upstream_id
"chatcmpl-9tNEQURcIyK4MjACgQmQSvcSIolZI"
created_at
"2024-08-06T22:46:13.778119+00:00"
Tuesday, August 6th, 2024 at 10:46:13 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3353
moderation_latency
178
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
275
native_tokens_prompt
68
native_tokens_completion
271
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00288
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)