NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=1)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-08-06-01J4MXW9VN2RGY26YFHGJZ0JJ0
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as the buttery aroma of croissants wrapped around her like a scarf. She brushed a few stubborn raindrops off her coat and glanced around the bakery, settling on the woman behind the counter. "You're earlier than usual today." "Had a hunch I might catch a new blend. Am I right?" "Sharp as ever. We've got a lavender honey scone fresh from the oven." "Tempting, but I'm more interested in a different flavor—yesterday's gossip." "You'd think in a town this small we'd run out of stories. But no, still brewing." "Ah, I see the regular crew is conspicuously absent this morning. Is it silence by intent or coincidence?" "More of a strategic retreat, I’d imagine. Ever since you solved the mystery of Eleanor's missing cat, folks have been treading lightly." "I suppose it's not every day they see a dental hygienist moonlighting as a detective." "As long as it doesn't scandalize Mrs. Pritchard, I think you're safe." "And if it does?" "Bring her one of these scones and she'll forgive your entire past." "Noted. Now, tell me, what’s brewing besides the coffee?" "Keep your voice down, but... rumor has it, someone planted begonias in the town square. Imagine, begonias!" "Who would've thought a flower could rustle so many feathers?"
id
"gen-7hXtmsKGzi8sxJTDHLURTcuPXDre"
total_cost
0.00308
upstream_id
"chatcmpl-9tNDTkZ7r94MzlGZg9whihgYEIbkr"
created_at
"2024-08-06T22:45:14.702145+00:00"
Tuesday, August 6th, 2024 at 10:45:14 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3604
moderation_latency
231
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
301
native_tokens_prompt
68
native_tokens_completion
291
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00308
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)