NC Bench
Overview
About
Tests
Models
Model
GPT-4o, Aug. 6th (temp=1)
Test
Dialogue tags
Scenario
Write unattributed dialogue
Run ID
2024-08-06-01J4MXSZ4K6TYYZZTZW68DDQ7Q
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The aroma of freshly baked pastries and brewed coffee filled the small bakery. Sunlight streamed through the window, casting warm pools of light on the checkered floor. "Did you hear about Mrs. Hargrove?" "No, what happened?" "Apparently, she found something curious in her pie when she sliced it yesterday." "Not another mouse tail? That's the third time this month." "Not quite. She found a tiny key." "A key? Well, now that's something." "Exactly. Folks are saying it must open the old safe in the library basement." "That safe has been locked for decades, hasn't it?" "Indeed. Rumor says there's something valuable inside." "Reckon it could be the long-lost Baker's Pearl?" "Oh, aren't you keen. But Mrs. Hargrove is keeping that key under tight wraps, no one's laying a hand on it." "Can't say I blame her, with treasure hunters sniffing around these parts recently." "True. Still, a mystery like this... It's got everyone talking. Old towns like ours don't see much excitement." "Except when it's baked into a pie, apparently." "Or when it's kneaded into dough. What are you going to have?" "I think the raspberry tart. You?" "Same. And maybe, just maybe, we’ll find a clue of our own."
id
"gen-ReHJsOh35BtIzph5cmQq11NfzbRm"
total_cost
0.0029
upstream_id
"chatcmpl-9tNCEkaQnTVUOnICl4ahS3lvVPtjN"
created_at
"2024-08-06T22:43:58.106777+00:00"
Tuesday, August 6th, 2024 at 10:43:58 PM GMT+00:00
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
3418
moderation_latency
194
generation_time
0
finish_reason
"stop"
tokens_prompt
69
tokens_completion
283
native_tokens_prompt
68
native_tokens_completion
273
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0029
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)