NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2025-04-14-01JRTSWZ99XHB5XC4YE3SVC2Q7
Run Details
Model:
GPT-4.1 Mini
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
6.8s
Cost:
$0.000435
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
60.65%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the bakery door jingled softly as the woman stepped inside, shaking off a light drizzle. The scent of freshly baked bread and cinnamon pastries filled the cozy room. “Looks like the rain’s finally catching up with us,”
she said
, glancing out the fogged-up window. “Tell me about it,” came the reply from behind the counter. “I was just about to close early, but figured a hot cup of coffee might keep some folks around.” She smiled, stepping closer to the display case. “Any chance you’re hiding something new today? I need a little treat to brighten this gloomy afternoon.” “Funny you should ask. Just pulled out a batch of lavender honey scones. Not many people try lavender here, but I swear it’s easier on a gray day.” The woman’s eyes lit up. “I’ll take two, then. And a black coffee, please. My detective duties aren’t going to solve themselves.” The laugh that followed was warm, almost conspiratorial. “If you solve this town’s mysteries as well as you pick pastries, I’m certain the answer’s already in your pocket.” Outside, the rain softened to a gentle patter, and the soft hum of the oven filled the quiet space.
id
"gen-1744656301-moC8mdxdm7gbroCZi9gS"
upstream_id
"chatcmpl-BMJ5igZfew2aXa4WSUINrwVFag6XR"
total_cost
0.0004352
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:45:08.944318+00:00"
4/14/2025, 6:45:08 PM
model
"openai/gpt-4.1-mini-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
894
moderation_latency
108
generation_time
5239
tokens_prompt
68
tokens_completion
254
native_tokens_prompt
68
native_tokens_completion
255
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0004352
Evaluation details
Result
Evaluator
Details
Meta Data
60.65%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"she said"