NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHKA1XNBQ7FD5PTP7CS8D3AR
Run Details
Model:
GPT-5 Mini
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
16.9s
Cost:
$0.002471
Provider:
OpenAI
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
You left the back door open and the oven nearly ate the morning. I left it open on purpose; I had to follow a trail of flour three doors down. Flour? Who else in Marigold bakes at two in the morning? The mailman, Sam, delivered a postcard with a smear of pastry on it and a corner torn like someone was in a hurry. A postcard? From whom? No name, just "We don't forget recipes," and a pressed sprig of rosemary tucked inside. Who would send a passive-aggressive herb like a hostage? Mrs. Pennington? She thinks recipes are county secrets and keeps a sieve like it's a sheriff's badge. She was at the quilting circle until midnight. Someone dropped something more telling on my prep table. What, besides threatening aromatics? A single button with my initials sewn under it and a smear of ink that isn't mine. You mean like someone came here, rifled through your recipe box, and left a calling card? Exactly. And the raspberry danish I set out for the window sitter—gone, crumbs leading toward the alley. So either a very hungry person with a flair for stationery, or someone's sending you a message. Either way, you didn't call the sheriff? Not yet. I thought I'd ask the person who knows people by their loaf sizes first. And if your adviser suspects Mrs. Pennington? Then we bake a dozen scones and see who shows up pretending to forget their manners.
created_at
"2026-02-16T13:26:08.627877+00:00"
2/16/2026, 1:26:08 PM
model
"openai/gpt-5-mini-2025-08-07"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6371
moderation_latency
(null)
generation_time
16608
tokens_prompt
81
tokens_completion
871
native_tokens_prompt
67
native_tokens_completion
1227
native_tokens_completion_images
0
native_tokens_reasoning
896
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00247075
router
(null)
provider_responses
0
id
"resp_0240dc68939513960169931adef6908193b64a566ef011fef3"
status
200
is_byok
false
latency
122
endpoint_id
"c4f66d01-20b0-4c27-a225-438ea22fda43"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-5-mini-2025-08-07"
api_type
"completions"
id
"gen-1771248350-uYLGWAoS1gcOkwzFGQX3"
upstream_id
"resp_0240dc68939513960169931adef6908193b64a566ef011fef3"
total_cost
0.00247075
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
60.6531%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"No name, just and a pressed sprig of rosemary tucked inside."