NC Bench
Run: 2026-03-18-01KM09060XKM7MS3T6VF6BCEVC
Run Details
Model: GPT-5.4 Mini
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 2.1s
Cost: $0.001418
Provider: OpenAI
Evaluation summary
Score: 60.65%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
AI
The bell over the bakery door gave its cheerful tinkle as Mara stepped inside, bringing in a gust of autumn air and the smell of woodsmoke. Warmth wrapped around her at once, along with butter, cinnamon, and fresh bread. Behind the glass case, rows of lemon tarts and sticky buns gleamed under the lights. “Tell me that cinnamon swirl loaf is still available.” “Barely. You’re looking at the last one.” “I knew I should have come earlier.” “You say that every Thursday.” “Because every Thursday you keep making it impossible to resist.” A low laugh drifted from the kitchen, where flour dusted the floor and a tray of cooling scones sat on the counter. Mara leaned closer to the case, then frowned. “Has that vase always been there?” “What vase?” “The blue one by the register. I swear I’ve never seen it before.” “No, it’s new. Mrs. Henshaw left it here this morning.” “Empty?” “Not when she brought it in. It had a note tucked inside.” Mara looked up sharply. “A note?” “Mm-hmm. She said if anyone came asking about the old mill, they were to read it first.” The bakery seemed suddenly quieter, the hum of the refrigerator and the crackle of the oven stretching between them. “And,” Mara said, lowering her voice, “did anyone come asking?” One corner of the other woman’s mouth lifted. “You just did.”
Meta
created_at: "2026-03-18T10:48:56.617Z" (3/18/2026, 10:48:56 AM)
model: "openai/gpt-5.4-mini-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 385
moderation_latency: (null)
generation_time: 1920
tokens_prompt: 81
tokens_completion: 385
native_tokens_prompt: 67
native_tokens_completion: 304
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00141825
router: (null)
provider_responses:
  0:
    endpoint_id: "9ee065b2-3d1c-43bc-bdd7-28af3b148282"
    id: "resp_0d93242a084e36620169ba8318adc08194b9560b59e52f5fbb"
    is_byok: false
    latency: 212
    model_permaslug: "openai/gpt-5.4-mini-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773830936-c2kbKlAavMT2fHVHdtVJ"
upstream_id: "resp_0d93242a084e36620169ba8318adc08194b9560b59e52f5fbb"
total_cost: 0.00141825
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details
Result: 60.65%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags:
    0: "Mara said"