NC Bench
Run: 2026-03-18-01KM098NYCGJBC0HN9375XSG65
Run Details
Model: GPT-5.4 Mini (Reasoning)
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 2.7s
Cost: $0.002053
Provider: OpenAI
Evaluation summary
Score: 0.03%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell over the bakery door gave a cheerful jingle, and warm air wrapped around Clara like a quilt. Cinnamon, butter, and fresh bread drifted from the back room, making the whole place smell like breakfast and comfort. “You’re late.” “I brought coffee.” “That almost makes up for it.” Clara slid a paper cup across the counter. Mrs. Vale accepted it with a sniff, though the corners of her mouth softened. Behind her, trays of raspberry scones cooled beside a plate of lemon bars dusted so evenly with sugar they looked frosted. “I found something odd outside,” Clara said, lowering her voice. “Odd is what passes for excitement in this town.” “Someone left a flour print by the back door.” Mrs. Vale paused, one hand on the rolling pin. “A shoe print?” “More like a handprint. In flour.” That got her attention. She set the pin down and came around the counter. “Show me.” Clara reached into her tote and produced a small brass key, still dusted white at the edges. “It was beside it.” Mrs. Vale stared at the key, then toward the curtained doorway leading to the storeroom. “Well,” she said at last, “that’s not supposed to be there.” From the back, a muffled thump echoed through the bakery. They both turned. “Oh, dear,” Clara whispered. Mrs. Vale took another sip of coffee and reached for the rolling pin. “Now that,” she said, “is definitely not part of the recipe.”
created_at: "2026-03-18T10:53:35.063Z" (3/18/2026, 10:53:35 AM)
model: "openai/gpt-5.4-mini-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1176
moderation_latency: (null)
generation_time: 2514
tokens_prompt: 81
tokens_completion: 396
native_tokens_prompt: 67
native_tokens_completion: 445
native_tokens_completion_images: 0
native_tokens_reasoning: 122
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00205275
router: (null)
provider_responses:
  0:
    endpoint_id: "9ee065b2-3d1c-43bc-bdd7-28af3b148282"
    id: "resp_0a643af5bc1d53490169ba842f22448194a92ede9bc1d9fee0"
    is_byok: false
    latency: 224
    model_permaslug: "openai/gpt-5.4-mini-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773831215-acj2J2XusPa9Etyw4mWU"
upstream_id: "resp_0a643af5bc1d53490169ba842f22448194a92ede9bc1d9fee0"
total_cost: 0.00205275
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details
Result: 0.03%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags:
    0: "Clara said"
    1: "she said"
    2: "Clara whispered"
    3: "she said"
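
The evaluator's implementation is not shown on this page. As a rough illustration of how four tags like the ones listed above could be detected, here is a minimal Python sketch. It assumes a tag is a name or pronoun followed by a speech verb directly after a closing quotation mark. The names SPEECH_VERBS, TAG_PATTERN and find_dialogue_tags are hypothetical, the verb list is an arbitrary choice, and the sample is an excerpt of the response above.

```python
import re

# Hypothetical, deliberately short verb list; a real evaluator may use a larger one.
SPEECH_VERBS = ("said", "whispered", "asked", "replied", "murmured", "shouted")

# A closing quote (curly or straight), whitespace, then "<Name|pronoun> <verb>".
TAG_PATTERN = re.compile(
    r'["”]\s+((?:[A-Z][a-z]+|she|he|they)\s+(?:' + "|".join(SPEECH_VERBS) + r"))\b"
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return every attribution phrase that follows a closing quote."""
    return TAG_PATTERN.findall(text)


# Excerpt of the model response shown above.
sample = (
    "“I found something odd outside,” Clara said, lowering her voice. "
    "“Show me.” Clara reached into her tote. "
    "“Well,” she said at last, “that’s not supposed to be there.” "
    "“Oh, dear,” Clara whispered. "
    "“Now that,” she said, “is definitely not part of the recipe.”"
)

tags = find_dialogue_tags(sample)
print(len(tags), tags)
```

Under these assumptions, action beats such as "Clara reached" are not counted, because "reached" is not in the verb list. How the 0.03% score is derived from the tag count is not stated on this page, so the sketch does not attempt to reproduce it.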