NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM089AA72QTB5FTHFP47Y7WS
Run Details
Model:
GPT-5.4 Mini (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
3.7s
Cost:
$0.001675
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
13.53%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell over Honeycomb Bakery gave a cheerful jingle as Nora stepped in from the cold, bringing a swirl of sleet and the scent of wet wool with her. The place was warm enough to make her glasses fog. Behind the glass case, rows of cinnamon twists, jam tarts, and glossy lemon bars gleamed under the morning lights. “You’re late.” “I know. The bridge was iced over, and then Mrs. Barlow decided to tell me, in great detail, about her grandson’s trout.” “Unforgivable.” Nora laughed and held out a paper bag. “I brought the blueberry scones you asked for.” “Bless you.” A crash sounded from the kitchen, followed by a sharp curse. “Is that Marge?” “Who else would it be?” The woman behind the counter wiped her hands on her apron and frowned toward the swinging door. “She’s been in a state all morning. Says somebody’s been moving things in the bakery after closing.” “Moving things?” “Little things. A rolling pin. The receipt book. Yesterday she found a sugar sack tied with ribbon.” Nora’s smile faded. “Ribbon?” “Pink,
she said
. Not our ribbon.” The kitchen door banged open, and Marge burst out clutching a flour-dusted envelope. “I found this behind the flour bin,”
she said
. “And before anyone asks, no, I did not put it there.” Nora took the envelope, noticing the seal at once. It was embossed with the town crest. “Well,”
she said
quietly, “that’s not a recipe card.”
created_at
"2026-03-18T10:36:27.349Z"
3/18/2026, 10:36:27 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2892
moderation_latency
(null)
generation_time
3459
tokens_prompt
81
tokens_completion
526
native_tokens_prompt
67
native_tokens_completion
361
native_tokens_completion_images
0
native_tokens_reasoning
29
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00167475
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_089dcb5f185dc0140169ba802b6ed4819683884f7cc2926b9f"
is_byok
false
latency
250
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830187-OpueFtbQAwe6IPHjZFej"
upstream_id
"resp_089dcb5f185dc0140169ba802b6ed4819683884f7cc2926b9f"
total_cost
0.00167475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
13.53%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"she said"
1
"she said quietly"