NC Bench
Run: 2026-03-18-01KM08EWBZX374RTCEP3QBF7BH
Run Details
Model: GPT-5.4 Mini (Reasoning, Low)
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 3.1s
Cost: $0.001571
Provider: OpenAI
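The cost and duration above can be cross-checked against the token counts in this run's metadata (native_tokens_prompt 67, native_tokens_completion 338, generation_time 2707 ms, usage 0.00157125). The sketch below is only an illustration: the per-million-token prices are assumptions chosen because they reproduce the reported cost, not figures stated on this page, and the function names are hypothetical.

```python
import math

# Values copied from this run's metadata.
NATIVE_TOKENS_PROMPT = 67
NATIVE_TOKENS_COMPLETION = 338
GENERATION_TIME_MS = 2707
REPORTED_COST_USD = 0.00157125

# ASSUMPTION: per-million-token prices are not given in the source.
# These values are picked only because they reproduce the reported cost.
ASSUMED_INPUT_USD_PER_MTOK = 0.75
ASSUMED_OUTPUT_USD_PER_MTOK = 4.50


def estimate_cost(prompt_tokens, completion_tokens, in_price, out_price):
    """Return the estimated cost in USD for per-million-token prices."""
    return (prompt_tokens * in_price + completion_tokens * out_price) / 1_000_000


def tokens_per_second(completion_tokens, generation_ms):
    """Return completion throughput in tokens per second."""
    return completion_tokens / (generation_ms / 1000)


if __name__ == "__main__":
    cost = estimate_cost(
        NATIVE_TOKENS_PROMPT,
        NATIVE_TOKENS_COMPLETION,
        ASSUMED_INPUT_USD_PER_MTOK,
        ASSUMED_OUTPUT_USD_PER_MTOK,
    )
    print(f"estimated cost: ${cost:.8f}")
    print(f"matches reported cost: {math.isclose(cost, REPORTED_COST_USD)}")
    rate = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
    print(f"throughput: {rate:.1f} tokens/s")
```

Under these assumed prices the estimate agrees with the reported usage figure, and the throughput works out to roughly 125 native completion tokens per second. Whether the provider actually bills at these rates is not confirmed by the source.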
Evaluation summary
Score: 13.53%
Evaluator: Count dialogue tags
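The "Count dialogue tags" evaluator flags two attributions in this run's response, "Nora murmured" and "he said". The page does not show how the evaluator detects tags or how the count becomes a score, so the following is a minimal sketch under stated assumptions: the verb list and regular expression are hypothetical, and the scoring function exp(-n) is a guess that happens to match the reported 13.53% for n = 2 (exp(-2) ≈ 0.1353).

```python
import math
import re

# ASSUMPTION: a small, hypothetical list of speech verbs.
# The benchmark's real detection rules are not shown in the source.
SPEECH_VERBS = ("said", "asked", "murmured", "whispered", "replied", "muttered")

# A closing quote (curly or straight), optional whitespace, then
# "<Capitalized name or pronoun> <speech verb>".
TAG_PATTERN = re.compile(
    r'[”"]\s*((?:[A-Z][a-z]+|he|she|they)\s+(?:' + "|".join(SPEECH_VERBS) + r"))\b"
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return the attribution phrases that directly follow a closing quote."""
    return TAG_PATTERN.findall(text)


def score_from_tag_count(count: int) -> float:
    """ASSUMED scoring rule: exponential decay in the number of tags."""
    return math.exp(-count)


if __name__ == "__main__":
    excerpt = (
        "“Don’t say anything,” Nora murmured. “I vote tart.” "
        "“Well,” he said, glancing toward the scone tower, “I suppose.”"
    )
    tags = find_dialogue_tags(excerpt)
    print(tags)
    print(f"{score_from_tag_count(len(tags)):.2%}")
```

On the excerpt above, the pattern picks out the same two phrases the evaluator reports. A heuristic like this misses tags that precede the quote or use verbs outside the list, so it should be read as an illustration of the idea and not as the benchmark's method.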
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell over Poppy Lane Bakery gave a cheerful jingle as Nora pushed open the door, bringing in a ribbon of cold morning air and the smell of cinnamon. “Tell me I’m imagining things.” “Depends. Are you imagining the mayor’s wife hiding behind the scone tower?” “I’m not imagining it.” “A little to the left, then. She can’t see us from there.” Nora set a paper bag on the counter and lowered her voice. “Why is Alice Wren crouched between the oat bran and the lemon bars?” “Because she’s either playing a very elaborate prank or she found something she didn’t like in her husband’s office.” “Or she’s avoiding the quilting club.” “Same thing, in this town.” From behind the shelves came a muffled sniffle, followed by a crackle of paper. “Don’t say anything,” Nora murmured. “If she’s crying, I’ll never know whether to offer a tart or a tissue.” “I vote tart.” “You would.” The bakery door jingled again, and Mr. Duffy from the hardware store stepped inside, hat in hand and looking far too pleased with himself. “Well,” he said, glancing toward the scone tower, “I suppose the whole town knows by lunch now.” Nora and June exchanged a quick look. “Knows what?” Mr. Duffy smiled. “That the mayor’s wife just found the key to the safe. And the note tucked under it.”
created_at: "2026-03-18T10:39:29.677Z" (3/18/2026, 10:39:29 AM)
model: "openai/gpt-5.4-mini-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2264
moderation_latency: (null)
generation_time: 2707
tokens_prompt: 81
tokens_completion: 511
native_tokens_prompt: 67
native_tokens_completion: 338
native_tokens_completion_images: 0
native_tokens_reasoning: 30
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00157125
router: (null)
provider_responses:
  0:
    endpoint_id: "9ee065b2-3d1c-43bc-bdd7-28af3b148282"
    id: "resp_00d1e34342b2eba70169ba80e1bf3c8197ba47bd71296e4b00"
    is_byok: false
    latency: 184
    model_permaslug: "openai/gpt-5.4-mini-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773830369-jv7SkGmcRcJeMLLedSGd"
upstream_id: "resp_00d1e34342b2eba70169ba80e1bf3c8197ba47bd71296e4b00"
total_cost: 0.00157125
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details
Result: 13.53%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags:
    0: "Nora murmured"
    1: "he said"