NC Bench
Run: 2026-03-18-01KM079AMFCQWEC22X31NRQGRF
Run Details
Model: GPT-5.4 Nano
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 2.1s
Cost: $0.000362
Provider: OpenAI
Evaluation summary
Score: 13.53%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
AI
The bell over the door chimed, soft as a secret, and the warm smell of cinnamon wrapped around Maribel the moment she stepped inside. Loaves were cooling in tidy rows, and the glass case shone with sugared tarts. “I swear the scones were here yesterday,” she said, leaning closer to the empty space where strawberry should have been. “They’re not missing,” Nora replied, wiping flour from her hands with a bright smile that didn’t reach her eyes. “They were… borrowed.” “By who? The town is full of grandmothers and accountants, not thieves.” Maribel slid her finger along the display edge. “And these crumbs don’t look like anyone’s breakfast. They look like someone took one and left the rest.” Nora poured coffee into two mugs, the stream steady. “You’re thinking too hard. People like variety.” “Then why is there a smear of jam on the receipt roll?” Maribel pointed toward the counter drawer, where the clerk’s paper lay slightly unspooled, sticky at the corner. Nora’s smile tightened. “Maybe the delivery driver sneezed.” Maribel took a careful sniff of the paper. “No. That’s raspberry. And we don’t stock raspberry this week.” Nora set the mugs down, coffee steaming like a warning. “Maybe we should start.”
Meta
created_at: "2026-03-18T10:18:59.266Z" (3/18/2026, 10:18:59 AM)
model: "openai/gpt-5.4-nano-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 472
moderation_latency: (null)
generation_time: 1347
tokens_prompt: 81
tokens_completion: 341
native_tokens_prompt: 67
native_tokens_completion: 279
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00036215
router: (null)
provider_responses:
  0:
    endpoint_id: "0c835f2e-c18d-4e8c-b245-e1e3bd08b97f"
    id: "resp_0c6d5b4c4096fd450169ba7c1385f881968b846cfea6c8a92b"
    is_byok: false
    latency: 217
    model_permaslug: "openai/gpt-5.4-nano-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773829139-rqOXDTyv4Ze8boeEpOFh"
upstream_id: "resp_0c6d5b4c4096fd450169ba7c1385f881968b846cfea6c8a92b"
total_cost: 0.00036215
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details
Result: 13.53%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
  neededClean: false
  dialogueTags:
    0: "she said"
    1: "Nora replied"
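The evaluator's implementation is not shown on this page, so the following is only a minimal sketch of how a "Count dialogue tags" check might work. The verb list, the regular expression, and the function names `find_dialogue_tags` and `score` are all hypothetical. The scoring rule is also an assumption: it is chosen because exp(-2) ≈ 0.1353, which happens to coincide with the reported 13.53% for two detected tags, but the benchmark may compute its score differently.

```python
import math
import re

# Hypothetical list of speech verbs; the benchmark's real list is not published here.
SPEECH_VERBS = ("said", "replied", "asked", "answered", "whispered", "muttered")

# Matches a one-word speaker followed by a speech verb directly after a
# closing quotation mark, e.g. `” she said` or `” Nora replied`.
TAG_PATTERN = re.compile(
    r"[”\"]\s*(\b[A-Z]?[a-z]+\s+(?:" + "|".join(SPEECH_VERBS) + r")\b)"
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return every attribution such as 'she said' found after a closing quote."""
    return TAG_PATTERN.findall(text)


def score(tag_count: int) -> float:
    """Assumed scoring rule: exponential decay in the number of tags found."""
    return math.exp(-tag_count)


if __name__ == "__main__":
    excerpt = (
        "“I swear the scones were here yesterday,” she said, leaning closer. "
        "“They’re not missing,” Nora replied, wiping flour from her hands. "
        "“Maybe the delivery driver sneezed.” Maribel took a careful sniff."
    )
    tags = find_dialogue_tags(excerpt)
    print(tags, f"{score(len(tags)):.2%}")
```

Under this assumed rule a response with no attributions scores 100%, and each additional tag cuts the score by a factor of about 2.72. Action beats such as "Maribel took a careful sniff" are not counted, because the word after the speaker is not in the verb list.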