NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM091Q3E1S1VC1F8T65JP8X7
Run Details
Model:
GPT-5.4 Nano (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
2.5s
Cost:
$0.000383
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.03%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door gave a soft chime, and the warm, buttery air wrapped around them as if it had been waiting. Flour dusted the counter like fresh snow. A tray of cinnamon rolls cooled near the window, glistening with glaze. “I swear I didn’t steal it,”
Mara said
, sliding a mixing bowl into place with exaggerated care. “You wouldn’t have to steal it if someone hadn’t moved it,”
Nora replied
, tucking a tea towel over her shoulder. She glanced toward the back shelf where the old cookbook used to sit. “It was right there yesterday,”
Mara insisted
, lowering her voice. “Next thing I know, Mrs. Delaney’s missing heirloom recipe card is gone.” Nora leaned in, eyebrows lifting. “Maybe it fell behind the flour bin.” Mara looked horrified. “Flour bins don’t eat cards. They just make everything look guilty.” Nora’s smile sharpened. “Then we should stop making everything look guilty and start looking.” She pulled open the pantry door. A puff of cinnamon-scented air escaped, followed by silence. Nora reached past the sacks and tugged free a folded card, edges smudged with cocoa. “Found it,”
Nora said
, holding it up like evidence. Mara exhaled. “Okay. But who put cocoa on a cinnamon card?” Nora’s gaze flicked to Mara’s apron. “You tell me.”
created_at
"2026-03-18T10:49:46.877Z"
3/18/2026, 10:49:46 AM
model
"openai/gpt-5.4-nano-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1112
moderation_latency
(null)
generation_time
2190
tokens_prompt
81
tokens_completion
352
native_tokens_prompt
67
native_tokens_completion
296
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0003834
router
(null)
provider_responses
0
endpoint_id
"0c835f2e-c18d-4e8c-b245-e1e3bd08b97f"
id
"resp_05255cc9bc4aaddd0169ba834af4108193aed96ead7714c68c"
is_byok
false
latency
215
model_permaslug
"openai/gpt-5.4-nano-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830986-Nhf93k1SXoxDAl6OsG4v"
upstream_id
"resp_05255cc9bc4aaddd0169ba834af4108193aed96ead7714c68c"
total_cost
0.0003834
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"Mara said"
1
"Nora replied"
2
"Mara insisted"
3
"Nora said"