NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM09E5PRTH4CE263HREBZ850
Run Details
Model:
GPT-5.4 Nano (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
2.5s
Cost:
$0.000418
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
60.65%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell over the door chimed, and warm cinnamon air washed over them. Flour dusted the counter like fresh snow, and a tray of lemon tarts cooled under a tea towel. “You’re late,”
Lena said
, not looking up from the pastry bag. “The caramel will set if you keep hovering.” “Wouldn’t dream of hovering.” Grant leaned on the display case, eyes flicking to the door. “I came straight from the diner.” “Uh-huh. And yet you have that look.” “It’s not a look.” He pulled a folded note from his pocket, smoothing it against his palm. “It’s a… problem.” Lena finally met his gaze. “Drop the dramatic pause. What happened?” “Mrs. Pritchard’s been asking questions again.” Grant tapped the note. “She thinks someone tampered with the church raffle ledger. The one that was locked in the back office.” Lena’s fingers stilled. The bag hissed softly as she squeezed. “Tampered? In a town where everybody knows your sourdough starter?” “She says she saw a man in a gray cap near the office at closing.” Lena set the bag down, slow and careful. “Gray cap. Like the one that keeps coming in for day-old cannoli and never buys the coffee.” Grant exhaled. “So you’ve noticed.” “Of course I’ve noticed.” Lena reached for a fresh tin and slid it toward him. “Take these to the ladies’ group. Then tell me exactly what you heard at the diner.”
created_at
"2026-03-18T10:56:35.045Z"
3/18/2026, 10:56:35 AM
model
"openai/gpt-5.4-nano-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1050
moderation_latency
(null)
generation_time
2179
tokens_prompt
81
tokens_completion
381
native_tokens_prompt
67
native_tokens_completion
324
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0004184
router
(null)
provider_responses
0
endpoint_id
"0c835f2e-c18d-4e8c-b245-e1e3bd08b97f"
id
"resp_092fd5dc4768509b0169ba84e3192c8197951d9478f5c0ac8a"
is_byok
false
latency
169
model_permaslug
"openai/gpt-5.4-nano-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773831395-MskIuHGLwo1nOgyvG0Dn"
upstream_id
"resp_092fd5dc4768509b0169ba84e3192c8197951d9478f5c0ac8a"
total_cost
0.0004184
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
60.65%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"Lena said"