NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM0AK52239SJAXXTM6KZ5R4E
Run Details
Model:
GPT-5.4 Nano
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
2.5s
Cost:
$0.000576
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
36.79%
Dialogue to Total Word Ratio
(details)
18.3940%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Molly, tell me you packed the glitter-proof duct tape.” “I did, Bill. It’s called ‘party decor,’ and it sticks better to wrapping than your confidence.” “Hey, my confidence is solid. Like that aunt’s chandelier.” “Focus. We’re breaking into a children’s party. The guards are nine-year-olds with juice boxes.” “Easy. We’ll distract them with a diversion.” “A diversion? What kind of diversion?” “The classic: pretend to be clowns.” “You can barely juggle your own lies.” “Juggling is advanced lying. Okay, plan: I slip in through the side gate.” “You hate side gates.” “I hate them aesthetically. But I can do it. You climb the fence.” “I’m not climbing anything. I’m ‘coincidentally ascending.’” “Great. Ascend. Once inside, we steal all the presents before the cake gets cut.” “The cake is the real distraction. While everyone stares at frosting, we snatch presents like sugar thieves.” “Wow, poetic. Now tell me about the laser grid.” “There is no laser grid.” “There’s always a laser grid.” “Molly, it’s a birthday party.” “They love security theater. Like the clown with the squeaky shoes.” “Fine. If there’s a laser grid, I’ll compliment it until it confesses.” “You can’t flirt with a laser.” “I can. Lasers respond to intensity.” “Just don’t get caught by the balloons.” “Balloons don’t catch people.” “Balloons float.” “Exactly. If they catch anything, it’s your ego.” “Okay, clock starts at ‘Happy Birthday.’ Ready?” “As ready as we’ll ever be to steal joy.” “Joy is plural. So are the gifts.” “Then let’s kidnap the Christmas of this week.” “Agreed. Three… two…—wait, Bill, what if the presents are for someone with a dog?” “Then we steal the dog’s heart too.” “Murderous romantic. Go.”
created_at
"2026-03-18T11:16:46.801Z"
3/18/2026, 11:16:46 AM
model
"openai/gpt-5.4-nano-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
449
moderation_latency
(null)
generation_time
2363
tokens_prompt
77
tokens_completion
535
native_tokens_prompt
66
native_tokens_completion
450
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0005757
router
(null)
provider_responses
0
endpoint_id
"0c835f2e-c18d-4e8c-b245-e1e3bd08b97f"
id
"resp_07d47038240163010169ba899eda18819080cb28201da4d50f"
is_byok
false
latency
187
model_permaslug
"openai/gpt-5.4-nano-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773832606-UyqT2nw66H4PHmlMhk3L"
upstream_id
"resp_07d47038240163010169ba899eda18819080cb28201da4d50f"
total_cost
0.0005757
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
277
36.79%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 10.00%
neededClean
false
wordsTotal
280
wordsDialogue
280
18.3940%