NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM09Q7TFG5T47AJQA0PN8HJ5
Run Details
Model:
GPT-5.4 Mini (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5.6s
Cost:
$0.004559
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a hedge shaped like a regrettable sheep and peered through the lace of branches at the house across the street. Balloons bobbed above the gate in bright, irresponsible colors. Somewhere inside, a party was already in full swing, a chorus of shrieking children colliding with music that sounded as if it had been chosen by someone who feared silence. She adjusted the black gloves she was wearing purely for effect. Bill unfolded a paper map on the garden bench, though the house was so close he could have pointed at it with a sigh. “So,” he said, “the objective is presents. Specifically, all the presents.” “Correct,” Molly whispered. “That is what ‘all’ means.” Bill squinted at the house. “And the children?” Molly gave him the kind of look usually reserved for men who’d tried to sharpen a butter knife. “We are not stealing the children, Bill. We’re stealing what the children have not yet ruined.” “That’s a relief,” he said. “I was picturing handcuffs, paperwork, and questions about my motives.” “They would ask fewer questions if you’d stop looking like a man who negotiates with ducks.” Bill touched his coat, wounded. “It’s a respectable coat.” “It has crumbs in the lapels.” “I’m a working criminal.” From the house came a roar of delight, followed by the unmistakable crinkling sound of tearing paper. Molly smiled. “Hear that? The first layer has come off. The gift table will be lightly guarded now. Parents are distracted, children are sticky, and the host is probably pretending not to cry in the kitchen.” Bill peered through the hedge. “What’s the angle?” “The angle,” said Molly, “is this. We enter through the side gate during the magician’s balloon segment. He’ll have all eyes on him, because he just made a rabbit appear in a hat and children will forgive almost anything that resembles wonder. We move to the conservatory, where the presents are stacked. We use the cake trolley as cover.” Bill frowned. “There is a cake trolley?” “There is always a cake trolley.” “I love your confidence,” he said. “It’s like a lantern. A dangerous lantern.” Molly leaned closer. “Listen carefully. The presents are not to be opened on site. That is amateur behavior and psychologically messy. We load them into the laundry baskets, transfer them to the van, and leave before anyone realizes the pile under the paper snow has developed legs.” Bill nodded slowly. “And if someone sees us?” “Then you are my uncle.” “I don’t look like an uncle.” “You look like a man who would explain a train schedule to a pigeon. You can be a very specific uncle.” He considered this with professional dignity. “What’s my name?” “Uncle Bernard.” “Too stern.” “Uncle Neville.” “Worse.” “Uncle Bill.” “That sounds suspiciously like me.” Molly shrugged. “Then try not to answer to it.” A burst of music spilled from the open window, followed by a dozen children chanting for the birthday cake. The sound made Bill flinch. “They’re loud.” “They’re young,” Molly said. “It’s the same thing with less accounting.” Bill rolled up the map. “All right. We go in, nick the gifts, vanish into the evening, and no one gets hurt.” Molly checked the watch on her wrist, though she had bought it from a stall that also sold fake pearls and lucky teeth. “Exactly. Elegant, humane, and festive.” “And if there are embarrassing presents?” “We don’t judge,” she said. “We steal with dignity.” Bill straightened his collar. “That’s the spirit.” “Remember,” Molly said, slipping from the hedge, “we are invisible. We are patient. We are, above all, just another pair of relatives with poor boundaries.” Bill followed her toward the gate as the magician shouted, “And now, children, the rabbit is going back in the hat!” Molly smiled in the dimming light. “Perfect timing,” she murmured. “Let’s rob Christmas’s younger cousin.”
created_at
"2026-03-18T11:01:32.125Z"
3/18/2026, 11:01:32 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1370
moderation_latency
(null)
generation_time
5196
tokens_prompt
77
tokens_completion
1108
native_tokens_prompt
66
native_tokens_completion
1002
native_tokens_completion_images
0
native_tokens_reasoning
139
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0045585
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_0ed4a8bc220c8bc50169ba860c30ec8193aebb5ceb502da802"
is_byok
false
latency
242
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773831692-juXAGegO8tRHLascP5QE"
upstream_id
"resp_0ed4a8bc220c8bc50169ba860c30ec8193aebb5ceb502da802"
total_cost
0.0045585
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
643
0.00%
Dialogue to Total Word Ratio
Ratio: 59.72%, Deviation: 29.72%
neededClean
false
wordsTotal
643
wordsDialogue
384
0.0000%