NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM0994WR11QX6282YP9HZH1Z
Run Details
Model:
GPT-5.4 Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
6.1s
Cost:
$0.004878
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
3.05%
Dialogue to Total Word Ratio
(details)
1.5253%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the party flyer across the hood of the getaway van and squinted at the glittery letters. “This is a crime scene in crayon,” she said. Bill adjusted his black gloves and leaned in. “It’s a children’s party. Everything is a crime scene in crayon.” The flyer showed balloons, streamers, a clown with a suspiciously cheerful grin, and, in a corner, the words PRESENT TABLE. Bill tapped that with a fingernail. “There. That’s the vault.” “If the vault contains glitter glue, I’m leaving,” Molly said. “Focus,” Bill whispered, though they were parked in an alley behind a bakery and no one was within twenty yards except a suspicious pigeon. “We go in at three, blend with the crowd, and walk out with the gifts.” Molly looked him up and down. He wore a fake mustache, a striped shirt, and shorts with little sharks on them. “Blend?” “I’m going as a party dad.” “You look like a party uncle who was asked to leave.” Bill shrugged. “Children trust flaws. It’s the perfection that scares them.” Molly checked the list she’d scribbled on a napkin. “We have disguises, bags, and a map of the house.” “That’s not a map,” Bill said. “It’s a napkin with arrows.” “It points to ‘cake’ six times.” “That’s because cake is everywhere.” From inside the house came a burst of shrieking laughter and a trumpet blast. Molly peered through the hedge. “How many kids are there?” Bill counted the helium balloons tied to the mailbox. “Enough to form a small government.” A parent in the yard was bending over a table, wrestling a piñata shaped like a unicorn. Another was setting out paper cups. “We need a distraction,” Molly said. “Something loud, harmless, and ideally something that makes adults stop paying attention.” Bill grinned. “I’ve got just the thing.” He reached into the van and produced a remote control covered in tape and a label reading DO NOT PRESS IF YOU HAVE A SOUL. Molly stared at it. “You’ve been carrying that around?” “For months,” Bill said. “It’s for emergencies.” “This is a children’s party.” “Exactly.” He pressed the button. Across the street, a delivery drone began blaring a tinny version of Happy Birthday while circling a lamppost and dropping leaflets for a plumbing company. Every head in the yard turned. Molly blinked. “That’s your emergency?” “It’s multi-purpose.” The adults rushed to admire the drone, which promptly snagged itself in a tree and began reciting coupon codes. Bill opened the van door. “Now!” They slipped through the side gate and into the garden, where a table groaned under the weight of wrapped presents in every color imaginable. One was shaped like a dinosaur. One was enormous and covered in stars. One had so much tape it looked medically sealed. Molly whispered, “We take the easy ones first.” Bill pointed at the dinosaur. “That one looks heavy.” “So?” “So I like a challenge.” They began loading bags with gifts, moving briskly, like professionals with a moral issue. From the kitchen window, a child in a paper crown shouted, “Who are those people?” Bill crouched beside a stack of boxes and called out in a nasal voice, “We’re the present inspectors!” Molly hissed, “There is no such thing.” “There is now.” A small boy frowned. “Why are you stealing birthday presents?” Molly froze. “We are not stealing. We are—” Bill smoothly cut in. “Relocating them. For safety.” The boy considered this with the grave suspicion only children can manage. Then he pointed to a box shaped like a robot. “That one’s for me. It has batteries.” “We’ll be very careful with it,” Molly said. At that moment, the clown from the flyer emerged from behind the cake table carrying a plate of frosting. He looked straight at their bulging bags. Bill whispered, “Plan B.” “There is no Plan B.” “There is now.” The clown smiled, wiped his mouth, and said, “You two are terrible at this.” Molly clutched a gift bag to her chest. “You’re not supposed to be talking.” “I’m the birthday uncle,” the clown said. “I know when people are stealing presents. It’s the same face adults make at family weddings.” Bill frowned. “Then why haven’t you shouted?” The clown looked toward the children, who were now distracted by the drone, the wrecked piñata, and a dog wearing streamers. “Because,” he said, “I hid the really good presents in the laundry room.” Molly and Bill stared at each other. Bill lowered his bag. “You’re kidding.” The clown shrugged. “I’m not a clown. I’m the father.” Molly exhaled slowly. “This party has become much more criminal than we planned.” Bill already had the laundry room key in his hand. “Agreed. Let’s steal them before somebody decent does.”
created_at
"2026-03-18T10:53:50.374Z"
3/18/2026, 10:53:50 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
508
moderation_latency
(null)
generation_time
5889
tokens_prompt
77
tokens_completion
1350
native_tokens_prompt
66
native_tokens_completion
1073
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.004878
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_0279fef69ee7a6c70169ba843e74248194ba005e3229285a53"
is_byok
false
latency
265
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773831230-h3K6uGgomOkKIVRfQjON"
upstream_id
"resp_0279fef69ee7a6c70169ba843e74248194ba005e3229285a53"
total_cost
0.004878
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
790
3.05%
Dialogue to Total Word Ratio
Ratio: 38.35%, Deviation: 8.35%
neededClean
false
wordsTotal
790
wordsDialogue
303
1.5253%