NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM07NJK1YJGM01W78C9JMWQB
Run Details
Model:
GPT-5.4 Mini (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5.5s
Cost:
$0.004442
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0004%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Tell me again why we’re casing a children’s party,” Molly whispered, peering through the rain-streaked window of the community hall. “Because,” Bill said, squinting at the row of wrapped boxes under the balloon arch, “that is where the presents are.” Molly folded her arms. “That’s the worst sentence you’ve ever said, and you once called a vault ‘a metal suggestion box.’” Bill adjusted his hat, which was both too small and suspiciously festive. “Think of the opportunity. No alarms, no guards, just cupcakes, tissue paper, and people who think glitter counts as security.” “Children are not guards.” “No, but they are unpredictable. That’s practically the same thing.” Molly watched a small clown wobble past the window, nearly colliding with a stack of paper plates. “I still think this is beneath us.” “Beneath us?” Bill hissed. “Molly, we are thieves. Our brand is beneath things.” “Our brand,” she repeated, “is usually bank safes and museum wings and the occasional smug antique dealer. Not birthdays.” Bill leaned closer, lowering his voice with theatrical gravity. “Listen carefully. Every present in there is a mystery box. Some could be socks. Some could be robots. Some could be money in a card, which is just a festive loophole.” Molly made a face. “And what exactly is the plan, Professor Presents?” Bill pulled a folded diagram from his coat. “You go in as a balloon delivery woman. I enter as the entertainment.” “You?” Molly said. “I contain multitudes.” “You contain a ukulele and a low-quality pirate costume.” “Those are tools.” Molly sighed. “What’s the objective again?” “Simple. Sweep the gift table, replace the boxes with these empty lookalikes, and leave before the children notice anything is wrong.” Molly stared at the stack of false presents in the trunk of their car. “Bill, these are wrapped in newspaper.” “Resourceful.” “They say ‘Property Tax Assessment’ on the side.” “Educational.” She pinched the bridge of her nose. “And what happens when someone opens one?” Bill brightened. “That’s the beauty of it. We’re not taking the toys. Just the wrapping. By morning, there’ll be chaos.” Molly blinked. “You want to steal the presents and leave the contents?” “Think bigger. The contents are bulky. The wrapping is the real value. Do you know how much people pay for limited-edition dinosaur paper?” “You are impossible.” “I’m visionary.” A burst of music thumped from inside, followed by a chorus of tiny voices shrieking the chorus to a song that sounded like it had been invented by a caffeinated squirrel. Molly winced. “Fine,” she said. “We do it your way. But if this turns into a moral lecture from a six-year-old, I’m blaming you.” Bill smiled. “Perfect. You distract the party with your balloon-woman charisma. I’ll mingle, charm the adults, and make like a man with excellent timing.” “You? Charming adults?” “I can be charming.” “You once introduced yourself to a trustee as a ‘prince of consequences.’” “He remembered me, didn’t he?” Molly looked back at the hall, then at the rain, then at Bill. “One rule.” “Only one?” “No stealing anything with a name tag on it.” Bill nodded solemnly. “I am a professional.” “You are a catastrophe in a waistcoat.” “A stylish catastrophe.” She opened the car door. “If we get caught, I’m telling everyone this was your idea.” Bill grabbed the box of fake gifts and grinned. “If we succeed, Molly, we are geniuses.” They stepped toward the glittering entrance together, two shadows approaching a room full of confetti, cake, and unattended treasure, while inside the hall a child’s voice rang out, “Who’s ready for presents?” and Bill whispered, delighted, “That,” he said, “is the sound of opportunity.”
created_at
"2026-03-18T10:25:40.465Z"
3/18/2026, 10:25:40 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3576
moderation_latency
(null)
generation_time
4558
tokens_prompt
77
tokens_completion
1212
native_tokens_prompt
66
native_tokens_completion
976
native_tokens_completion_images
0
native_tokens_reasoning
129
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0044415
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_050afeae73c061dc0169ba7da484cc8196af89f52b8197bb45"
is_byok
false
latency
176
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829540-RxMlzCZ2BkKND41pLoZN"
upstream_id
"resp_050afeae73c061dc0169ba7da484cc8196af89f52b8197bb45"
total_cost
0.0044415
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
607
0.00%
Dialogue to Total Word Ratio
Ratio: 65.25%, Deviation: 15.25%
neededClean
false
wordsTotal
613
wordsDialogue
400
0.0004%