NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM0AD96MDB0RHW5C67YCWRNK
Run Details
Model:
GPT-5.4 Mini (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
6.0s
Cost:
$0.004968
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
42.92%
Dialogue to Total Word Ratio
(details)
21.4589%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the parked van and peered through the rain-streaked windshield at the community hall. “Tell me again why we’re stealing from a children’s party.” Bill adjusted the fake mustache above his lip. “Because the insurance job fell through, the antique shop had a dog, and this one is unattended.” “Unattended by adults,” Molly said. “Children are not the same as unattended.” “They’re smaller,” Bill said. “That should help.” “That is the worst logic I have ever heard.” “Yet here we are.” Inside the hall, balloons bobbed against the ceiling, and the muffled shrieks of delighted children leaked through the walls in waves. A cardboard sign over the entrance read HAPPY BIRTHDAY, LEO! in glittery blue letters, and a pirate cutout leaned cheerfully against the door. Molly watched a grandmother in a sequined cardigan hand a tray of cupcakes to a woman in a ponytail. “You do realize,” she said, “that this is the kind of party where people notice if even one jelly bean goes missing.” Bill pulled on black gloves. “Then we don’t take jelly beans. We take the presents.” “Bill.” “What?” “Those are presents for children.” “That’s what makes them present presents.” Molly pinched the bridge of her nose. “We are not doing wordplay in front of a birthday party.” “We’re not in front of it. We’re in a van with terrible suspension.” She glanced back at the cargo area, where their equipment sat in neat, absurd rows: lock picks, a miniature crowbar, a grappling hook, three flashlights, and a cake knife Bill had insisted was “for morale.” “What exactly is your plan?” Bill smiled, and the expression made his mustache tilt slightly to the left. “Simple. We blend in.” “With the children?” “With the parents.” Molly stared at him. “You are wearing a leather jacket.” “Vintage.” “You have a mustache drawn on with mascara.” “Discreet mascara.” “And you’re carrying a sack labeled ‘TOTALLY NOT STOLEN.’” Bill looked down at the sack. “In fairness, that was your label.” Molly sighed. “We are not blending in. We are two criminals with the energy of a failed magician duo.” “That is unfair,” Bill said. “I have excellent stage presence.” “You once bowled while a man was actively yelling ‘stop that thief.’” “I did it gracefully.” A burst of laughter spilled from inside the hall, followed by the sound of a trumpet made by a child who had no business owning a trumpet. Molly watched an usher with a lanyard open the side door to carry in a stack of paper plates. “There,” Bill whispered. “Our way in.” “That is a side door.” “Yes.” “Guarded by a woman in practical shoes.” Bill nodded. “Practical shoes mean she’s focused on logistics, not crime.” “That is also terrible logic.” “It keeps working for me.” Molly reached for her hood. “If we’re doing this, we’re doing it cleanly. We go in, find the gift table, scoop up the presents, and leave before anyone notices.” Bill raised a finger. “We leave a decoy gift.” “No.” “A very nice decoy gift.” “No.” “A gift that says ‘To Leo, from Santa’ so they blame seasonal cross-contamination.” Molly unlatched the van door. “You should never be allowed near stationery.” Bill hopped out beside her. “Ready?” “No.” “Fantastic. That’s usually when you’re at your best.” They crossed the wet pavement toward the side entrance just as the woman in practical shoes stepped inside. Molly slipped in behind her, head down, while Bill followed with a cardboard box under his arm and the confidence of a man who had never once been invited to a party honestly. The hall was warm and bright, full of streamers and chaos. A long table near the stage groaned under piles of wrapped boxes, each one labeled with looping handwriting and stickers of dinosaurs, rockets, and smiling suns. Molly’s eyes widened despite herself. Bill whispered, “Look at that. A treasure hoard.” “Do not romanticize theft.” “I’m not. I’m appreciating the craftsmanship.” From across the room, a child in a paper crown screamed with joy as a magician made a rabbit appear from a hat. Molly blinked. “There’s a magician.” Bill froze. “That is deeply unhelpful.” The magician, a man with a silver vest and the haunted eyes of someone who had lost a war to balloon animals, locked eyes with them and gave the tiniest, most exhausted nod imaginable. Molly whispered, “Does he know?” Bill whispered back, “I think he knows everything.” “Then we’re doomed.” “Not doomed,” Bill said, straightening his jacket. “Pressured.” “Bill, that is not better.” But he was already walking toward the gift table with a grin, and Molly, against all sense and judgment, followed.
created_at
"2026-03-18T11:13:34.434Z"
3/18/2026, 11:13:34 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1063
moderation_latency
(null)
generation_time
5773
tokens_prompt
77
tokens_completion
1361
native_tokens_prompt
66
native_tokens_completion
1093
native_tokens_completion_images
0
native_tokens_reasoning
44
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.004968
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_0aeb1d34ff78f3990169ba88de840881939f5d96001aab2614"
is_byok
false
latency
249
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773832414-lrHI9xLAZECuWUXvGnJl"
upstream_id
"resp_0aeb1d34ff78f3990169ba88de840881939f5d96001aab2614"
total_cost
0.004968
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
776
42.92%
Dialogue to Total Word Ratio
Ratio: 45.89%, Deviation: 4.11%
neededClean
false
wordsTotal
778
wordsDialogue
357
21.4589%