NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM07TTPVZYJYJ6XX3YEJC88V
Run Details
Model:
GPT-5.4 Mini (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5.5s
Cost:
$0.004590
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
0.0028%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van sat crooked in the alley like a disgruntled shoe, its side panel reading PETER’S PET PARROTS in faded letters from a previous criminal career involving birds and poor decisions. Molly crouched beside the rear door, peering through a crack in the brickwork at the children’s party next door. Balloons bobbed over the garden fence. A clown, unfortunately, was already there, bending himself into a figure that looked suspiciously like a warning. Bill arrived with two paper cups of tea and the expression of a man who had recently discovered both regret and polyester. “Good news,” he said, handing her a cup. “Define good.” “The party is in full swing.” Molly squinted at him. “That’s the bad news, Bill. The good news is supposed to be something like ‘the adults are distracted’ or ‘the dog is asleep.’” Bill sipped his tea. “There is a dog. He’s awake. He hates me.” “Of course he does. You once tried to distract a terrier with a ham sandwich and a legal argument.” “He lost the ham.” Molly turned back to the fence. On the other side, children ran in circles with the kind of joyful velocity that made theft feel almost rude. A banner over the patio read HAPPY BIRTHDAY, LUCY! in giant glittering letters. “There’s our target,” Molly said. Bill nodded toward the mountain of wrapped gifts stacked beneath the gazebo. “That’s a lot of loot for one small human.” “She’ll survive.” “You say that now.” “She’s five, Bill. At five, every present is either a revelation or a sock. We’re doing her a public service.” Bill frowned. “By taking the presents?” “By redistributing them.” “To where?” Molly gave him a patient look. “Our flat.” “Oh. Right. The charitable bit.” A burst of shrieking rose from the garden. Someone had apparently handed the clown a microphone. Bill winced. “I hate children’s parties,” he said. “You hate all parties.” “I contain multitudes.” Molly checked her watch. “We go in on the piñata.” “The piñata?” “During the confusion. One good swing, sweets everywhere, children descend like tiny, sugar-fueled vultures, and nobody notices two sensible professionals transporting gifts.” Bill considered this. “That is the worst plan I’ve ever heard.” “It’s also the only plan.” He looked through the fence again. “What if they ask questions?” “About what?” “Why there are two adults carrying forty-seven wrapped boxes toward a van.” Molly smirked. “We’re entertainers. You’re the magician, I’m the assistant.” “I can’t do magic.” “You can do cynicism. Close enough.” Bill straightened his coat. “I’m not wearing the top hat.” “You’re wearing the top hat.” “I look like a disgraced traffic warden.” “Perfect. Children trust authority.” He groaned, but she could see he was already warming to the idea, which was always the danger with Bill. He complained like a man auditioning for sainthood, then committed to nonsense with the enthusiasm of a ferret on espresso. From the garden came a chant of, “Piñata! Piñata! Piñata!” followed by the wet thwack of a plastic bat and a chorus of delighted screams. Molly grinned. “Hear that? The opening act.” Bill set down his tea and reached for the sack. “All right, Molly. If we’re going to ruin a birthday, let’s do it with style.” “That’s the spirit.” “And if this goes wrong?” “It won’t.” “It always does.” She glanced at him, amused. “Bill, we’re thieves. Going wrong is the hobby.” Then the clown’s microphone squealed, the children yelled louder, and from the party came the first cascade of flying sweets. Molly and Bill slipped through the side gate wearing their best criminal confidence, which in Bill’s case looked mildly apologetic and in Molly’s looked like business. The presents waited under the gazebo, bright paper gleaming in the afternoon sun, innocent as sin and twice as tempting.
created_at
"2026-03-18T10:28:32.617Z"
3/18/2026, 10:28:32 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1316
moderation_latency
(null)
generation_time
5294
tokens_prompt
77
tokens_completion
1117
native_tokens_prompt
66
native_tokens_completion
1009
native_tokens_completion_images
0
native_tokens_reasoning
137
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00459
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_0fc56865cb22a20a0169ba7e50b0a48195914e5b4630f62a21"
is_byok
false
latency
195
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773829712-NQPKQWGNfm4Z0N8f48C1"
upstream_id
"resp_0fc56865cb22a20a0169ba7e50b0a48195914e5b4630f62a21"
total_cost
0.00459
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
630
0.01%
Dialogue to Total Word Ratio
Ratio: 43.99%, Deviation: 13.99%
neededClean
false
wordsTotal
632
wordsDialogue
278
0.0028%