NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1HNKNH74T98ZR4P8K4GYNF
Run Details
Model:
GPT-5.4 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
23.6s
Cost:
$0.012585
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
4.21%
Dialogue to Total Word Ratio
(details)
2.1067%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind the hedge of Number Twelve, a hedge so aggressively trimmed it looked as if it had military aspirations. Beyond it, the back garden blazed with bunting, balloons, and the shrill cheerfulness of a dozen children who had not yet learned disappointment. A magician in a purple waistcoat was pulling handkerchiefs from his sleeve with the despairing air of a man trying to retrieve his own dignity. On the patio stood the gift table. It was magnificent. Boxes in shiny paper rose in a wobbling mountain beside a pink cake shaped like a unicorn suffering from hay fever. Curly ribbons twitched in the breeze like little beckoning fingers. Molly adjusted her cap and peered through the leaves with the solemnity of an astronomer discovering a greedy new planet. “Tell me again,” Bill whispered, “why we’re robbing a birthday party and not, say, a bank?” “Because banks have guards,” Molly said. “This has a clown and a woman dressed as a princess who’s smoking behind the shed.” Bill followed her gaze. “Fair.” A child screamed with delight as the magician produced a rubber chicken from someone’s ear. The adults applauded with the glazed expressions of hostages instructed to appear relaxed. Molly unrolled a hand-drawn map on her knee. It was on the back of a pizza leaflet and smelled faintly of anchovies. “The objective,” she murmured, tapping the page, “is the present pile. Entry point: side gate. Obstacle one: bouncy castle. Obstacle two: Labrador in a bow tie. Obstacle three: attentive grandmother.” Bill frowned. “The dog I can handle. Grandmothers are unpredictable.” “Exactly why I’ve assigned you distraction.” He looked wounded. “Why do I always get distraction?” “Because you have the face of a man to whom unfortunate things naturally happen. People gather.” From the lawn came a burst of music and the thump of tiny feet. A game had begun. Children ran in circles around a row of chairs with the ruthless intensity of stockbrokers. Bill watched them, impressed. “No honor among the under-fives.” “Keep focused,” Molly said. She tucked a strand of hair behind one ear and surveyed the battlefield. “When the cake arrives, all eyes will go to the candles. You limp in from the side gate, clutch your chest, and mutter something alarming about wasps.” “Wasps?” “People can’t resist communal panic. While they’re herding the children and arguing over antihistamines, I sweep the presents into the laundry basket by the fence.” Bill nodded slowly. “And if someone asks who I am?” “Say you’re from entertainment.” “That explains surprisingly much.” The Labrador trotted into view wearing a satin bow tie and the expression of a minor aristocrat. Bill sighed. “He knows. Look at him. That dog knows.” “He’s a Labrador, Bill, not a detective.” The dog sat down and stared straight through the hedge, disappointed already. From the patio, a woman’s voice rang out, “Five minutes till cake!” The children responded as if informed of rescue ships. Molly’s eyes lit up. “This is it.” Bill swallowed. “Right. One question.” “What?” “What if there are books in the presents?” She turned to him. “Then we fence them to clever people.” “And if they’re educational toys?” “We take the batteries and leave the shame behind.” Bill grinned despite himself. “You really have thought of everything.” “No,” said Molly, folding the map. “Just the important parts.” She rose into a crouch, poised like a cat with criminal intent. Bill straightened his jacket, took a deep breath, and prepared to become a public emergency. In the garden, the magician bowed, the clown tripped over a hose, and the grandmother near the gate narrowed her eyes, as if she could smell nonsense on the wind. Molly smiled. At last, she was working with professionals.
created_at
"2026-03-06T12:23:59.928Z"
3/6/2026, 12:23:59 PM
model
"openai/gpt-5.4-20260305"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
449
moderation_latency
(null)
generation_time
23588
tokens_prompt
77
tokens_completion
1059
native_tokens_prompt
66
native_tokens_completion
828
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.012585
router
(null)
provider_responses
0
endpoint_id
"9ff5625c-403f-4d7f-b895-58ac7295062c"
id
"resp_0f0ceb01e21960090169aac75ffd588196be5b8391c3668479"
is_byok
false
latency
176
model_permaslug
"openai/gpt-5.4-20260305"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799839-7vqLzCFhELbbdZSBTEHt"
upstream_id
"resp_0f0ceb01e21960090169aac75ffd588196be5b8391c3668479"
total_cost
0.012585
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
625
4.21%
Dialogue to Total Word Ratio
Ratio: 37.96%, Deviation: 7.96%
neededClean
false
wordsTotal
627
wordsDialogue
238
2.1067%