NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0PJ6VVFJ2BNDHYQD8ZTGQ8
Run Details
Model:
GPT-5.5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
26.6s
Cost:
$0.034980
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
95.22%
Dialogue to Total Word Ratio
(details)
47.6084%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread a napkin-map across the café table, weighing down one corner with a doughnut and the other with Bill’s guilty-looking elbow. Outside, rain polished the pavement; inside, Bill polished off his third éclair. “Focus,” Molly said. “At four o’clock, the children’s party begins. Balloons, cake, shrieking. At four-oh-five, we enter.” “As what?” Bill asked. “As respectable adults.” Bill stared at her. “Fine. As slightly respectable adults.” “That’s better,” Bill said. “I can’t act respectable under pressure. Last time, I saluted a coat rack.” Molly tapped the napkin. It showed a lopsided house, a garden, and something labelled PRESENT MOUNTAIN in aggressive red ink. “The presents will be here,” she said. “Stacked beside the jelly table.” Bill shivered. “Jelly. Wobbly sentry of the dessert world.” “You distract the children.” “With what?” “Balloon animals.” Bill looked wounded. “Molly, I can make three balloon shapes: snake, worried snake, and regrettable sausage.” “Perfect. Children have low standards when screaming.” At the next table, an old woman lowered her newspaper. Molly smiled at her with the innocent warmth of a person definitely not discussing grand larceny beneath a bakery awning. Bill waved. The old woman narrowed her eyes and resumed reading. Molly leaned closer. “While you entertain them, I approach the gifts.” “And then?” “And then I gather them into the sack.” Bill patted the canvas bag at his feet. It had HAPPY TURNIP FARMS printed on the side. “This sack says vegetables,” he said. “Exactly. No one suspects a turnip.” “Everyone suspects a turnip,” Bill said. “They’re sinister.” Molly ignored this. “We leave calmly.” “With all the presents?” “With all the presents.” Bill’s face twisted. “All of them?” “That is usually what ‘all’ means.” “What about the small ones with bows? They’re probably socks. We don’t need socks.” “We are professionals.” “Professional sock thieves?” “Professional opportunity specialists.” Bill sighed and nudged the doughnut off the napkin. It rolled into the ashtray with tragic grace. “Molly,” he said, “how old are these children?” “Six.” “Six?” Bill clutched his chest. “That’s tiny. That’s practically decorative.” “They’ll recover.” “From losing presents?” “They’ll have cake.” “Cake is not a legal substitute for a dinosaur puzzle.” Molly paused. “How do you know there’s a dinosaur puzzle?” Bill looked away. “There’s always a dinosaur puzzle.” The rain ticked harder against the glass. A party horn squeaked somewhere down the street, followed by a chorus of delighted shrieks. Molly folded the napkin slowly. “We’ve stolen from bankers,” Bill said softly. “From taxidermists. From that magician who deserved it.” “He did deserve it.” “But children?” Bill shook his head. “They smell like biscuits and hope.” Molly groaned. “You’re going noble on me.” “I’m going practical. Children bite.” She considered this. From the direction of the party came a crash, then a high voice yelling, “I AM THE DRAGON KING!” Bill pointed. “See? Armed monarchy.” Molly slipped the napkin into her coat. “Fine. New plan.” “We go home?” “No. We break into the party.” Bill winced. “And steal nothing.” He brightened cautiously. “Nothing?” “We replace the presents with better ones.” Bill blinked. “That’s not theft.” “It’s reverse theft.” “That’s charity with a hat on.” Molly stood, snatching up the turnip sack. “Call it whatever you like. We’re still wearing disguises.” Bill grinned. “Can I be a pirate?” “No.” “A dragon king?” “Absolutely not.” Five minutes later, Molly and Bill marched toward the shrieking house carrying a sack full of toy dinosaurs, glitter pens, and one packet of emergency socks. Bill wore a paper crown he had stolen from the café condiment basket. “Respectable adults,” Molly muttered. “Slightly respectable,” Bill corrected, and rang the bell.
created_at
"2026-04-24T21:33:04.258Z"
4/24/2026, 9:33:04 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
8212
moderation_latency
(null)
generation_time
25631
tokens_prompt
77
tokens_completion
1328
native_tokens_prompt
66
native_tokens_completion
1155
native_tokens_completion_images
0
native_tokens_reasoning
262
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.03498
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_048bb25aa98a8d8b0169ebe1905eec8193be8e34dfec929229"
is_byok
false
latency
316
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777066384-Va2EXO3uOBpN50nteqLa"
session_id
(null)
api_type
"completions"
id
"gen-1777066384-EqAULgXayeuSt9NoWxwV"
upstream_id
"resp_048bb25aa98a8d8b0169ebe1905eec8193be8e34dfec929229"
total_cost
0.03498
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
602
95.22%
Dialogue to Total Word Ratio
Ratio: 50.99%, Deviation: 0.99%
neededClean
false
wordsTotal
606
wordsDialogue
309
47.6084%