NC Bench
Run: 2026-04-24-01KQ0NB79B6F10QG7WP40K4HFZ
Run Details
Model: GPT-5.5 (Reasoning, Low)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 27.3s
Cost: $0.030990
Provider: OpenAI
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.01%
Overall: 0.0039%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the hydrangeas, peering through pink balloons at the garden beyond. A banner reading HAPPY 7TH BIRTHDAY, LUCY flapped above a mountain of wrapped presents. Bill adjusted his fake mustache. It immediately fell into the punch bowl beside him. “Bill,” Molly whispered, “why is your mustache swimming?” “It’s hydrating.” “It’s evidence.” “It’s a disguise with needs.” Molly sighed and unfolded a crayon-drawn map. “Right. Operation Party Parcel. We enter through the bouncy castle, cross the lawn during Musical Statues, blend in near the magician, then load the gifts into the clown van.” “Your clown van.” “Our clown van.” “It says Giggles McSprinkles on the side.” “That’s called authenticity.” “It also honks when you brake.” “That’s called morale.” Bill squinted over the hedge. “Are we absolutely sure about this target? Children are unpredictable. Last week one called me ‘sir’ and handed me a worm.” “We are professionals.” “We stole a cheese wheel from a wake.” “And nobody cried.” “The priest chased us.” “He wanted a slice.” A burst of laughter rose from the garden as a magician pulled a ribbon from his sleeve and accidentally knocked over a tray of cupcakes. Bill pointed. “That man is competition. Look at him. Distracting the crowd with wonder while standing within arm’s reach of the gift table. Textbook.” “Then we move before he does.” “What if he’s not a magician? What if he’s security?” “Bill, security guards don’t wear capes covered in rabbits.” “Exactly what security wants you to think.” Molly tucked the map into her belt. “Listen carefully. The birthday girl’s mother is in the kitchen. The father is arguing with the barbecue. The grandparents are asleep in lawn chairs. We have a six-minute window.” “Seven, if the barbecue wins.” A small boy in a pirate hat suddenly appeared on the other side of the hedge. He stared at them. Molly froze. Bill waved. “Ahoy.” The boy narrowed his eyes. “Are you grown-ups?” “No,” Molly said. 
“Then why are you so tall?” “Vegetables.” “I don’t like vegetables.” “That’s why you’re short,” Bill said. The boy considered this, then pointed at Molly’s black clothes. “Are you ninjas?” Molly smiled. “Birthday ninjas.” “Can you do a flip?” “No.” “Bad ninjas.” Bill leaned toward Molly. “We’ve been compromised by Captain Judgment.” Molly pulled a wrapped parcel from her bag and handed it over. “Here. A present.” The boy shook it. Something rattled. “What is it?” “Treasure.” “It says ‘bath salts’ on it.” “Treasure for tired people.” He ran off, shouting, “Mum! There are bad ninjas in the bushes!” Molly and Bill stared at each other. “Six-minute window,” Bill said, “has become a six-second window.” “Abort?” “Never. Adapt. Improvise. Panic elegantly.” They slipped through the side gate and collided with a woman carrying a tray of sausage rolls. “Oh!” she said. “You must be the entertainers.” Molly blinked. “Yes.” Bill bowed. “We are the bad ninjas.” “Wonderful. The children are just by the gift table. You’re on in two minutes.” Molly grabbed Bill’s sleeve as they were swept onto the lawn. Twenty children turned to stare. The magician looked relieved. A girl in a glittery crown stepped forward. “Are you here to do flips?” Bill straightened. “We are here to teach you the ancient ninja art of… present protection.” Molly hissed, “What are you doing?” “Living.” He pointed dramatically at the gift pile. “Rule one: never leave presents unattended.” The children gasped. “Rule two,” Molly said slowly, catching on, “always form a protective circle.” The children linked arms around the presents. Bill nodded. “Excellent. Rule three: if suspicious adults approach, shout very loudly.” “Suspicious adults!” Lucy shouted, pointing at them. The circle erupted. “SUSPICIOUS ADULTS!” Parents turned. The father dropped his barbecue tongs. The magician applauded nervously. Molly raised both hands. 
“And that concludes the lesson.” Bill whispered, “We have accidentally invented community service.” Lucy beamed. “Best party ever.” Her mother pressed cash into Molly’s hand. “You were brilliant. Same time next year?” Molly looked at Bill, then at the guarded fortress of presents. Bill shrugged. “We could go legitimate.” Molly pocketed the money. “One party at a time.”
created_at
"2026-04-24T21:11:46.74Z"
4/24/2026, 9:11:46 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5134
moderation_latency
(null)
generation_time
27107
tokens_prompt
77
tokens_completion
1451
native_tokens_prompt
66
native_tokens_completion
1022
native_tokens_completion_images
0
native_tokens_reasoning
30
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.03099
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_076acb5224ba6bd80169ebdc92eaf88193b52b80ed97e333ea"
is_byok
false
latency
420
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777065106-PyMN0lpHFqfO3QLYMPYs"
session_id
(null)
api_type
"completions"
id
"gen-1777065106-8kAaHubFfivdlSiLH2Et"
upstream_id
"resp_076acb5224ba6bd80169ebdc92eaf88193b52b80ed97e333ea"
total_cost
0.03099
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
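The token counts and total_cost above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the per-million-token rates (5.00 USD input, 30.00 USD output) are assumptions chosen because they reproduce the reported total, not values stated anywhere in this run record, and the helper name estimate_cost is hypothetical.

```python
# Assumed per-million-token rates in USD. These are NOT part of the run record;
# they are illustrative values that happen to reproduce the reported total_cost.
ASSUMED_INPUT_RATE_PER_M = 5.00
ASSUMED_OUTPUT_RATE_PER_M = 30.00


def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  input_rate: float = ASSUMED_INPUT_RATE_PER_M,
                  output_rate: float = ASSUMED_OUTPUT_RATE_PER_M) -> float:
    """Return an estimated cost in USD from token counts and per-million rates."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


# Values copied from the metadata above.
native_tokens_prompt = 66
native_tokens_completion = 1022
reported_total_cost = 0.03099

estimated = estimate_cost(native_tokens_prompt, native_tokens_completion)
print(f"estimated: {estimated:.5f}  reported: {reported_total_cost:.5f}")
```

Under these assumed rates the estimate agrees with the reported figure to five decimal places. The record does not say whether the reasoning tokens are billed inside native_tokens_completion, so the sketch treats the completion count as the full billable output.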
Evaluation details
Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean false, words 684

Result: 0.01%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 56.25%, Deviation: 13.75%
Meta Data: neededClean false, wordsTotal 688, wordsDialogue 387

Overall: 0.0039%
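The ratio and deviation figures in the evaluation details follow from the reported word counts. The sketch below shows one way such a measurement could be made. It assumes whitespace-delimited words and dialogue enclosed in curly quotes, which is a guess at the method and not the benchmark's actual evaluator. The two evaluators above report different totals (684 and 688), so the real word-splitting rules are not known, and the function names here are hypothetical.

```python
import re

TARGET_RATIO = 0.70  # from the scenario: 70% dialogue


def count_words(text: str) -> int:
    """Count whitespace-delimited tokens (an assumed definition of a word)."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside curly double quotes, the style used in the response."""
    spans = re.findall(r"“([^”]*)”", text)
    return sum(count_words(span) for span in spans)


def dialogue_ratio(text: str) -> float:
    """Fraction of all words that fall inside quoted dialogue."""
    total = count_words(text)
    return dialogue_words(text) / total if total else 0.0


def deviation(ratio: float, target: float = TARGET_RATIO) -> float:
    """Absolute distance between the measured ratio and the target."""
    return abs(target - ratio)


# A short excerpt from the model response, used only to exercise the functions.
sample = "Bill waved. “Ahoy.” The boy narrowed his eyes. “Are you grown-ups?”"
print(dialogue_words(sample), count_words(sample))

# Reproduce the reported figures from the reported counts.
reported_ratio = 387 / 688
print(f"ratio: {reported_ratio:.2%}  deviation: {deviation(reported_ratio):.2%}")
```

With the reported counts, 387 dialogue words out of 688 total gives 56.25%, which is 13.75 percentage points from the 70% target and matches the Details line. How that deviation is converted into the 0.01% score is not shown on this page, so the sketch stops at the deviation.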