NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0NQZYVC00DF0MMSQ6NZXF1
Run Details
Model:
GPT-5.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
29.8s
Cost:
$0.032610
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0001%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unfolded a crayon-smeared map of the community hall across the dashboard of the ice cream van. Bill squinted at it over the top of his novelty sunglasses. “Why is the vault labelled ‘Bouncy Castle’?” “Because, Bill, it’s where the presents are guarded.” “By toddlers?” “By parents. Worse.” Bill shuddered. “Parents have handbags full of tissues and rage.” “And snacks,” Molly said. “Never forget snacks.” Outside, balloons bobbed on the hall railings. A banner read HAPPY 6TH BIRTHDAY, ARCHIE! A small pirate in face paint ran past the van shrieking. Bill watched him go. “We’re really doing this?” “We are not stealing from children,” Molly said firmly. “We’re breaking into a children’s party to steal all the presents.” “We are intercepting suspicious parcels before they can corrupt the youth.” “With action figures and glitter slime?” “Exactly. Glitter slime is society’s downfall.” Bill tapped the map. “Entry point?” “Side door by the kitchen. The entertainer arrives at two. We go in at one fifty-five, wearing these.” She held up two sagging mascot costumes. Bill stared. “A rabbit and a baked potato?” “It’s a bear.” “It has foil on its head.” “That’s a crown.” “It’s a jacket potato with royal ambitions.” “You can be the potato king, then.” “I am not wearing a tuber monarchy.” “You wore a traffic cone to rob a jeweller.” “That was elegant.” Molly checked her watch. “Focus. Once inside, you distract the children.” “How?” “Juggling.” “I can’t juggle.” “Then fail loudly. Children love disaster.” Bill considered this. “True. Last week my nephew watched me fall off a stool for forty minutes.” “I collect the presents, load them into the laundry cart, and we exit through the cloakroom.” “What if someone asks why a rabbit is wheeling away birthday gifts?” “I’ll say it’s part of the magic show.” “What magic show?” “The one where the presents disappear.” Bill nodded slowly. “That is honest, in a way.” Molly zipped herself into the rabbit costume. One ear flopped over her eye. “How do I look?” “Like Easter lost a bar fight.” “Perfect. Menacing but seasonal.” Bill climbed reluctantly into the bear suit. The crown-foil thing crackled. “I can’t see.” “You don’t need to see. You need to be whimsical.” “I feel boiled.” “That’s the spirit.” They slipped from the van and waddled toward the hall. Music thumped inside. A woman carrying a tray of sausage rolls opened the side door with her hip. Molly brightened her voice. “Entertainment!” The woman blinked. “You’re early.” Bill waved. “We’re very punctual woodland carbohydrates.” “Pardon?” “He means characters,” Molly said, elbowing him. “Oh. Lovely. Go through.” They entered chaos. Children zigzagged between chairs. A magician argued with a balloon pump. Presents glittered in a mountain beneath a table. Bill whispered, “That’s a lot of wrapping paper.” “Stay calm.” A little girl pointed at him. “Mummy, why is the bear crunchy?” Bill whispered, “I told you.” Molly hissed, “Distract.” Bill raised both paws. “Who wants to see me juggle?” Twenty children screamed, “Me!” Bill picked up three foam balls from a game stall, tossed one, fumbled the second, and somehow bounced the third off his own crowned head. The children exploded with laughter. “Again!” shouted a pirate. “No,” Bill said. “Art must not be rushed.” Molly grabbed the laundry cart and began sweeping presents into it. A boy in a superhero cape narrowed his eyes. “Why are you taking Archie’s presents?” Molly leaned close. “Secret mission.” “What kind?” “Anti-glitter.” The boy gasped. “Mum bought glitter slime.” “I knew it.” He pointed. “That blue one.” “Thank you, citizen.” Across the room, Bill was now lying on the floor while children stacked paper cups on him. “This is not in the plan,” he groaned. “You’re doing beautifully,” Molly called. The birthday boy approached, clutching cake. “Are you stealing my presents?” Molly froze. “Borrowing.” “Why?” “Quality control.” Archie thought about it. “Can I come?” “No.” “Can I have the slime back if it’s safe?” Molly looked at Bill, who had a paper cup balanced on each ear. Bill said, “Molly.” She sighed, then lifted the blue present from the cart. “One dangerous item confiscated. The rest remain.” Archie grinned. “Good. I hate glitter slime. It gets in Dad’s beard.” Molly set the other presents back under the table. Bill sat up, shedding cups. “So what are we stealing?” Molly tucked the slime under her arm. “A menace.” “And cake,” Bill said. Archie handed him a slice. “For the potato king.” Bill straightened proudly. “At last. Respect.”
created_at
"2026-04-24T21:18:45.217Z"
4/24/2026, 9:18:45 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1900
moderation_latency
(null)
generation_time
29652
tokens_prompt
77
tokens_completion
1443
native_tokens_prompt
66
native_tokens_completion
1076
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.03261
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_0c0170fc1bd425b10169ebde3551ac8194bd84ca1b078041b0"
is_byok
false
latency
1480
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777065525-3GLR93icmwCRggzH7M0w"
session_id
(null)
api_type
"completions"
id
"gen-1777065525-cBz8EoPgnN2te6YjxvAN"
upstream_id
"resp_0c0170fc1bd425b10169ebde3551ac8194bd84ca1b078041b0"
total_cost
0.03261
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
748
0.00%
Dialogue to Total Word Ratio
Ratio: 53.53%, Deviation: 16.47%
neededClean
false
wordsTotal
751
wordsDialogue
402
0.0001%