NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0N43ZA8B3EDATAY66J7GYJ
Run Details
Model:
GPT-5.5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
28.2s
Cost:
$0.034230
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.15%
Dialogue to Total Word Ratio
(details)
0.0761%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her nose to the rain-streaked window of the rented van and squinted at the house across the street. Balloons bobbed on the gate like brightly colored witnesses. A paper banner sagged over the porch, announcing HAPPY 7TH BIRTHDAY, TOBY! in letters large enough to shame the moon. Bill sat beside her with a notebook balanced on his knee, chewing the end of a pencil with the grave concentration of a man defusing a bomb, or choosing soup. “That’s a lot of presents,” Molly said. Bill nodded. “A scandalous quantity.” “Wrapped, ribboned, unattended.” “Temporarily attended,” Bill corrected. “By children sticky with cake, adults weakened by party sausages, and one hired clown whose shoes suggest poor balance.” Molly lowered the binoculars. “Bill, focus. We are not here to critique footwear.” “We are if the footwear becomes relevant to the operation.” In the back of the van, their equipment lay in tragic disarray: two party hats, a plastic wand, a sack labeled LAUNDRY because Bill felt labels made crime look tidy, and a large stuffed crocodile Molly had insisted was “for blending in.” The crocodile stared at them with stitched optimism. “Our objective,” Molly said, tapping the dashboard, “is simple. We enter the children’s party, locate the presents, remove the presents, and leave before anyone asks why neither of us knows Toby.” Bill wrote this down, then underlined Toby twice. “What if Toby asks?” “We compliment his age.” “He’s seven.” “Then we say he wears it well.” Across the street, a woman opened the front door and shouted something cheerful. A stampede of children surged from the hallway into the garden, where a bouncy castle lurched like a blue lung. Molly and Bill both recoiled instinctively. Bill swallowed. “There are more of them than I expected.” “They travel in packs.” “And they scream.” “It’s how they communicate.” A small boy wearing fairy wings tackled another boy dressed as a pirate. The clown attempted to intervene and was immediately given a crown of jam. Bill shut his notebook. “Molly, I’m beginning to question the wisdom of stealing from children.” Molly looked offended. “We are professionals. We question wisdom afterwards, during pie.” “But look at them. They’re so… short.” “That is not a moral category.” “It feels like one.” Molly sighed and watched a little girl place a carefully wrapped present on the growing pile by the front window. The box was decorated with uneven stars, the kind cut by careful hands with blunt scissors. She felt, against her will, a small splinter of conscience wedge itself beneath her ambition. Bill noticed. “You’re thinking it too.” “I am thinking the window latch is terrible.” “You are thinking that present has homemade stars.” “I hate you.” “No, you don’t. You hate that I’m right in a waistcoat.” Molly sank back into her seat. Rain ticked on the roof. From the house came a roar of laughter as the clown slipped, recovered, and bowed as if falling had been art. “So,” Bill said gently, “new plan?” Molly stared at the sack labeled LAUNDRY. Then at the crocodile. Then at the balloons shivering in the rain. “We still break into the party,” she said. Bill blinked. “Pardon?” “We go in, we steal all the presents—” “Molly.” “—from the porch, where the rain is getting them wet, and move them to the dining room. Then we eat cake. Quietly. Criminally quietly.” Bill opened his notebook again. “A humanitarian heist.” “A temporary relocation of loot.” “And if anyone asks who we are?” Molly picked up the stuffed crocodile and tucked it under her arm with dignity. “We’re with the reptile entertainment.”
created_at
"2026-04-24T21:07:53.971Z"
4/24/2026, 9:07:53 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
8533
moderation_latency
(null)
generation_time
28054
tokens_prompt
77
tokens_completion
1043
native_tokens_prompt
66
native_tokens_completion
1130
native_tokens_completion_images
0
native_tokens_reasoning
321
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.03423
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_09fd4d4bcbd3d5400169ebdbaa12348196a033702c95c226fb"
is_byok
false
latency
344
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777064873-pWB4W2YGWB2pzS77Y6G6"
session_id
(null)
api_type
"completions"
id
"gen-1777064873-J9zNUZrgkSSjc0SaOYLK"
upstream_id
"resp_09fd4d4bcbd3d5400169ebdbaa12348196a033702c95c226fb"
total_cost
0.03423
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
603
0.15%
Dialogue to Total Word Ratio
Ratio: 41.39%, Deviation: 11.39%
neededClean
false
wordsTotal
604
wordsDialogue
250
0.0761%