NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0P5XD8NCYBJ4VCPGDJFD4D
Run Details
Model:
GPT-5.5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.9s
Cost:
$0.030090
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.01%
Dialogue to Total Word Ratio
(details)
0.0026%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly flattened the crayon-scribbled map on the dashboard and pinned it with a half-eaten doughnut. “Right,” she said. “Target: the Henderson twins’ seventh birthday party. Location: community hall. Security: one grandmother, two balloon arches, and a magician named The Amazing Plonko.” Bill peered through the windshield at the hall, where paper dinosaurs bobbed in the breeze. “I don’t like it.” “You never like it.” “I liked the biscuit factory job.” “You got trapped in a vat of custard.” “Warm custard, Molly. There’s a difference.” She tapped the map. “Presents are here, beside the cake table. We go in disguised as children’s entertainers, create a diversion, and remove the parcels.” Bill lifted a sequined top hat from his lap. “I still say my clown name should be Sir Honks-a-Lot.” “It sounds like a goose with a mortgage.” “Fine. Professor Giggles.” “You’re not a professor of anything.” “I’ve studied snacks extensively.” Molly opened a canvas bag and pulled out juggling pins, a fake moustache, and a clipboard. “Listen carefully. We are not actually stealing the presents.” Bill blinked. “We’re not?” “No. We’re retrieving them.” “From children?” “From Derek ‘Sticky Fingers’ Doyle. He’s posing as Uncle Derek, and he’s already swapped half the gifts for empty boxes. Mrs. Henderson hired us to catch him quietly before the children notice.” Bill sat straighter. “So we’re heroic thieves?” “Consultants.” “With masks?” “No masks.” “With capes?” “Absolutely no capes.” “Then how will people know we’re heroic?” “They’ll know because we don’t fall over the punch bowl.” Bill considered this. “That’s a high standard.” Molly shoved the top hat onto his head. “Your job is to keep the children watching the puppet show.” “I don’t do puppets.” “You own six.” “They’re colleagues.” “Your colleagues are going to save a birthday.” He sighed and pulled a sock puppet from his pocket. “Reginald hates pressure.” “Reginald is a sock.” “Say that to his face.” Inside the hall, the air smelled of frosting, crisps, and sugar-fuelled chaos. Children shrieked around a bouncy castle shaped like a dragon. At the far wall, a heap of bright presents glittered beneath a banner reading HAPPY BIRTHDAY, LUCY AND LEO! Molly smiled at a woman in a dinosaur apron. “Mrs. Henderson?” The woman whispered, “Thank goodness. Uncle Derek is by the gift table. Blue shirt. Suspicious knees.” Bill glanced over. “His knees do look criminal.” “Places,” Molly said. Bill leapt onto a small stage. “Ladies, gentlemen, and persons currently wearing cake! I am Professor Giggles, and this is Reginald the Remarkably Tax-Aware Ferret!” A child shouted, “That’s a sock!” Reginald bobbed. “And yet I have better manners than you, small sir!” The children erupted with laughter. Even The Amazing Plonko paused mid-card trick. Molly slipped toward the present table, clipboard raised. Derek was crouched beside a sack, sliding a shiny parcel inside. “Uncle Derek,” she said brightly, “quick inventory for the thank-you notes.” He froze. “What?” “Name of gift?” “Er… box.” “Lovely. From?” “Me.” “To?” “Birthday.” Molly clicked her pen. “Very convincing.” Onstage, Bill shouted, “Who wants to see Reginald escape from the Castle of Doom?” Every child roared, “Me!” Derek stood, sweating. “I was just tidying.” “Into a sack marked DEREK’S LAUNDRY?” “It’s my laundry.” “It jingles.” “My socks are festive.” Molly yanked open the sack. Out tumbled dolls, books, puzzles, and one remote-control dinosaur that immediately began roaring and ramming Derek’s ankle. Bill, still performing, pointed dramatically. “Behold! Reginald has summoned justice!” Mrs. Henderson gasped. “Derek!” Derek tried to run, slipped on a paper plate, and landed sitting in the bouncy castle entrance. The dragon swallowed him up with a wheeze. The children cheered. Molly gathered the recovered presents. “No harm done.” Bill bowed as Reginald waved. “Another successful mission, partner.” Molly glanced at his bulging jacket. “Bill.” “What?” “Why is there a cupcake in your pocket?” He looked offended. “That’s not theft. That’s evidence.” Reginald nodded. “Delicious evidence.”
created_at
"2026-04-24T21:26:21.358Z"
4/24/2026, 9:26:21 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2115
moderation_latency
(null)
generation_time
24752
tokens_prompt
77
tokens_completion
1280
native_tokens_prompt
66
native_tokens_completion
992
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.03009
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_0084096e363eb4990169ebdffd7240819497910226d7a9a756"
is_byok
false
latency
259
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777065981-Zl54Tj5ulMMULkQzaEaw"
session_id
(null)
api_type
"completions"
id
"gen-1777065981-bt1druLPQAL7BLie8avX"
upstream_id
"resp_0084096e363eb4990169ebdffd7240819497910226d7a9a756"
total_cost
0.03009
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
647
0.01%
Dialogue to Total Word Ratio
Ratio: 55.95%, Deviation: 14.05%
neededClean
false
wordsTotal
656
wordsDialogue
367
0.0026%