NC Bench
Run: 2026-06-24-01KVX2QT8R27580ZDNC9N58C69
Run Details
Model: Z.AI GLM 5.2 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 1m 34s
Cost: $0.031932
Provider: Z.AI
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 69.54%
Overall: 34.7679%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill paced the rusted van, his boots clanging against the metal floor. The vehicle smelled faintly of stale takeout. He stopped at the rear doors and turned to face his partner. "So let me get this straight. We are casing a Chuck E. Cheese?" Molly did not look up from her binoculars. She adjusted the focus dial with a practiced thumb, peering through the windshield at the brightly lit building across the parking lot. "No, Bill. We are casing the private event room at the Malibu Grand Prix. A kid named Timmy is turning eight, and his dad is a tech billionaire. The presents alone are worth six figures." "Six figures? For an eight-year-old? What is he getting, a solid gold scooter?" Molly lowered the binoculars and dropped them onto the dashboard. She pulled a crumpled blueprint from a manila folder, smoothing it out against the steering wheel. "A diamond-encrusted iPad, apparently. Plus cash. Now, the plan is simple. You go in as the clown." "Absolutely not. I am not wearing the red nose." "Bill, the red nose is non-negotiable. It’s the cornerstone of the entire operation." "I look ridiculous in polka dots! And the shoes are a tripping hazard." "Which is exactly why you are perfect for the distraction phase. You trip, you fall, you make balloon animals. Meanwhile, I slip into the coatroom and bag the loot." Bill slumped into the passenger seat, staring at the oversized shoes resting by his feet. He picked one up, weighing the massive foam shoe in his hands. "What if the kids swarm me? Eight-year-olds are vicious. They bite." "Then bite them back! I don’t care, just keep them occupied for five minutes." "Five minutes is a lifetime in the clown trenches, Molly. What if they demand a magic trick? I only know the one where I make my thumb come off." Molly rubbed her temples, exhaling a long, frustrated breath. "Then perform the thumb trick! Do the thumb trick until your actual thumb falls off. Just make sure security is looking at you when the fire alarm goes off." 
"Ah, the fire alarm. Now that’s something I can get behind. Who’s pulling it?" "I am, obviously. Right after I secure the gift bags. Once the alarms blare, chaos erupts. Parents grab their kids, I blend into the panic, and we meet back here." "What about the getaway? I can’t run in these giant shoes." "You won’t have to run. You’ll waddle. Waddle out the side exit by the skee-ball ramps." "Waddle? I’ll look like a pregnant penguin." Molly finally cracked a smile, adjusting the rearview mirror. "A pregnant penguin with a diamond iPad in his oversized pants. Now, are we doing this or not?" "I guess. But if a kid kicks me in the shins, I’m quitting this crew." "Noted. Put on the wig." Bill reached into a duffel bag on the floorboards. He pulled out a garish, neon-colored accessory and held it up in the dim light. "Do I have to wear the rainbow afro too?" "Bill. The wig is the rainbow afro." "You’ve got to be kidding me. This is humiliating." "Crime is a humiliating business, Bill. Now fluff it up. Timmy’s party starts in twenty minutes." "Fine. But if anyone asks, my clown name is Barry." "Your clown name is Bonkers, and you love it." "I hate you, Molly." "Shut up and honk your nose."
created_at: "2026-06-24T15:07:22.276Z" (6/24/2026, 3:07:22 PM)
model: "z-ai/glm-5.2-20260616"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3022
moderation_latency: (null)
generation_time: 93484
tokens_prompt: 77
tokens_completion: 7470
native_tokens_prompt: 73
native_tokens_completion: 7234
native_tokens_completion_images: (null)
native_tokens_reasoning: 6467
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.0319318
router: (null)
provider_responses: 0
endpoint_id: "442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id: "20260624230722e66bbd98cc9f4f24"
is_byok: false
latency: 3022
model_permaslug: "z-ai/glm-5.2-20260616"
provider_name: "Z.AI"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1782313642-CS1KKQdUIIcuwdCPUhOB"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782313642-paQBCewkDb7vk4XgRoJR"
upstream_id: "20260624230722e66bbd98cc9f4f24"
total_cost: 0.0319318
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Z.AI"
response_cache_source_id: (null)
data_region: "global"
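The token and timing fields above can be combined into a few derived figures. The following is a rough sketch, not part of the NC Bench page: it assumes that generation_time is in milliseconds and that native_tokens_reasoning is counted inside native_tokens_completion. Neither assumption is stated in the metadata itself, and the variable names are chosen for illustration only.

```python
# Values copied from the run metadata.
generation_time_ms = 93484        # assumption: unit is milliseconds
native_tokens_completion = 7234
native_tokens_reasoning = 6467    # assumption: subset of the completion tokens

# Tokens left over for the visible answer once reasoning is subtracted.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning, as a percentage.
reasoning_share = 100 * native_tokens_reasoning / native_tokens_completion

# Average generation throughput over the whole completion.
throughput = native_tokens_completion / (generation_time_ms / 1000)

print(f"Visible tokens: {visible_tokens}")
print(f"Reasoning share: {reasoning_share:.1f}%")
print(f"Throughput: {throughput:.1f} tokens/s")
```

Under these assumptions, roughly nine tenths of the completion tokens went to reasoning rather than to the scene itself.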
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 567
Dialogue to Total Word Ratio
Result: 69.54%
Details: Ratio: 72.70%, Deviation: 2.70%
Meta Data: neededClean: false, wordsTotal: 575, wordsDialogue: 418
Overall: 34.7679%
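The reported ratio, deviation, and overall figures can be checked from the word counts above. This is a minimal sketch with hypothetical function names. It assumes the overall score is the plain mean of the two evaluator scores, which is inferred from the numbers rather than documented on the page. How the 2.70% deviation maps to the 69.54% score is not shown, so that step is not reproduced here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Percentage of all words that fall inside dialogue."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Absolute distance between the measured ratio and the 70% target."""
    return abs(ratio_pct - target_pct)


def overall_score(scores: list[float]) -> float:
    """Assumption: overall is the unweighted mean of the evaluator scores."""
    return sum(scores) / len(scores)


ratio = dialogue_ratio(words_dialogue=418, words_total=575)
deviation = deviation_from_target(ratio)
overall = overall_score([0.0, 69.54])

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")  # Ratio: 72.70%, Deviation: 2.70%
print(f"Overall: {overall:.2f}%")                           # Overall: 34.77%
```

The page reports the overall as 34.7679%, which suggests the ratio evaluator's unrounded score is about 69.5358% and that 69.54% is a display rounding.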