NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-24-01KVX37TPVF2ZKW6QVC5J4FWAH
Run Details
Model:
Z.AI GLM 5.2 (Reasoning, High)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 15s
Cost:
$0.019379
Provider:
Z.AI
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
21.6892%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly stared at the green monstrosity standing by the open doors of the van. "Are you seriously wearing that?" Molly asked. "It’s a dinosaur costume," Bill said, adjusting the inflatable tail with his oversized claws. "It’s a children’s party, Molly. We have to blend in with the locals." "You are a six-foot-four man in a neon green T-Rex suit. You look like a mascot who lost a custody battle." "Exactly. They’ll just think I’m the entertainment. You’re only jealous because you chose a black turtleneck." Molly crossed her arms and sighed. "I am wearing tactical black because we are thieves. That is the uniform of a professional. Now, what is the entry point?" "The bouncy castle," Bill said proudly. "Absolutely not." "Hear me out. The bouncy castle is adjacent to the wooden fence. I climb the fence, bounce to get momentum, and leap over." "You cannot even step over a puddle without twisting your ankle. Plus, it’s a bouncy castle. You will get trapped in the mesh netting." "I have a contingency. I bring a pair of safety scissors." Molly rubbed her temples. "So your grand heist involves slicing through a vinyl bouncy castle in broad daylight while twenty toddlers scream?" "It’s a camouflage scream. Everyone screams at kids’ parties. It blends right in." "Bill, we are stealing presents from six-year-olds. We need absolute stealth." "Stealth is my middle name." "Your middle name is Bartholomew. What about securing the loot?" "Loot?" Bill asked, scratching his giant foam dinosaur head. "The presents, Bill. We are there to steal the presents." "Right. I figured we just grab the big pile by the picnic table and run." "Which picnic table?" "The one with the big cake." "There is going to be a dedicated present table. Probably by the adult chairs. We wait until the magician starts his act." "The magician? I hate magicians." "Everyone hates magicians. That is why it is the perfect distraction. While he is pulling a rabbit out of a hat, you and I load the presents into my duffel bag." "You brought a duffel bag? I thought we were using your fanny pack." "My fanny pack holds exactly one juice box and a pack of gum. There are forty presents out there." "Okay, new plan. I use my tiny dinosaur arms to carry the big boxes. You take the gift bags." "They are not tiny. They are proportional to the suit." "Bill, focus. We go in, grab the gifts, toss them over the fence to the van, and leave." "What about the cake?" "We are not taking the cake." "But it looks like a chocolate cake. You love chocolate cake." "I am not risking a prison sentence for a supermarket sheet cake." "What if it is from a boutique bakery?" "It does not matter. Grab the presents, ignore the cake." "Fine. But if there are goodie bags, I am taking one. I need a new pencil eraser." "You are the worst criminal I have ever met." "Thank you. Does the T-Rex zip up in the back? I feel a breeze." "Just get in the van, Bill."
created_at
"2026-06-24T15:16:07.02Z"
6/24/2026, 3:16:07 PM
model
"z-ai/glm-5.2-20260616"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3136
moderation_latency
(null)
generation_time
75149
tokens_prompt
77
tokens_completion
4657
native_tokens_prompt
73
native_tokens_completion
4381
native_tokens_completion_images
(null)
native_tokens_reasoning
3700
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0193786
router
(null)
provider_responses
0
endpoint_id
"442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id
"20260624231607147fd053fa73438b"
is_byok
false
latency
3136
model_permaslug
"z-ai/glm-5.2-20260616"
provider_name
"Z.AI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1782314167-focnIR2SEEpCcXaewrLT"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782314167-IjwW4HVpMAXq5OYGCrZh"
upstream_id
"20260624231607147fd053fa73438b"
total_cost
0.0193786
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Z.AI"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
517
0.00%
Dialogue to Total Word Ratio
Ratio: 90.98%, Deviation: 20.98%
neededClean
false
wordsTotal
521
wordsDialogue
474
21.6892%