NC Bench
Run: 2026-06-24-01KVX2G1JVH8N8VVBS6HTZWH6H
Run Details
Model: Z.AI GLM 5.2 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 1m 10s
Cost: $0.018331
Provider: Z.AI
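The Duration shown above can be cross-checked against the raw values in this run's Meta section (generation_time 70011, native_tokens_completion 4143, native_tokens_reasoning 3395). The sketch below assumes generation_time is in milliseconds and that the reasoning tokens are counted inside native_tokens_completion; neither is stated on the page, and the helper names are hypothetical.

```python
def format_duration(ms: int) -> str:
    """Format a millisecond count as 'Xm Ys', truncating fractional seconds."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes}m {seconds}s"


def tokens_per_second(tokens: int, ms: int) -> float:
    """Average generation rate over the whole request."""
    return tokens / (ms / 1000)


# Values copied from the Meta section of this run.
generation_time_ms = 70011        # assumed to be milliseconds
native_tokens_completion = 4143
native_tokens_reasoning = 3395    # assumed to be part of the completion count

duration = format_duration(generation_time_ms)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(duration)
print(f"{throughput:.1f} tokens/s")
print(f"{visible_tokens} non-reasoning tokens")
```

Under these assumptions the formatted duration agrees with the 1m 10s reported for the run, and most of the completion budget went to reasoning rather than to the visible scene.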
Evaluation summary
Score | Evaluator
0.00% | Matches word count
50.55% | Dialogue to Total Word Ratio
25.2760% | Overall
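The Overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not document the aggregation rule, so the following is only a sketch under that assumption; overall_score is a hypothetical name.

```python
def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores, in percent (assumed aggregation)."""
    return sum(scores) / len(scores)


# Evaluator scores as displayed on the page, already rounded to two decimals.
evaluator_scores = [0.00, 50.55]
overall = overall_score(evaluator_scores)
print(f"{overall:.4f}%")
```

Because the displayed evaluator scores are rounded, the result matches the reported 25.2760% only to within rounding error.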
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly sighed and watched a group of kids run screaming across the lawn, hopped up on sucrose. The suburban backyard was a chaotic sprawl of streamers and parental anxiety. They were parked three houses down, the van smelling faintly of old french fries and pure ambition. "Are you wearing a tutu?" Molly asked, peering over the dashboard. "It's a tactical tutu, Molly. Blends right in with the target demographic." Bill adjusted the pink tulle, which barely covered his massive knees. The fabric ripped slightly as he shifted his weight. "Plus, it has pockets." "It does not have pockets. You just shoved a crowbar down your tights. You look like a lumberjack in a Swan Lake production. You’re going to blow the whole operation before we even get past the bouncy castle." "The bouncy castle is the weak point," Bill said, tapping the blueprint, which was actually just a crayon drawing by his niece. "If I can create a diversion on the inflatable obstacle course, you can slip into the garage and bag the loot." "The loot is a pile of Legos and an Easy-Bake Oven." "Don't forget the Pony Princess Dream Castle. That thing retails for two hundred bucks on the secondary market. We’re sitting on a goldmine, Molly." "It’s a child's birthday party, Bill. There are going to be thirty hyped-up seven-year-olds, a face painter, and a clown. How exactly do you plan to divert them?" "I have a repertoire," Bill said confidently. "I’ll challenge the clown to a dance-off. Clowns are notoriously territorial. While he’s distracted trying to out-mime me, you hit the present table." "You can’t out-mime a clown. The mother will spot you." "Which is why you’re wearing the caterer's apron," Bill reminded her. "I am pretending to be a caterer at a party that already has a caterer? What if the real one asks me to refill the juice boxes?" "Tell her you’re strictly a high-end charcuterie specialist. Seven-year-olds don't even know what charcuterie is." "Nobody knows what charcuterie is, Bill! 
It’s just fancy cheese!" Molly took a deep breath, rubbing her temples to quell the rising migraine. The sheer audacity of the heist was starting to weigh on her. "Okay. New plan. I will cause the diversion. I will casually unplug the air compressor to the bouncy castle." "Genius. Total deflation panic." "While the parents are scrambling to rescue the suffocating children, you grab the presents. But leave the wrapping paper, it’s traceable." "Wait, we don't unwrap them here?" "No, we do not unwrap them in the van! We take them to the safe house and sort the merchandise!" "Fair enough. Do you think they have those little gift bags with the tissue paper? I love rustling the tissue paper." "Focus, Bill. We go in at exactly 2:15, right after the magic show but before the piñata. The sugar crash will be starting. They'll be lethargic." "I love a good piñata." "You can't hit the piñata! You're a thief, not a guest!" Molly hissed, gripping the steering wheel until her knuckles turned white. She stared at the crayon drawing again, tracing the crude outline of the garage with her finger. "Now get the ski masks on." "Ski masks with a tactical tutu? My knees will freeze." "Put on the mask, Bill."
created_at: "2026-06-24T15:03:07.625Z" (6/24/2026, 3:03:07 PM)
model: "z-ai/glm-5.2-20260616"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3505
moderation_latency: (null)
generation_time: 70011
tokens_prompt: 77
tokens_completion: 4513
native_tokens_prompt: 73
native_tokens_completion: 4143
native_tokens_completion_images: (null)
native_tokens_reasoning: 3395
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.0183314
router: (null)
provider_responses:
  0:
    endpoint_id: "442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
    id: "202606242303078c69459c4d944eb4"
    is_byok: false
    latency: 3504
    model_permaslug: "z-ai/glm-5.2-20260616"
    provider_name: "Z.AI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1782313387-KgzOF1HgBDSRezFYI8xR"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782313387-0ovqkIw09B4UIxWM2FTk"
upstream_id: "202606242303078c69459c4d944eb4"
total_cost: 0.0183314
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Z.AI"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 547
50.55% | Dialogue to Total Word Ratio | Ratio: 73.69%, Deviation: 3.69% | neededClean: false, wordsTotal: 555, wordsDialogue: 409
25.2760% | Overall
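The reported Ratio and Deviation follow directly from wordsDialogue and wordsTotal against the 70% target in the scenario. The sketch below reproduces that arithmetic and adds one plausible way to count dialogue words, by treating text inside double quotes as dialogue. The benchmark's actual tokenization is not documented here (the two evaluators even report different totals, 547 and 555), so count_dialogue_words is a hypothetical stand-in, not the evaluator's method. How the 3.69% deviation maps to the 50.55% score is not shown on the page and is not modeled.

```python
import re


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words), splitting words on whitespace.

    Assumption: dialogue is any text enclosed in double quotes, straight or curly.
    """
    normalized = text.replace("\u201c", '"').replace("\u201d", '"')
    spoken = re.findall(r'"([^"]*)"', normalized)
    dialogue_words = sum(len(segment.split()) for segment in spoken)
    total_words = len(normalized.split())
    return dialogue_words, total_words


def ratio_and_deviation(dialogue_words: int, total_words: int,
                        target_pct: float = 70.0) -> tuple[float, float]:
    """Dialogue share in percent and its absolute distance from the target."""
    ratio = 100.0 * dialogue_words / total_words
    return ratio, abs(ratio - target_pct)


# Counts taken from the Meta Data column above.
ratio, deviation = ratio_and_deviation(409, 555)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

Running ratio_and_deviation on the reported counts gives the same Ratio and Deviation as the Details column, which suggests the deviation is measured in percentage points from the 70% target.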