NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-24-01KVX2G6XAWSHH6ZBB8HRSGY92
Run Details
Model:
Z.AI GLM 5.2 (Reasoning, High)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 17s
Cost:
$0.020949
Provider:
Z.AI
Evaluation summary
Score
Evaluator
Details Link
68.10%
Matches word count
(details)
53.24%
Dialogue to Total Word Ratio
(details)
60.6692%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tactical binoculars, the heavy plastic strap leaving a deep red indentation across the bridge of her nose. The suburban fortress loomed ahead of their parked van, an unassuming structure adorned with neon pink lawn flamingos and a massive vinyl banner that cheerfully proclaimed "Happy 7th Birthday, Timmy!" The afternoon sun glared off the inflatable bouncy castle in the driveway. "The perimeter is guarded by a golden retriever. He looks remarkably fat and slow, but I am not taking any chances with a beast of that size. Did you bring the chloroform-soaked knockout drops or the industrial zip ties?" Bill rummaged loudly through a heavy canvas duffel bag overflowing with high-tech lockpicking gear, thermal blankets, and a suspiciously half-eaten family-size bag of extruded cheese puffs that crinkled with every movement. "I brought a premium squeaky toy shaped exactly like a pork chop. That should distract him long enough for us to silently bypass the back sliding door. What is the situational status on the parents?" Molly clicked her tactical penlight, shining the narrow beam directly onto a crayon-drawn map of the living room layout that Timmy had apparently made at school. "The dad is currently manning the charcoal grill. He is on his third lite beer, so his reaction time is severely compromised. The mom is in the kitchen cutting a massive cake shaped like a stegosaurus. We have a strict ten-minute window while they gather around the dining table to sing." Bill nodded slowly, chewing thoughtfully on a bright orange cheese puff. He carelessly wiped his powdery, sticky fingers on the sleek fabric of his black tactical turtleneck. "Ten minutes is tight for a job this size. We have to secure the Lego Death Star, the remote-control monster truck, and whatever is in that enormous gift bag with the glittery tissue paper. I heard Timmy's grandma is loaded." Molly sighed, rubbing her temples in slow, frustrated circles. The stakes of this afternoon operation were incredibly high, and the immense pressure was clearly getting to her partner before they even breached the perimeter. "Leave the glitter bag for absolute last. That stuff gets everywhere and instantly compromises our forensic footprint. We go in through the doggy door, grab the loot, and exfiltrate through the garage. Nobody gets hurt, and we walk away with thousands in retail plastic." Bill checked his waterproof tactical watch, the sweeping second hand ticking down to their designated moment of glory. He took a deep, shuddering breath to steady his rapidly fraying nerves before the big push. "What if Timmy cries? I can't handle crying kids, Molly. It throws off my center of gravity and ruins my focus. I have a very sensitive soul." Molly grabbed Bill firmly by the shoulders, staring directly into his eyes with the unwavering intensity of a seasoned professional thief who had successfully seen it all before. "If Timmy cries, you look him dead in the eye and tell him Santa came early but his elves unionized and demanded immediate repossession. Now put on your ski mask. It's time to ruin a birthday."
created_at
"2026-06-24T15:03:13.084Z"
6/24/2026, 3:03:13 PM
model
"z-ai/glm-5.2-20260616"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3437
moderation_latency
(null)
generation_time
76568
tokens_prompt
77
tokens_completion
5088
native_tokens_prompt
73
native_tokens_completion
4738
native_tokens_completion_images
(null)
native_tokens_reasoning
4090
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0209494
router
(null)
provider_responses
0
endpoint_id
"442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id
"20260624230313af0b64f6a0434889"
is_byok
false
latency
3437
model_permaslug
"z-ai/glm-5.2-20260616"
provider_name
"Z.AI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1782313393-5xBZh0PVrTsoIJDzQ7Jw"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782313393-TPfedJnVFwHxwmPE7e0a"
upstream_id
"20260624230313af0b64f6a0434889"
total_cost
0.0209494
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Z.AI"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
68.10%
Matches word count
n/a
neededClean
false
words
514
53.24%
Dialogue to Total Word Ratio
Ratio: 53.55%, Deviation: 3.55%
neededClean
false
wordsTotal
521
wordsDialogue
279
60.6692%