NC Bench
Run: 2026-06-24-01KVX2SDBEEQDYK92Q3SS6NN8Y
Run Details
Model: Z.AI GLM 5.2 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 1m 29s
Cost: $0.029437
Provider: Z.AI
Evaluation summary
Score      Evaluator
43.38%     Matches word count
76.83%     Dialogue to Total Word Ratio
60.1038%   Overall
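The overall figure looks consistent with an unweighted mean of the two evaluator scores. The page does not state how the scores are combined, so the averaging rule in this sketch is an assumption. The check only confirms that the displayed numbers agree within rounding.

```python
# Displayed scores from the evaluation summary (percent, rounded to two decimals).
word_count_score = 43.38
dialogue_ratio_score = 76.83
displayed_overall = 60.1038

# Assumption: the overall score is the unweighted mean of the evaluator scores.
mean_of_displayed = (word_count_score + dialogue_ratio_score) / 2

# Each input is rounded to 0.01, so the mean of the rounded values can differ
# from the mean of the unrounded values by up to 0.005.
gap = abs(mean_of_displayed - displayed_overall)
consistent_with_mean = gap <= 0.005

print(f"mean={mean_of_displayed:.4f} gap={gap:.4f} consistent={consistent_with_mean}")
```

If the benchmark weights evaluators differently, the same check would fail for runs where the two scores are further apart, which is one way to test the assumption against other runs.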
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crayon map across the beige sedan. The blueprint depicted the backyard in alarming detail, complete with a sandbox designated as Sector Four. Molly adjusted her black turtleneck, which was itching furiously in the afternoon heat, and tapped the paper with a stern finger. Bill stood beside her, chewing a juice box swiped from reconnaissance. The air smelled of freshly cut grass and impending felony. Bill scanned the target residence, a modest home festooned with helium balloons bobbing in the breeze. From beyond the wooden fence, the high-pitched shrieks of sugar-crazed seven-year-olds echoed like war cries. Molly adjusted her ski mask, rolling the fabric up to act as a rather unfashionable beanie. It was eighty-five degrees outside, but professionalism demanded sacrifice. "Alright, Bill, listen up. The target is the Henderson backyard. The princess bounce house is a distraction." "A distraction? I thought we were just grabbing the gifts and running." "Amateur hour. We need a tactical approach. There’s a piñata shaped like a unicorn." "So we smash it and take the candy?" "No, the candy is a decoy. The real loot is in the garage, wrapped in SpongeBob paper." Bill shifted his weight, his boots crunching on the gravel driveway. He had initially suggested simply walking through the front gate, but Molly had immediately vetoed that idea, citing a complete lack of dramatic tension. She insisted on a covert entry through the side garden, navigating a treacherous labyrinth of oversized plastic tricycles and abandoned Capri Sun pouches. The stakes were incredibly high, at least twelve brightly wrapped boxes sat on a picnic table near the patio. That was high-end loot, maybe even a Lego Death Star. "I hate SpongeBob paper. It’s too slippery for a quick grab." Bill muttered, scanning the perimeter. "What about the clown? He looks suspicious." "Leave the clown to me. I brought a banana peel." "A banana peel? This isn’t a cartoon, Molly." 
"Just drive the van, Bill, and stay focused." "Fine, but if the clown has a squirt flower, I’m out." Molly produced a pair of binoculars, peering through the wooden slats of the fence. The tactical situation was evolving. A piñata was being lowered from a tree branch, creating a potential choke point. The enemy was heavily caffeinated and entirely unpredictable. A toddler in a tutu was currently guarding the present table with a plastic wand, posing a significant security threat. Molly knew they had exactly one window of opportunity. The moment the cake was brought out, the children would swarm the patio, leaving the gifts completely unguarded. "Just keep the engine running and watch out for the obstacle course," Molly whispered. "The slip-and-slide out there?" "Exactly. It’s a death trap." Bill checked his watch. The synchronized attack was set for precisely three o'clock, right when the birthday song reached its chaotic crescendo. He popped the last drop of fruit punch from the juice box and crushed the empty container in his fist. Molly gave the signal. They moved with practiced stealth, creeping toward the side gate, ready to execute the most audacious heist the cul-de-sac had ever seen.
created_at
"2026-06-24T15:08:14.586Z"
6/24/2026, 3:08:14 PM
model
"z-ai/glm-5.2-20260616"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2402
moderation_latency
(null)
generation_time
88309
tokens_prompt
77
tokens_completion
6577
native_tokens_prompt
73
native_tokens_completion
6667
native_tokens_completion_images
(null)
native_tokens_reasoning
5997
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.029437
router
(null)
provider_responses
0
endpoint_id
"442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id
"20260624230814175d23d6fe41477b"
is_byok
false
latency
2402
model_permaslug
"z-ai/glm-5.2-20260616"
provider_name
"Z.AI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1782313694-KsPbmlZFZ69RnqOd0jAA"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782313694-CcYII6g4YZl951gVj7s2"
upstream_id
"20260624230814175d23d6fe41477b"
total_cost
0.029437
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Z.AI"
response_cache_source_id
(null)
data_region
"global"
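A few derived figures can help when reading the metadata above. This is a rough sketch, assuming `generation_time` is reported in milliseconds and that `native_tokens_reasoning` is counted inside `native_tokens_completion`. Neither assumption is confirmed by the page.

```python
# Values copied from the run metadata.
generation_time_ms = 88309        # assumed unit: milliseconds
native_tokens_completion = 6667
native_tokens_reasoning = 5997    # assumed to be a subset of the completion tokens

# Tokens left over for the visible answer once reasoning is subtracted.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Fraction of the completion spent on reasoning.
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Average generation speed over the whole completion.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"visible tokens:    {visible_tokens}")
print(f"reasoning share:   {reasoning_share:.1%}")
print(f"tokens per second: {tokens_per_second:.1f}")
```

Under these assumptions, most of the completion budget went to reasoning rather than to the scene itself.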
Evaluation details
Result     Evaluator                      Details                           Meta Data
43.38%     Matches word count             n/a                               neededClean: false; words: 517
76.83%     Dialogue to Total Word Ratio   Ratio: 27.70%, Deviation: 2.30%   neededClean: false; wordsTotal: 527; wordsDialogue: 146
60.1038%
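The reported ratio and deviation can be reproduced from the word counts and the 30% target in the scenario. The sketch below redoes that arithmetic and adds a hypothetical `count_dialogue_words` helper that treats text inside straight double quotes as dialogue and splits on whitespace. The evaluator's actual counting rules are not shown on this page (the two evaluators report 517 and 527 total words), so the helper is illustrative only.

```python
import re

TARGET_RATIO_PCT = 30.0  # from the scenario: 30% dialogue

# Counts reported by the "Dialogue to Total Word Ratio" evaluator.
words_total = 527
words_dialogue = 146

ratio_pct = round(100 * words_dialogue / words_total, 2)
deviation_pct = round(abs(TARGET_RATIO_PCT - 100 * words_dialogue / words_total), 2)
print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Hypothetical counter: returns (dialogue_words, total_words).

    Dialogue is any span between straight double quotes; words are
    whitespace-separated tokens. Real evaluators may tokenize differently.
    """
    spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in spans)
    total_words = len(text.split())
    return dialogue_words, total_words


sample = 'Molly nodded. "Just drive the van, Bill." He sighed.'
sample_dialogue, sample_total = count_dialogue_words(sample)
print(sample_dialogue, sample_total)
```

Running the helper on the full response may not give exactly 146 and 527, since contractions, hyphenated words, and curly apostrophes can be counted in more than one way.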