NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-24-01KVX3FWDGHXSWB1494CE68A1Z
Run Details
Model:
Z.AI GLM 5.2 (Reasoning, High)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 36s
Cost:
$0.056677
Provider:
Z.AI
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
95.31%
Dialogue to Total Word Ratio
(details)
92.8958%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The midday heat shimmered off the hot pavement. The suburban neighborhood was too cheerful for a heist. It smelled of grass and defeat. Molly adjusted her black tactical mask, squinting against the bright afternoon sun. Beside her, Bill checked his tactical watch, looking entirely too casual for a thief. They crouched behind a fragrant hydrangea bush, observing the target. "Are you absolutely sure this is the right house?" Bill nodded, gesturing toward the garish decorations. "Positive. It has a bouncy castle shaped like a dragon and a giant banner." Molly lowered the binoculars, glaring at her partner. "So we are stealing presents from a kid. This is a new low." Bill pulled a floor plan from his pocket, smoothing it out on the grass. "Those are premium building blocks, Molly. Do you know what those go for online?" She sighed, rubbing her temples. The things she did for rent money. "We are going to hell, Bill." He chuckled, pointing at a side window on the blueprints. "Only if we get caught. The parents hired a clown. They will be busy." A loud cheer erupted from the backyard, followed by the squeak of rubber shoes. "What is our breach point? Please do not say the bouncy castle." Bill tapped the paper with a grubby finger, ignoring her sarcasm. "The side window is unlocked. The mom opened it to let the cake smell out." Molly shook her head in disbelief at the domestic innocence of it all. "That is disgustingly wholesome. What about the extraction route once we have the loot?" He traced a line on the map toward the alleyway behind the fence. "We crawl in, grab the gift table bags, and slip out the back gate." Molly considered the logistics. It sounded almost too simple, lacking the usual alarms. "What if Timmy sees us taking his presents? Kids talk." Bill shrugged, packing the blueprint away into his tactical vest. "Give him a juice box. Children are very easily bribed with sugar." A bright horn honked, signaling the start of musical chairs in the yard. "And if the clown sees us? They are notoriously unpredictable." Bill smiled, checking the zipper on his duffel bag. "Also a juice box. Clowns are just giant, terrifying children anyway." Molly checked her watch. The cake cutting was their only window of distraction. "I hate this plan. My last job was a sophisticated art museum." Bill stood up, brushing dirt off his knees, ready to move. "And you got stuck in a vent. Here, the obstacle is a piƱata." She stood up, tightening her grip on the empty black bag. "Fine. Let us move. But I get the giant pirate ship set." Bill agreed, stepping out with a stealthy crouch. "Deal. Just do not step on a plastic brick during the escape." Molly paused, looking back at the chaotic yard one last time. "What if the guard dog sees us?" "There is no guard dog. Only a lazy cat that sleeps on the presents." "Perfect. A cat will not bark. We are clear for entry now."
created_at
"2026-06-24T15:20:30.909Z"
6/24/2026, 3:20:30 PM
model
"z-ai/glm-5.2-20260616"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3289
moderation_latency
(null)
generation_time
154949
tokens_prompt
77
tokens_completion
10488
native_tokens_prompt
73
native_tokens_completion
12858
native_tokens_completion_images
(null)
native_tokens_reasoning
12205
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0566774
router
(null)
provider_responses
0
endpoint_id
"442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id
"2026062423203141ab61d6828a4a95"
is_byok
false
latency
3289
model_permaslug
"z-ai/glm-5.2-20260616"
provider_name
"Z.AI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1782314430-4kUMTOnktzRHe2HWrduZ"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782314430-1yfC8LcLPr130hxxUnxr"
upstream_id
"2026062423203141ab61d6828a4a95"
total_cost
0.0566774
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Z.AI"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
510
95.31%
Dialogue to Total Word Ratio
Ratio: 49.02%, Deviation: 0.98%
neededClean
false
wordsTotal
510
wordsDialogue
250
92.8958%