NC Bench
Run: 2026-06-24-01KVX2H8CHG80FTBYGHRD36EKS
Run Details
Model: Z.AI GLM 5.2 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 2m 14s
Cost: $0.039979
Provider: Z.AI
Evaluation summary
Matches word count: 0.49%
Dialogue to Total Word Ratio: 99.23%
Overall: 49.8594%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, peering through the tinted window of the van at the Henderson residence. The backyard was a chaotic sea of primary colors, completely devoid of tactical awareness. Bouncy castles inflated like bloated military tents, swaying in the humid afternoon breeze. A clown with highly questionable credentials was currently twisting balloons into unrecognizable animals, oblivious to the impending tactical incursion. Somewhere beneath the layers of streamers, half-eaten cupcakes, and screaming seven-year-olds lay the primary objective. The motherlode. Tommy Henderson’s birthday presents. "Remind me why we aren't knocking over a bank," Bill whined from the passenger seat, aggressively scratching his stubble. He was wearing a tactical black turtleneck that was entirely too hot for a Saturday afternoon in July. Sweat was pooling around his collar, staining the expensive fabric. "Banks have lasers and guards," Molly replied, not taking her eyes off the yard. "Tommy Henderson has a piñata and a dog." "That dog looks mean." "It's wearing a tutu. We go in during the cake." Bill looked down at the hand-drawn schematic of the backyard spread across his knees. The crayon annotations were a bit much, but Molly always insisted on visual aids for complex operations. The perimeter was clearly marked with a purple crayon, indicating the boundaries of the Henderson property. A faint smell of stale coffee filled the van as they waited in absolute silence. "So I hit the fence and infiltrate the present table?" "Exactly. I'll provide cover. If anyone spots you, I'll throw a juice box." "A juice box?" "Do you want the loot? Tommy's uncle got him a solid gold Millennium Falcon. That's fifteen grand." "Fifteen grand. I could fix my boat." Bill chewed his lower lip, his eyes widening at the thought of that shiny bounty. Fifteen grand was enough to finally get his pontoon boat fixed, assuming the engine block hadn't rusted completely through during the winter. 
He imagined himself cruising down the river, the wind in his hair, without a care in the world. No more second-story jobs, no more crawling through air ducts. "Stop worrying about the dog. Grab the loot, stuff it in the duffel, and blend in. If a mom talks to you, complain about your back." "Got it. What if they ask where my kid is?" "Say he's in the bounce house. Nobody checks it." Bill nodded, pulling down his ski mask despite the sweltering heat. He looked completely ridiculous, like a trained ninja who had lost his way to a winter Olympics tryout. The dark wool contrasted sharply with the bright, cheerful noise bleeding through the van's thin metal walls. Molly grabbed his arm before he could open the door. "Wait. The clown is getting the cake ready." She reached into the glovebox and pulled out two walkie-talkies, tossing one to him. He caught it clumsily, nearly dropping the bulky plastic device onto the gear shift. He fumbled with the dial, making sure the volume was turned down to avoid any sudden static bursts that might give away their position. "Loud and clear, boss. Let's hit them." "Good. And Bill?" "Yeah?" "If you step on a Lego, you're on your own out there."
created_at: "2026-06-24T15:03:47.359Z" (6/24/2026, 3:03:47 PM)
model: "z-ai/glm-5.2-20260616"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2442
moderation_latency: (null)
generation_time: 133949
tokens_prompt: 77
tokens_completion: 8962
native_tokens_prompt: 73
native_tokens_completion: 9063
native_tokens_completion_images: (null)
native_tokens_reasoning: 8373
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.0399794
router: (null)
provider_responses
0
endpoint_id: "442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id: "2026062423034749dfc87442b5437e"
is_byok: false
latency: 2441
model_permaslug: "z-ai/glm-5.2-20260616"
provider_name: "Z.AI"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1782313427-hYccTcMnFHKkbW3bU1wi"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782313427-1R8rsv4ZZ0kErVVeqg9h"
upstream_id: "2026062423034749dfc87442b5437e"
total_cost: 0.0399794
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Z.AI"
response_cache_source_id: (null)
data_region: "global"
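The timing and token fields above can be cross-checked against the run details. The sketch below assumes that generation_time is expressed in milliseconds (the page does not state the unit) and that native_tokens_reasoning is a subset of native_tokens_completion. The variable names are illustrative only.

```python
# Values copied from the metadata above.
generation_time_ms = 133949        # assumption: milliseconds
native_tokens_completion = 9063
native_tokens_reasoning = 8373     # assumption: counted within the completion tokens

# Convert the generation time to the "Xm Ys" form used in the run details.
total_seconds = round(generation_time_ms / 1000)
minutes, seconds = divmod(total_seconds, 60)
duration = f"{minutes}m {seconds}s"

# Share of completion tokens spent on reasoning rather than visible output.
reasoning_share = 100.0 * native_tokens_reasoning / native_tokens_completion

print(duration)
print(f"Reasoning share of completion tokens: {reasoning_share:.1f}%")
```

Under these assumptions the converted duration agrees with the 2m 14s shown in the run details, and most of the completion tokens were spent on reasoning.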
Evaluation details
Matches word count
Result: 0.49%
Details: n/a
neededClean: false
words: 527

Dialogue to Total Word Ratio
Result: 99.23%
Details: Ratio: 30.39%, Deviation: 0.39%
neededClean: false
wordsTotal: 533
wordsDialogue: 162
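The ratio and deviation reported by the dialogue evaluator can be reproduced from the two word counts in its metadata. This is a minimal sketch: the function name dialogue_ratio and the TARGET_PERCENT constant are illustrative, with the 30% target taken from the scenario text. It does not attempt to reproduce the 99.23% score, because the scoring formula is not shown on this page.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


TARGET_PERCENT = 30.0  # from the scenario: "Write 500 words with 30% dialogue"

# wordsDialogue and wordsTotal as reported in the evaluator metadata.
ratio = dialogue_ratio(162, 533)
deviation = abs(ratio - TARGET_PERCENT)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```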
Overall: 49.8594%
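The overall figure is numerically consistent with an unweighted mean of the two evaluator scores. That is an inference from the numbers on this page, not a documented formula, and the small gap is what one would expect if the displayed scores are rounded to two decimals. A quick check, with illustrative variable names:

```python
from statistics import mean

# Evaluator scores as displayed on the page (rounded to two decimals).
scores = {
    "Matches word count": 0.49,
    "Dialogue to Total Word Ratio": 99.23,
}

# Assumption: the overall score is a plain average of the evaluator scores.
overall_estimate = mean(list(scores.values()))
reported_overall = 49.8594

print(f"Estimated: {overall_estimate:.2f}%, reported: {reported_overall}%")
```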