NC Bench
Run: 2026-06-24-01KVX36WH7EDVNEKRZBBH25CBJ
Run Details
Model: Z.AI GLM 5.2 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 1m 52s
Cost: $0.035293
Provider: Z.AI
Evaluation summary
Score | Evaluator | Details Link
90.48% | Matches word count | (details)
72.70% | Dialogue to Total Word Ratio | (details)
81.5913% | Overall |
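The overall figure is consistent with an unweighted mean of the two evaluator scores. That aggregation rule is an assumption inferred from the numbers, not something the page states, and `overall_score` is a hypothetical helper. A minimal Python check using the rounded scores listed in the summary:

```python
# Assumption: the overall score is the unweighted mean of the evaluator scores.
# The page does not state the aggregation rule; this only checks consistency.
scores = {
    "Matches word count": 90.48,
    "Dialogue to Total Word Ratio": 72.70,
}


def overall_score(evaluator_scores):
    """Return the unweighted mean of percentage scores (hypothetical helper)."""
    if not evaluator_scores:
        raise ValueError("at least one evaluator score is required")
    return sum(evaluator_scores.values()) / len(evaluator_scores)


overall = overall_score(scores)
print(f"{overall:.2f}%")
```

Because the inputs are rounded to two decimals, the result agrees with the reported 81.5913% only to about two decimal places.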
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"So let me get this straight. You want to infiltrate a six-year-old's birthday party by dressing up as a dinosaur?" Molly tapped the blueprints with a chewed-up pen, glaring at the layout. "It's a T-Rex, Molly. Specifically, an inflatable one. Kids love inflatables." Bill adjusted his black ski mask, pulling it tight over his unshaven face. The living room was dim. "It's ninety degrees outside. You'll pass out before you even reach the bouncy castle. And why the ski mask?" she asked, wiping sweat from her forehead. "For the stealth portion. The dino suit is for the approach. I take off the head when I get inside, slip the mask on, grab the loot, and vanish." "Vanish from a room full of first graders and their angry mothers? Bill, they'll tear you apart." Molly paced the length of the small room, her boots crunching on broken glass. "Which is exactly why we need a distraction. That's where you come in." She stopped pacing and crossed her arms. The deep, familiar dread of a terrible plan settled heavily in her stomach. "What am I doing?" "You're going to be the entertainment. A fairy princess. A very clumsy fairy princess who accidentally unplugs the bouncy castle." "That will cause a riot. A literal toddler riot." She rubbed her temples, feeling a sharp migraine forming right behind her eyes. "Exactly. While the moms resuscitate the deflated castle and calm the screaming brats, I sneak into the living room, bag the loot, and climb out the bathroom window." Molly stared at the crude map, tracing a finger over the drawn square labeled 'prize table'. "The loot being..." "Three American Girl dolls. We're talking ten grand easy on the secondary market." Bill's eyes gleamed with a manic energy that she had learned to fear. "American Girl dolls? We're knocking over a kindergarten party for dolls?" She let out a sharp, incredulous laugh that echoed off the peeling wallpaper. "Plus, the birthday boy's dad is a regional manager for a tech company. 
There's going to be iPads in those gift bags, Molly. iPads!" "Okay, fine. Say I'm a clumsy princess and you're a heatstroke dinosaur. How do we get past the front door? Mrs. Henderson called the cops on us last month for the lemonade stand sting." "We go through the side gate. I checked it. It's unlocked. The dog is in the backyard." "The dog is a Rottweiler named Killer. He ate a mailman." "He's a big softie. I have bacon. I'll toss him the bacon while wearing the T-Rex costume and carrying the sack of iPads." Molly chewed her thumbnail until it bled. The rent was overdue and their fence was expecting electronics. She looked at her partner, a man who believed a blow-up dinosaur suit was the pinnacle of stealth. "Fine. But I'm not wearing the pink tutu." "It's a ball gown. I already bought it. It has sequins and a built-in tiara."
created_at
"2026-06-24T15:15:36.117Z"
6/24/2026, 3:15:36 PM
model
"z-ai/glm-5.2-20260616"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2374
moderation_latency
(null)
generation_time
111463
tokens_prompt
77
tokens_completion
7767
native_tokens_prompt
73
native_tokens_completion
7998
native_tokens_completion_images
(null)
native_tokens_reasoning
7344
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0352934
router
(null)
provider_responses
0
endpoint_id
"442ea97f-ad5e-40d5-b9a6-66e9e0417dce"
id
"20260624231536222c1b8bdb3246db"
is_byok
false
latency
2374
model_permaslug
"z-ai/glm-5.2-20260616"
provider_name
"Z.AI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1782314136-7G5IsyU9yLf4gU0cC0FO"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782314136-lXBSBHD2lqN9MPdclxKl"
upstream_id
"20260624231536222c1b8bdb3246db"
total_cost
0.0352934
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Z.AI"
response_cache_source_id
(null)
data_region
"global"
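The token counts above can be combined into a few derived figures. The sketch below rests on two assumptions that the page does not confirm: that `generation_time` is in milliseconds (111463 ms is about 1m 51s, close to the listed duration of 1m 52s), and that `native_tokens_reasoning` is included in `native_tokens_completion`.

```python
# Values copied from the run metadata.
native_tokens_completion = 7998
native_tokens_reasoning = 7344
generation_time_ms = 111463  # assumption: the unit is milliseconds

# Assumption: reasoning tokens are a subset of completion tokens,
# so the remainder approximates the tokens in the visible answer.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning.
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Average generation throughput over the whole completion.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under those assumptions, most of the completion budget went to reasoning rather than to the 500-word scene itself.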
Evaluation details
Result | Evaluator | Details | Meta Data
90.48% | Matches word count | n/a | neededClean: false; words: 490
72.70% | Dialogue to Total Word Ratio | Ratio: 67.47%, Deviation: 2.53% | neededClean: false; wordsTotal: 495; wordsDialogue: 334
81.5913% | Overall | |
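The reported ratio and deviation follow directly from the word counts in the meta data and the 70% target in the scenario. A minimal Python sketch of that arithmetic, where `dialogue_ratio` is a hypothetical helper; how the evaluator maps the deviation to the 72.70% score is not stated on the page and is not reproduced here:

```python
TARGET_RATIO = 0.70   # from the scenario: 70% dialogue
words_total = 495     # wordsTotal from the evaluator meta data
words_dialogue = 334  # wordsDialogue from the evaluator meta data


def dialogue_ratio(dialogue_words, total_words):
    """Fraction of words that are dialogue (hypothetical helper)."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return dialogue_words / total_words


ratio = dialogue_ratio(words_dialogue, words_total)
deviation = abs(TARGET_RATIO - ratio)

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 67.47%, Deviation: 2.53%"
```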