NC Bench
Overview
About
Tests
Models
Vendors
Model
Z.AI GLM 5
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-11-01KH7GREEC1JZMCWEB0YYB2692
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the rearview mirror, staring past the steering wheel to the suburban house at the end of the cul-de-sac. "Okay, look alive. We’re looking at a standard single-point entry through the side gate. Visibility is low, but the target density is high." Bill squinted through the windshield, unwrapping a hoagie with a crinkle of wax paper. "Target density? It’s a backyard, Molly. I see a bouncy castle and a guy making balloon animals. Is that a sword? He just made a sword." "Forget the sword. Focus on the loot. We have a pile of presents approximately four feet high located under the maple tree. Ribbons, paper, the works. It’s a high-yield grab, Bill. We’re talking LEGOs, maybe a bicycle, definitely some high-end electronics if this kid’s parents are as loaded as the file suggests." "File? You mean the flyer you stole from the grocery store bulletin board? It’s a six-year-old’s birthday party, not the Crown Jewels." "Exactly. Low security, high emotional volatility. When the cake comes out, everyone’s distracted. That’s our window. You go low, I go high. We bag the goods and vanish." "You want me to tackle a six-year-old?" "I want you to tackle the gift table. Keep your interactions with the hostiles to a minimum. If confronted, use the cover story." "I’m still not comfortable being your 'eccentric Uncle Steve.' I don’t even have a fake beard." "You have a mustache. Grow the rest out in your head. It’s called method acting, Bill." Bill took a massive bite of his sandwich, chewing thoughtfully. "What about the clown? I don’t trust him. He’s got eyes like a shark." "He’s a birthday clown. He’s probably high on nitrous. Ignore him. The real threat is the mother. She’s the gatekeeper. I saw her checking a clipboard. She’s organized. That’s dangerous." "She’s just checking RSVPs." "She’s doing a headcount. That means she’ll notice when the stack of gifts starts shrinking. We need to be surgical. In and out. No traces." "You know," Bill said, wiping mayo off his chin, "there’s a piñata. We could wait for the piñata. When that thing breaks, it’s total chaos. Candy everywhere. Kids screaming. It’s the perfect diversion." Molly looked at him, her eyes narrowing. "That’s... actually not terrible. The piñata creates a cone of confusion. Blindfolds, bats, sugar-crazed maniacs swinging for the fences. The parents will be focused on preventing concussions." "See? I’m an ideas man. So, we wait for the papier-mâché donkey to die?" "We wait. But keep your mask ready. The second that candy hits the grass, we move on the presents." "I still think we should just grab the goodie bags and run. I heard there are minifigurines in there." "We don't do small-time anymore, Bill. We’re professionals. Now, put the sandwich down and check the perimeter. I think I see the dad bringing out the cooler. Phase one is imminent." "Fine. But if that clown comes at me with a balloon sword, I’m throwing a punch." "You throw a punch at a clown, you lose the moral high ground. Just get the gifts."
created_at
"2026-02-11T23:34:39.727518+00:00"
Wednesday, February 11th, 2026 at 11:34:39 PM GMT+00:00
model
"z-ai/glm-5-20260211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
577
moderation_latency
(null)
generation_time
151842
tokens_prompt
77
tokens_completion
1953
native_tokens_prompt
66
native_tokens_completion
1832
native_tokens_completion_images
(null)
native_tokens_reasoning
1138
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0059284
router
(null)
provider_responses
0
status
429
is_byok
false
latency
103
endpoint_id
"14b8ca12-e20c-422a-b2e1-0b0f7f8048bf"
provider_name
"Parasail"
model_permaslug
"z-ai/glm-5-20260211"
1
id
"476237467dce4eb889939163ef5a8a6f"
status
200
is_byok
false
latency
577
endpoint_id
"a3983e50-dec5-4453-80f6-8ab726493ca4"
provider_name
"Friendli"
model_permaslug
"z-ai/glm-5-20260211"
api_type
"completions"
id
"gen-1770852727-FmS3zQElpvlWU1APweDP"
upstream_id
"476237467dce4eb889939163ef5a8a6f"
total_cost
0.0059284
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Friendli"
Evaluation details
Result
Evaluator
Details
Meta Data
2.0116%
Matches word count
n/a
neededClean
false
words
525
0.0000%
Dialogue to Total Word Ratio
Ratio: 88.57%, Deviation: 18.57%
neededClean
false
wordsTotal
525
wordsDialogue
465
1.0058%