NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.6
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYXGWJQRZT1HGET1Q9E53AP
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The inside of the van smelled of stale coffee and regret. Across the street, number 42 was a crime against good taste, a riot of pink and purple balloons clinging to a mailbox shaped like a cartoon castle. Molly hunched over a crudely drawn floor plan on the back of a pizza box, her pen tapping a nervous rhythm against a grease stain. Bill was meticulously picking the sprinkles off a crumbly donut. “The target is a seven-year-old’s birthday party. The payload is in the living room, by the giant inflatable pirate ship,” Molly said, her voice a low hiss. “Are those little sausages? In a blanket? I love those,” Bill mumbled, his eyes glazing over as he stared at the backyard. “Focus, Bill. The presents.” “Right, presents. But do you see the bouncy castle? We could…” “That’s the secondary objective. Your role is the diversion.” Bill sighed, a cloud of powdered sugar puffing from his lips. “The sad panda? I still feel like a bear would be more intimidating.” Molly didn’t look up. “A sad panda is disarming. Parents see a sad panda, they feel pity. They don’t see a threat. You will create a ten-minute window of social awkwardness for me to get in and out. I’ll enter through the patio door during the chaos of ‘Pin the Tail on the Donkey.’ The lock is a simple latch.” “And what if the presents are heavy? What if one of them is a Lego set? Those things are a nightmare to carry. A million tiny pieces. You step on one in the dark and it’s game over.” “We have a duffel bag, Bill. We’re professionals.” She finally met his gaze, her expression dead serious. “The extraction window is narrow. Once the presents are secured, we rendezvous back here. No detours.” “Right. Professionals.” He popped the last of the plain donut into his mouth. “So, after we grab the loot, we can still have cake, right? I saw a unicorn cake. That feels like a violation not to at least try.” Molly dropped her head into her hands, the pizza box crinkling under the strain. The heist was already a disaster.
created_at
"2026-02-08T15:22:56.727728+00:00"
Sunday, February 8th, 2026 at 3:22:56 PM GMT+00:00
model
"z-ai/glm-4.6"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
208
moderation_latency
(null)
generation_time
54817
tokens_prompt
61
tokens_completion
2471
native_tokens_prompt
66
native_tokens_completion
2490
native_tokens_completion_images
(null)
native_tokens_reasoning
2106
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0043385799999552
router
(null)
provider_responses
0
id
"chatcmpl-RWSKsQG3cBF4XcJJujBqvpDt"
status
200
is_byok
false
latency
58
endpoint_id
"c8607edc-8577-4199-b03d-1213da4743dc"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.6"
api_type
"completions"
id
"gen-1770564121-1K9KSTa0D1NyT6jKUuxC"
upstream_id
"chatcmpl-RWSKsQG3cBF4XcJJujBqvpDt"
total_cost
0.0043385799999552
cache_discount
0.0000224000000448
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
364
0.0000%
Dialogue to Total Word Ratio
Ratio: 59.62%, Deviation: 29.62%
neededClean
false
wordsTotal
364
wordsDialogue
217
0.0000%