NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.6
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYYMH8G0J7DJJZ7NRA8P0QC
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sedan smelled of stale coffee and desperation. From their vantage point across the street, the target house—a cheerful bungalow vomiting a rainbow of streamers onto the lawn—was less a fortress and more a minefield of pure, unadulterated joy. A battalion of inflatable unicorns guarded the driveway. Through the bay window, the high-pitched shrieking of a dozen under-sugared seven-year-olds was visible as a frantic, blurry motion. Molly ignored the cacophony, her focus locked on the glowing tablet in her lap. A detailed schematic of the house’s ground floor was overlaid with a tactical plan. Beside her, Bill was watching the party through binoculars, his expression a mixture of awe and horror. “Are you sure about this, Molls?” he asked, lowering the binoculars. “The tactical approach seems… aggressive.” “It’s a two-pronged assault, Bill,” Molly said, not looking up. “One creates a diversion. The other secures the asset. The assets, plural. The presents.” Bill squinted at the house. A mountain of gaudily wrapped cardboard was teetering precariously next to a fireplace. He could just make out the corner of a box promising a ‘Super Soaker 9000’. “A clown?” he said, his face, usually a rugged landscape of criminal indifference, falling as he spotted the costume crumpled in the back seat. “You want me to go in there as a clown? The squeaky shoes are a security risk. They announce your presence. And the nose. It honks. What if I need to be stealthy?” “The honk is a secondary distraction,” Molly snapped, finally tearing her eyes away from the screen. “And you won’t need to be stealthy. You’re the diversion. You’re supposed to be seen.” “But the face paint gets itchy,” Bill whined, gesturing at his own rugged face. “And what if one of them has a phobia? We could be looking at a psychological trauma lawsuit.” “Bill, for the last time, we are not litigating the emotional distress of a seven-year-old. You are a clown. Your job is to be loud and regrettable. Now, focus on the party-poppers.” She pointed a manicured finger at a bag on the floor. Just then, a new sound drifted from the house, a synthesized horror that made the hair on her arms stand on end. It was the opening notes of "The Wheels on the Bus." A muscle twitched in her jaw. She snapped the laptop shut. The plan was set. It was time to go to work.
created_at
"2026-02-08T15:42:40.898656+00:00"
Sunday, February 8th, 2026 at 3:42:40 PM GMT+00:00
model
"z-ai/glm-4.6"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
388
moderation_latency
(null)
generation_time
71072
tokens_prompt
61
tokens_completion
2492
native_tokens_prompt
66
native_tokens_completion
2511
native_tokens_completion_images
(null)
native_tokens_reasoning
2117
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0043751199999552
router
(null)
provider_responses
0
id
"chatcmpl-R5pYlF5jwm3NyWSmK29utGLk"
status
200
is_byok
false
latency
126
endpoint_id
"c8607edc-8577-4199-b03d-1213da4743dc"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.6"
api_type
"completions"
id
"gen-1770565289-sLuMCpzAlTReUY786oO0"
upstream_id
"chatcmpl-R5pYlF5jwm3NyWSmK29utGLk"
total_cost
0.0043751199999552
cache_discount
0.0000224000000448
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
412
6.5729%
Dialogue to Total Word Ratio
Ratio: 37.38%, Deviation: 7.38%
neededClean
false
wordsTotal
412
wordsDialogue
154
3.2864%