NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.6
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYYBF6ZMZHYG7X001XVYS2R
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled of stale coffee and desperation. Through a pair of binoculars, Bill watched a rainbow-colored chaos erupt across the street. Small humans, fueled by cake and pure unadulterated joy, swarmed a bouncy castle shaped like a castle. A real, live pony stood nearby, looking deeply philosophical. “Is that a real pony?” Bill asked, lowering the binoculars. “Because that feels like overkill for a six-year-old’s party.” “It’s a tactical distraction, Bill,” Molly snapped, not looking up from the blueprints she’d drawn on a grease-stained napkin. “A four-legged, hay-fueled security risk. We need to account for it. Now, focus. Which one is the target, Chloe?” “The one in the tiara and the Sparkle-Mania princess costume,” Bill said, pointing. “She’s currently trying to ride the pony like it’s a roller coaster.” “Good. That means her attention is occupied. Now, for the plan. Phase One: Infiltration. I go in disguised as ‘Aunt Gladys,’ a distant relative with a terrible gift. That creates confusion and goodwill.” “What’s the terrible gift?” “A self-help book on overcoming sugar addiction,” Molly said flatly. “While the parents are awkwardly fumbling with that, you create a secondary diversion at the far end of the yard.” “And my part?” “You’re on crowd control,” Molly said, pointing to a large plastic tub in the back of the van. “With the bubble machine. Unleash a torrent of bubbles. The kids will be mesmerized.” “We’re using a bubble machine for a heist?” “This isn’t a heist, Bill, it’s a tactical acquisition of assets. And yes, we are. It’s surprisingly effective. While they’re lost in the shimmering sphere-pocalypse, I’ll be by the present table. I need you to bring the extraction device.” Bill hefted a garish gift bag with a cartoon unicorn on it that was nearly as tall as he was. “The Bag of Holding?” “The Bag of Holding. Now, the window of opportunity is the ‘Pin the Tail on the Donkey’ event. It requires blindfolds and spins, which is pure chaos. That’s our moment. We grab everything and vanish.” Bill peered through the binoculars again. “So the window is when they’re pinning the tail? Can I try to pin the tail first? I’m really good at that.” Molly didn’t answer. She just put her face in her hands and took a deep, shuddering breath. Then she opened the van door, the scent of bubble soap and impending doom filling the air.
created_at
"2026-02-08T15:37:25.782978+00:00"
Sunday, February 8th, 2026 at 3:37:25 PM GMT+00:00
model
"z-ai/glm-4.6"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
266
moderation_latency
(null)
generation_time
52810
tokens_prompt
61
tokens_completion
2187
native_tokens_prompt
66
native_tokens_completion
2196
native_tokens_completion_images
(null)
native_tokens_reasoning
1666
native_tokens_cached
32
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0038382199999776
router
(null)
provider_responses
0
id
"chatcmpl-RbPlR0fKJZL8Pe1ZxK4iYARo"
status
200
is_byok
false
latency
67
endpoint_id
"c8607edc-8577-4199-b03d-1213da4743dc"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.6"
api_type
"completions"
id
"gen-1770564992-yXX0F2oNn038msABexol"
upstream_id
"chatcmpl-RbPlR0fKJZL8Pe1ZxK4iYARo"
total_cost
0.0038382199999776
cache_discount
0.0000112000000224
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
409
0.0100%
Dialogue to Total Word Ratio
Ratio: 63.57%, Deviation: 13.57%
neededClean
false
wordsTotal
409
wordsDialogue
260
0.0050%