NC Bench
Model: Z.AI GLM 4.6
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Run ID: 2026-02-08-01KGYYXWV8M42J79697BWM65Q1
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the crayon-smeared blueprint across the hood of the stolen sedan, her expression deadly serious. The diagram depicted a suburban backyard with alarming detail. A gravity-defying castle of primary colours dominated the centre, labelled ‘The Bouncy Sentinel’. Bill, meanwhile, was trying to discreetly eat a bag of cheese puffs he’d shoplifted from the garage. “Alright, let’s run it one more time,” Molly said, tapping a spot marked ‘Payload’. “The objective is the complete acquisition of all wrapped presents. No exceptions.” Bill wiped orange dust from his chin. “Right. The loot. But I still don’t get why we can’t just wait until they all go home. Less… screaming.” Molly fixed him with a withering glare. “Amateur. The parental units enter a high-alert state post-party. The security perimeter tightens. Now, during the piñata bash, that’s our window of opportunity. The Sugar-Addled Horde will be completely fixated. It’s the perfect diversion.” She pointed at a stick figure wearing a cape. “This is our wildcard. ‘Captain Chaos’. Age six. Highly unpredictable. He guards the gift table with the ferocity of a rabid badger. Bill, you’re on Chaos duty.” “Me? What do I do? Challenge him to a staring contest?” “You engage him,” Molly explained, her patience thinning. “You ask him about his superpowers. You compliment his cape. You become his sidekick. Keep him occupied for ninety seconds.” “Ninety seconds? He’s a kid, not a bomb. What if he asks me my superpower?” “Tell him you can eat an entire bag of cheese puffs without breathing,” Molly snapped, snatching the bag from him. “Now, for the extraction. I’ll trigger the acoustic diversion device.” She reached into the back seat and pulled out a giant, rainbow-coloured party horn. Bill stared at it. “That’s the… device?” “It’s multi-phonic,” she said defensively. 
“Once the Horde is disoriented by the sound, we grab the payload and make for the extraction vehicle.” She gestured towards a little red wagon with a squeaky wheel leaning against the fence. Bill’s eyes lit up. “A wagon? I get to pull the wagon?” Molly sighed, the sound of a woman who chose her partners poorly. “Yes, Bill. You get to pull the wagon.” “Awesome,” he grinned, already forgetting about Captain Chaos. “Can we get one of those flags that sticks up?”
created_at: "2026-02-08T15:47:33.284595+00:00" (Sunday, February 8th, 2026 at 3:47:33 PM GMT+00:00)
model: "z-ai/glm-4.6"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 260
moderation_latency: (null)
generation_time: 56681
tokens_prompt: 61
tokens_completion: 1990
native_tokens_prompt: 66
native_tokens_completion: 1996
native_tokens_completion_images: (null)
native_tokens_reasoning: 1493
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0034790199999552
router: (null)
provider_responses:
  0:
    id: "chatcmpl-RPyxgtby0CgPCrBDZoVMx3Fw"
    status: 200
    is_byok: false
    latency: 76
    endpoint_id: "c8607edc-8577-4199-b03d-1213da4743dc"
    provider_name: "DeepInfra"
    model_permaslug: "z-ai/glm-4.6"
    api_type: "completions"
id: "gen-1770565596-bq4Vae4FGvmlN1cYrWdd"
upstream_id: "chatcmpl-RPyxgtby0CgPCrBDZoVMx3Fw"
total_cost: 0.0034790199999552
cache_discount: 0.0000224000000448
upstream_inference_cost: 0
provider_name: "DeepInfra"
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.0000% | Matches word count | n/a | neededClean: false; words: 384 |
| 0.9638% | Dialogue to Total Word Ratio | Ratio: 59.64%, Deviation: 9.64% | neededClean: false; wordsTotal: 384; wordsDialogue: 229 |
0.4819%
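
The ratio and deviation reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the wordsTotal and wordsDialogue counts with simple arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, the 50% target is taken from the scenario, and the quote-based dialogue counter is an assumption about how dialogue words might be identified, not the benchmark's actual evaluator. It does not attempt to reproduce the Result percentages, whose scoring formula is not shown here.

```python
import re


def dialogue_ratio(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio, deviation) of dialogue words against a target share.

    `target` defaults to 0.5 because the scenario asks for 50% dialogue.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


def count_dialogue_words(text: str) -> int:
    """Count whitespace-separated words inside curly double quotes.

    Assumption: dialogue is whatever sits between an opening and a closing
    curly quote. The real evaluator may tokenize or detect dialogue differently.
    """
    spans = re.findall(r"“([^”]*)”", text)
    return sum(len(span.split()) for span in spans)


# Counts taken from the evaluation metadata above.
ratio, deviation = dialogue_ratio(words_dialogue=229, words_total=384)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 59.64%, Deviation: 9.64%"

# Shortfall against the 500-word target from the scenario.
print(f"Word count shortfall: {500 - 384} words")
# prints "Word count shortfall: 116 words"
```

For example, `count_dialogue_words("Molly sighed. “Yes, Bill. You get to pull the wagon.”")` counts the eight words inside the quotation marks under this assumption.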