NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.6
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYXDCDN21T4E7QDV1PBX1MN
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a blurry photo on her phone. “The target is six-year-old Timmy Anderson’s superhero-themed birthday party. Reconnaissance puts the present pile in the living room, by the fireplace.” Bill squinted. “He looks like he’s had three too many juice boxes. That’s our security detail? A hyped-up first-grader?” “Don’t underestimate him, Bill. He’s got a party blower and he’s not afraid to use it.” Molly zoomed in on a plate of cupcakes. “Our window of opportunity is when the magician starts his second act. Parents are distracted, captivated by a man pulling a gerbil out of his trouser pocket.” “The bouncy castle, Molly. It’s a giant, inflated witness right in the backyard. How do we get past that?” “That’s your job. You’re the diversion.” Bill sat up straight. “Diversion? I like the sound of that. Am I creating a power outage? Faking a medical emergency?” “You’re going to be a clown.” He stared at her. “A clown.” “Yes. I’ve got the costume in the back. Red nose, oversized shoes, the whole nine yards. You’ll go in and make balloon animals. A badly-made poodle will distract everyone long enough for me to bag the assets.” “Bag the… Molly, we’re stealing presents from a child. It’s a Lego Death Star and, according to my intel from the checkout line, a vaguely anatomical model of a horse. This isn’t exactly the Casino de Monte-Carlo.” “A score is a score, Bill. We get in, we get the goods, we exfiltrate. I’ll load the presents into the wheelie bin by the curb. You’ll create a commotion—a tragic balloon-popping incident—and meet me at the van.” He sighed, running a hand through his hair. “A tragic balloon-popping incident. And what if one of the parents asks me to twist a giraffe?” “Improvise, Bill. You’re a professional.” Molly grinned, a predator’s glint in her eye. “Just don’t accept any cake. We’re on the clock.”
created_at
"2026-02-08T15:20:55.4664+00:00"
Sunday, February 8th, 2026 at 3:20:55 PM GMT+00:00
model
"z-ai/glm-4.6"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
200
moderation_latency
(null)
generation_time
48232
tokens_prompt
61
tokens_completion
1840
native_tokens_prompt
66
native_tokens_completion
1851
native_tokens_completion_images
(null)
native_tokens_reasoning
1482
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0032267199999552
router
(null)
provider_responses
0
id
"chatcmpl-RbF0Ga0BpfuvH675ISH2pT2l"
status
200
is_byok
false
latency
55
endpoint_id
"c8607edc-8577-4199-b03d-1213da4743dc"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.6"
api_type
"completions"
id
"gen-1770564006-0ximcQWOqIvB0WHLQ3MQ"
upstream_id
"chatcmpl-RbF0Ga0BpfuvH675ISH2pT2l"
total_cost
0.0032267199999552
cache_discount
0.0000224000000448
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
327
0.0000%
Dialogue to Total Word Ratio
Ratio: 87.16%, Deviation: 17.16%
neededClean
false
wordsTotal
327
wordsDialogue
285
0.0000%