NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.7
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYZEC6QGMF9ZYQJA3H8BZCB
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the rhododendron, adjusting the earpiece with a precision that betrayed the absurdity of the mission. "Target is in sight. The bouncy castle is inflated." "I can’t believe I agreed to this," Bill hissed into his microphone, wrestling with a oversized duffel bag. "I’m a cat burglar, Molly. I steal emeralds from penthouses. I do not steal plushies from a six-year-old named Timmy." "Desperate times call for desperate measures," Molly replied sharply, her eyes scanning the suburban lawn. "And that’s a high-end bouncy castle. It means the parents have money. The payload in that living room is probably eighty percent electronics." "It’s mostly noise-making toys, Molly. The goal of a heist is to get rich, not to get a migraine." "Stop whining. We need a tactical approach to the gift table. It’s guarded by a Labrador." "A what?" "A Golden Retriever. He looks sleepy, but he’s positioned right in front of the LEGO haul." "I’m not tackling a dog." "Then you distract him. I have a squeaky bone in the utility belt." "You came prepared." "I’m a professional. Now, here’s the plan. You create a diversion at the snack table. Spill the punch." "The red or the blue?" "The red. It stains. Absolute chaos. While the moms panic about the rug, I slide in, bag the presents, and we exfil through the patio doors." "What if one of the kids asks for my autograph?" "Give them a fake one. Sign it 'Santa.'" "That’s sick." "That’s business. Wait, movement. The clown is arriving." "Oh god, clowns," Bill groaned. "I hate this job." "The clown is a wild card. If he starts making balloon animals, the children will swarm. That’s our cover. We move when the balloon poodle is finished." "I still think we should just knock over a liquor store like normal criminals." "Liquor stores have cameras and security guards. This place has a piñata and a dad wearing a 'World’s Okayest Golfer' t-shirt. We’re practically stealing candy from a baby." "Well, technically..." "Shut up and get ready to spill the punch. On my mark. Three, two, one."
created_at
"2026-02-08T15:57:36.430859+00:00"
Sunday, February 8th, 2026 at 3:57:36 PM GMT+00:00
model
"z-ai/glm-4.7-20251222"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1489
moderation_latency
(null)
generation_time
120045
tokens_prompt
61
tokens_completion
2540
native_tokens_prompt
66
native_tokens_completion
2567
native_tokens_completion_images
0
native_tokens_reasoning
2073
native_tokens_cached
2
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.005117418
router
(null)
provider_responses
0
id
"a3ae43c254c7b4c8e2a724436f48c7e8"
status
200
is_byok
false
latency
1489
endpoint_id
"53d155e7-021c-4268-bde4-991847ef8389"
provider_name
"Novita"
model_permaslug
"z-ai/glm-4.7-20251222"
api_type
"completions"
id
"gen-1770566136-cw6fe1C8YVr2zbdybI8f"
upstream_id
"a3ae43c254c7b4c8e2a724436f48c7e8"
total_cost
0.005117418
cache_discount
9.8e-7
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
353
0.0000%
Dialogue to Total Word Ratio
Ratio: 88.67%, Deviation: 18.67%
neededClean
false
wordsTotal
353
wordsDialogue
313
0.0000%