NC Bench
Model
Z.AI GLM 4.7 Flash
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYZ2DJDMT6J0W4DJDSF5EY4
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill slammed the lid of his toolbox shut. The sound echoed through the concrete block, signaling the end of a negotiation that had already lasted too long.
"Bill, look at the timeline. Sarah is bringing a three-tier cake."
"Cake is edible evidence, Molly. It must be disposed of immediately. In the dumpster."
"What about the balloons? Those are tripwires. She ties them to everything."
"You’re overthinking this. The balloons are for the kids. The presents are for us. It’s basic mathematics."
"There’s a clown in the backyard. Ringmaster Rudy."
"Rudy? We can bludgeon him with a banana. Or distract him with a rubber chicken."
"Rudy has a sledgehammer. He uses it to pound nails into the cake stands to keep them stable."
"Brilliant. A heavy object. Perfect for smashing glass and scaring the toddlers."
"Molly, we are thieves, not demolition experts."
"In this job, everything is for sale. Even the destruction. That sledgehammer will look great on the mantle."
Bill rubbed his tired eyes and leaned back against a stack of rusty barrels. "Fine. But if we get caught by Rudy, I’m singing. I will name names. I’ll say you dress like a bat."
"You can say that. It’s true, isn't it?"
"Also, I’m not wearing the tight neon spandex. It’s undignified."
"You’ll look sleek. Like a wasp. Or a kick-ass vampire."
"And if we don’t steal the Transformer set, I’m singing too."
"I will pay you. I’ll give you the entire bag of donuts from the loot bags."
"I’ll take the Twinkies. And the switch."
Bill groaned, reaching into his pocket to pull out a schematic map of the house. He tapped a specific spot on the driveway. "Do we really need the surveillance van? It’s a Saturday."
"We need it for the air conditioning. The van is the only thing keeping us sane in this heat."
"I’m going to regret this."
created_at
"2026-02-08T15:50:32.552829+00:00"
Sunday, February 8th, 2026 at 3:50:32 PM GMT+00:00
model
"z-ai/glm-4.7-flash-20260119"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
179
moderation_latency
(null)
generation_time
87630
tokens_prompt
61
tokens_completion
3211
native_tokens_prompt
66
native_tokens_completion
3232
native_tokens_completion_images
(null)
native_tokens_reasoning
3130
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0012935600000128
router
(null)
provider_responses
0
id
"chatcmpl-REXg6z2zHSrWTtLnnz97PTj0"
status
200
is_byok
false
latency
53
endpoint_id
"6efb5994-8fcb-4300-9c64-ba393193f5f6"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-flash-20260119"
api_type
"completions"
id
"gen-1770565744-DvNAqjp35bCHedy4lnTp"
upstream_id
"chatcmpl-REXg6z2zHSrWTtLnnz97PTj0"
total_cost
0.0012935600000128
cache_discount
0.0000031999999872
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 315
0.0000% | Dialogue to Total Word Ratio | Ratio: 79.68%, Deviation: 29.68% | neededClean: false, wordsTotal: 315, wordsDialogue: 251
0.0000%
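The reported ratio and deviation follow from the word counts in the Meta Data column and the 50% target stated in the scenario. The Python sketch below reproduces that arithmetic. It is not the evaluator's implementation: the function names are hypothetical, and the quote-based dialogue detection and whitespace word splitting are assumptions, so the counts it produces on the full response may differ from the reported 251 and 315.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio_percent) for text.

    Assumption: dialogue is whatever sits between double quotes (straight or
    curly), and words are whitespace-separated tokens. The benchmark's own
    counting rules are not documented on this page.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = 100.0 * dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio_percent: float, target_percent: float = 50.0) -> float:
    """Absolute distance, in percentage points, from the requested share."""
    return abs(ratio_percent - target_percent)


# Reproduce the reported figures from the counts in the Meta Data column.
reported_ratio = 100.0 * 251 / 315
reported_deviation = deviation_from_target(reported_ratio)
print(f"Ratio: {reported_ratio:.2f}%, Deviation: {reported_deviation:.2f}%")
# prints "Ratio: 79.68%, Deviation: 29.68%"

# Tiny usage example on a single line of the scene.
sample = 'Bill sighed. "Fine. But I am singing."'
print(dialogue_ratio(sample)[:2])  # prints "(5, 7)"
```

How a 29.68-point deviation maps to the 0.0000% result is not shown on the page, so the sketch stops at the deviation.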