NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.7 Flash
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYWZE3S8PF0CZ03Q2TJXG19
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The alleyway smelled of wet cardboard and unwashed children, a scent that usually made Molly’s stomach churn unless there was a new PlayStation or a solid gold watch inside involved. She stood with her back to the brickwork, shielding her eyes from the sodium streetlamps that hummed with a headache-inducing buzz. The house across the alley was drowning in decorations: enough streamers to clothe a clown, a banner that spelled "KELLY'S BIG NIGHT," and a piñata shaped like a mummy that swung lazily in the breeze, threatening to hit a pedestrian at any moment. "That’s the objective?" Bill asked, his voice muffled behind his massive aviator sunglasses. He looked like a raccoon caught in a Dumpster, wearing a raincoat that zipped up the wrong way. "That’s the battlefield," Molly corrected, peering through a cheap pair of binoculars. "And the loot is inside. Look at that wall of wrapped boxes in the dining room. It’s a logistical nightmare, but it’s also a monetary opportunity." Bill adjusted his glasses, peering at the house through a fogged-up lens. "I see parents. Three of them. One looks like she’s passed out in a lawn chair, a waffle cone in one hand and her shoe in the other. The other two are wrestling with a magician who appears to be made of scarves. We’ve got a window of about two hours before the sugar crash turns the living room into a war zone." "Two hours is plenty of time to secure the merchandise." "Okay, Miss Noir. What’s the entry point?" "The garage door. It was open when we passed by earlier. I picked the lock with a paperclip; it’s too easy, it’s almost insulting." Bill groaned, rubbing his temples. "You’re the boss. Just... don't touch the cake." "The cake is safe. It's in a kitchen that is currently occupied by an intoxicated magician." "Not if you jostle the pile," Bill warned, digging into his satchel. "The poop emoji frosting is bulletproof. We don't mess with the poop cake. It’s an ancient law." "Fine. We leave the cake. We grab the electronics. We steal a stack of cards for gas money. That’s the transaction." Molly grabbed the door handle of the battered sedan they were parked behind. "Okay, listen up. Rule number one: We are ghosts. Rule number two: If we hear a dog bark, we pretend we are part of the piñata party." "What if the dog barks and then we shoot a glance at each other?" "We die, Bill. We die quickly." Molly pushed the car door open with a screech of hinges. "Moving out."
created_at
"2026-02-08T15:13:28.201452+00:00"
Sunday, February 8th, 2026 at 3:13:28 PM GMT+00:00
model
"z-ai/glm-4.7-flash-20260119"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
268
moderation_latency
(null)
generation_time
58312
tokens_prompt
61
tokens_completion
3561
native_tokens_prompt
66
native_tokens_completion
3586
native_tokens_completion_images
(null)
native_tokens_reasoning
3224
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0014351600000128
router
(null)
provider_responses
0
id
"chatcmpl-RCR9fYGzFEwY7ALEqWmGcaV7"
status
200
is_byok
false
latency
109
endpoint_id
"6efb5994-8fcb-4300-9c64-ba393193f5f6"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-flash-20260119"
api_type
"completions"
id
"gen-1770563549-j3kFFJ93XWeKvOA6KLe1"
upstream_id
"chatcmpl-RCR9fYGzFEwY7ALEqWmGcaV7"
total_cost
0.0014351600000128
cache_discount
0.0000031999999872
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
433
0.0000%
Dialogue to Total Word Ratio
Ratio: 59.12%, Deviation: 29.12%
neededClean
false
wordsTotal
433
wordsDialogue
256
0.0000%