NC Bench
Model
Z.AI GLM 4.7
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYXEKFJSFQFBGJTBVZCN99Y
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The surveillance van smelled of stale corn chips and the lingering, desperate odor of a diamond heist gone wrong in '98. Through the scratched lens of the binoculars, the suburban bungalow didn't look like a fortress; it looked like a rainbow had violently exploded over a manicured lawn. Neon streamers flapped in the wind, and a battalion of pink balloons was currently staging a hostile takeover of the mailbox. It was a target rich in potential loot, yet terrifyingly high in auditory risk. Molly adjusted the focus, zooming in on the stack of gifts piled high on the patio furniture like a glittering pyramid of consumerist promise. "Listen to me, Bill," Molly hissed, tapping the side of the lens. "This isn't just a birthday party. It’s a goldmine of wrapped plastic and unopened electronics. We go in, we grab the bags, we get out before the cake cutting starts. It is an in-and-out job." Bill shifted uncomfortably in the passenger seat, his eyes darting toward the front yard where a group of six-year-olds were currently beating a defenseless dinosaur-shaped effigy with a wiffle bat. He rubbed the back of his neck, trying to calculate the sheer volume of high-pitched screaming awaiting them. It wasn't the police he feared; it was the sheer, unadulterated sugar-fueled energy of fifty kindergarteners hyped up on red dye number forty. They were like tiny, sticky terminators. "I don't know, Molls. The perimeter is swarming," Bill whispered, wiping a bead of nervous sweat from his forehead. "Look at the guy by the grill. He’s an ex-Marine. I can tell by the way he’s handling those tongs. And the kids... they’re unpredictable. One sees us, and the whole operation is blown with tears." Molly scoffed, turning away from the window to glare at her partner. She pulled out a blueprint of the house, which was crudely drawn on the back of a Denny's placemat. She tapped a frantic finger on the rear entrance near the garage, marking the weak point. 
The parents were distracted by the magician currently pulling a rabbit out of a hat, leaving the pile of presents near the sliding glass door completely unguarded. It was the perfect crime, provided they could survive the noise. "We have a distraction," she insisted, pointing toward the driveway where a desperate mother was pacing. "You’re going in as the entertainment. I saw an ad on Craigslist; the original mascot cancelled. You put on the giant kangaroo suit, you lead them to the backyard for a 'hopscotch tournament,' and I sack the living room." Bill looked down at the furry, marsupial monstrosity at his feet. It smelled like expectations, mothballs, and old gym socks. He realized then that there was no dignity left in the life of crime, only the crushing weight of a kangaroo head and the promise of stolen Xbox games. He sighed heavily, reaching for the costume with all the enthusiasm of a man walking to the electric chair. "Fine," Bill grumbled, sticking his foot into the plush suit. "But if I get tea-bagged by a six-year-old, I’m taking the larger cut of the Pokémon cards."
created_at
"2026-02-08T15:23:41.388408+00:00"
Sunday, February 8th, 2026 at 3:23:41 PM GMT+00:00
model
"z-ai/glm-4.7-20251222"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
272
moderation_latency
(null)
generation_time
174077
tokens_prompt
61
tokens_completion
3088
native_tokens_prompt
66
native_tokens_completion
3104
native_tokens_completion_images
(null)
native_tokens_reasoning
2602
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00543792
router
(null)
provider_responses
0
id
"chatcmpl-Rlqeaz6CPdGplWWrf5l3AJ37"
status
200
is_byok
false
latency
83
endpoint_id
"ab79ffa8-8dcc-41c4-9a9f-6cc408135243"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-20251222"
api_type
"completions"
id
"gen-1770564046-HfMkwI4OzdoZ4kIrrl9w"
upstream_id
"chatcmpl-Rlqeaz6CPdGplWWrf5l3AJ37"
total_cost
0.00543792
cache_discount
0.00002048
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
0.0098%
Evaluator
Matches word count
Details
n/a
Meta Data
neededClean
false
words
531
Result
85.7810%
Evaluator
Dialogue to Total Word Ratio
Details
Ratio: 28.25%, Deviation: 1.75%
Meta Data
neededClean
false
wordsTotal
531
wordsDialogue
150
42.8954%
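The ratio and deviation in the evaluation details above follow directly from the reported counts (150 dialogue words out of 531 total, against the 30% target in the scenario). The sketch below reproduces that arithmetic. The `dialogue_stats` helper is hypothetical: it assumes dialogue means text between double quotes and that words are whitespace-separated tokens, which may not match the evaluator's actual counting rules. The last step is only an observation: the final percentage on the page equals the arithmetic mean of the two evaluator results, although the page does not state how that figure is derived.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: "Write 500 words with 30% dialogue"


def dialogue_stats(text):
    """Return (dialogue_words, total_words) for a prose passage.

    Assumption (not confirmed by the page): dialogue is whatever sits
    between double quotes, straight or curly, and a word is any
    whitespace-separated token.
    """
    quoted_spans = re.findall(r'["\u201c]([^"\u201d]*)["\u201d]', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    total_words = len(text.split())
    return dialogue_words, total_words


def ratio_and_deviation(dialogue_words, total_words, target=TARGET_RATIO):
    """Return the dialogue share and its absolute distance from the target."""
    ratio = dialogue_words / total_words
    return ratio, abs(ratio - target)


# Counts reported in the evaluation meta data.
ratio, deviation = ratio_and_deviation(150, 531)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 28.25%, Deviation: 1.75%"

# Observation only: the mean of the two evaluator results.
mean_score = (0.0098 + 85.7810) / 2
print(f"{mean_score:.4f}%")  # prints "42.8954%"
```

Running `dialogue_stats` on the model response would give a rough cross-check of the 150 and 531 figures, but any difference in tokenization (hyphenated words, nested single quotes) could shift the counts slightly.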