NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.7 Flash
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYZF254AT4PAA7RSHS93J8J
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The surveillance van sat like a metal hermit crab in the shade of the ancient oak tree, engine idling with a nervous, guttural shudder. Molly sat in the driver’s seat, the high-tech tablet on her lap lighting up her face like a surgeon’s operating table. She wasn’t looking at city maps or blueprints; she was watching a live HD feed of a toddler aggressively pedaling a tricycle. Her eyes were sharp, dissecting the perimeter for any gaps in the security net provided by over-enthusiastic suburban parents. "Are you sure this is the prime location?" Bill asked, slumped low in the passenger seat. He was clutching a jagged piece of PVC pipe like a sacred religious relic. "I’m seeing a lot of variables. The swing set is a death trap. High trajectory. Risk of serious impact." "It’s plastic, Bill," Molly snapped, keeping her voice a harsh, covert whisper. "That kid is three. He thinks spinning around really fast counts as parkour. We have a strictly limited thirty-second window before the parents start passing out wine coolers and the security guard gets lazy." "I brought the grappling hook," Bill said, grinning with yellowed teeth. "And the glitter bomb. The instructions said it would cause temporary blindness." "We’re not causing blindness, we’re causing confusion. We’re professionals. You just want to blow things up." She swiped a finger across the screen, muting the audio feed. "Okay, listen. The target is the picnic blanket on the lawn. The loot is in three specific boxes: the Lego set, the Nerf gun, and the crates of fruit snacks. That is all." "What about the unicorn piñata?" Bill asked, gesturing wildly with the PVC pipe. "That seems like the tactical center. If we take that, we take the psychological victory." "The unicorn is too buoyant, Bill. It floats on air. We can't catch it. We need something dense. Something with heavy cubic weight." Molly checked her watch, then looked at her partner. "You ready?" "I'm ready to vanish," Bill said, zipping his jacket up to his chin. "But what if the baby cries? I panic around babies. They smell like milk and fear. It’s corrosive." "They smell like desperation. That’s what we feed on. It’s our fuel." She gripped the steering wheel, knuckles white. "Go. Hit the lawn, grab the snacks, and get back here before the Jehovah’s Witnesses walk past the van." "You really don't like them, do you? What did they ever do to you?" "They have a sales quota, Bill. It’s corporate espionage. Now, execute." Bill shoved the door open and sprinted into the blinding sun, PVC pipe leading the charge. "Party time!"
created_at
"2026-02-08T15:56:31.923131+00:00"
Sunday, February 8th, 2026 at 3:56:31 PM GMT+00:00
model
"z-ai/glm-4.7-flash-20260119"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
558
moderation_latency
(null)
generation_time
32260
tokens_prompt
61
tokens_completion
3788
native_tokens_prompt
66
native_tokens_completion
3804
native_tokens_completion_images
(null)
native_tokens_reasoning
3535
native_tokens_cached
22
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015249
router
(null)
provider_responses
0
id
"08daa91c70008678d308fab465e81ce0"
status
200
is_byok
false
latency
558
endpoint_id
"6999081f-9017-4517-bbdb-47df8778c940"
provider_name
"Novita"
model_permaslug
"z-ai/glm-4.7-flash-20260119"
api_type
"completions"
id
"gen-1770566158-g4mRfGUJvjYiLLx4MO5Z"
upstream_id
"08daa91c70008678d308fab465e81ce0"
total_cost
0.0015249
cache_discount
0.00000132
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
441
3.9163%
Dialogue to Total Word Ratio
Ratio: 58.05%, Deviation: 8.05%
neededClean
false
wordsTotal
441
wordsDialogue
256
1.9581%