NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.7 Flash
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2026-02-08-01KGYYKSK8QFG3A4GF2AH6Q49E
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly held up a glossy invitation against the flickering ceiling fan. It looked innocent, folded in quarter inches like a fragile origami crime scene. She stared at it with the intensity of someone appraising a diamond necklace worth millions. "Right here," she said, tapping the paper with a chipped fingernail. "The Golden Birthday. Triplets. Two dads. The target is the domestic safety zone." "The zone?" Bill asked, staring out the window, his hand hovering over the latch. "The target is their bank account, Molly. We can't rob the furniture. The van doesn't fit under their deck." "We go through the hedge. Old French Provincial style. Classic OAP entry. But the loot, Bill. The loot is the holy grail. Look at the registry." "Why all of them? You only really need the big ones. The Lego set. The iPhone. Maybe the waffle iron if you're feeling fancy." "That’s petty. That’s something a kindergarten teacher does, not a master criminal. I want the Macaroni Artist Starter Kit, too. I want the molds. I want the glitter glue. If the kid turns out to be a vampire, do you want to be unprepared? I don't want him putting up defenses while we try to rob his house. It’s basic preparedness." "Okay, first of all, stop supervillaining. Second of all, how are we getting past the security? Two dads on the lawn. One is probably taking photos of the sunshine. The other is probably crying because he doesn't know what a 'jam session' is." "The play is the cake," Molly said, her eyes gleaming. "We disrupt the perimeter while the dads are distracted by the calorie count. I’ll set off the smoke alarm using the leftover smoke detector from the convenience store heist. They’ll panic. They’ll open the windows. You waltz in on that rented scooter you bought online." "The red one that squeaks?" "The red one. You slide under the table, you snatch the crates, and you back out through the window. Speed is key. You ride that scooter like you owe the universe money. You look desperate, like you’re racing to get to the finish line." "I look like a toddler in stretchy pants," Bill groaned, pulling a dark hoodie over his head. "And what about the presents? They are in the living room. The glass cabinet." "We smash it. With a bat. You’re the muscle, Bill. It’s what you do." "And then what? We strap thirty presents to a scooter?" "It’s aerodynamics, Bill. Think of the drag coefficient. Now zip it. The pizza delivery is five minutes out. Go."
created_at
"2026-02-08T15:41:57.177757+00:00"
Sunday, February 8th, 2026 at 3:41:57 PM GMT+00:00
model
"z-ai/glm-4.7-flash-20260119"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
179
moderation_latency
(null)
generation_time
51624
tokens_prompt
61
tokens_completion
3349
native_tokens_prompt
66
native_tokens_completion
3366
native_tokens_completion_images
(null)
native_tokens_reasoning
3018
native_tokens_cached
32
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0013487600000064
router
(null)
provider_responses
0
id
"chatcmpl-R6bRU93QkgQ2bABlUUCHMxbL"
status
200
is_byok
false
latency
53
endpoint_id
"6efb5994-8fcb-4300-9c64-ba393193f5f6"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-flash-20260119"
api_type
"completions"
id
"gen-1770565265-dUIaejxbFYifPEUKGeli"
upstream_id
"chatcmpl-R6bRU93QkgQ2bABlUUCHMxbL"
total_cost
0.0013487600000064
cache_discount
0.0000015999999936
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
428
0.0000%
Dialogue to Total Word Ratio
Ratio: 82.71%, Deviation: 32.71%
neededClean
false
wordsTotal
428
wordsDialogue
354
0.0000%