NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.7
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYYMR4YE7P2FTWSVMGQE95J
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprints were spread across the hood of the van, though they were less a technical schematic and more a crayon-scrawled menu from a pizza place. Molly tapped a greasy finger against a drawing of a pepperoni slice. "Entry point is the side gate near the hedge maze," Molly whispered, her eyes scanning the suburban street. "We wait for the magician to arrive. That’s our window." Bill shifted in the driver’s seat, adjusting his balaclava so it didn't squish his nose. "A magician? Seriously? I thought we were doing banks now. High stakes." "Banks have cameras, Bill. Six-year-olds have sugar rushes and zero attention spans," Molly snapped. "We go in low, we grab the haul, we get out. The payload is significant." "Payload? It’s a bunch of wrapped plastic." "It's not just plastic. I saw the registry. There’s a limited-edition cyber-droid in that pile. Retail value? Three hundred bucks. Plus the batteries." "We’re risking felony charges for a robot doll?" "Not just one. There’s twenty kids, Bill. That’s twenty gift bags. We’re talking a diversified portfolio of assets. Legos, board games, maybe a gaming console if the birthday boy’s an only child." "I just feel weird stealing from a kid named Timmy." "Timmy is a capitalist oppressor, Bill. He has everything. We’re just redistributing the wealth to the... financially challenged." She pointed a gloved hand at a colorful banner flapping in the wind. "See that?" "The 'Happy Birthday' banner?" "That’s the target. The presents are stacked right under the food table. It’s a classic honey trap." "How do we blend in? I didn't bring my 'cool uncle' costume." "You’re going in as the entertainment. I put in a call. The regular clown cancelled. Food poisoning." Bill’s eyes went wide behind the mask. "You want me to be a clown? I don't know how to make balloon animals." "You don't need to make them. You just need to distract them. Make a balloon poodle, throw it into the bouncy castle, and cause chaos. While the kids are diving for cover, I swipe the bags." "And what if the parents ask questions?" "Tell them you’re from the union. The Clown Union Local 402. They won't argue with bureaucracy." "This is the worst heist we’ve ever planned." "It’s the most efficient, Bill. Look at the security. It’s a golden retriever and a grandma in a lawn chair. Minimal risk, maximum reward." "I still don't like the shoes." Bill looked down at his oversized red sneakers. "I feel ridiculous." "Confidence, Bill. You’re a criminal mastermind disguised as a buffoon. Act natural." "I hate you." "Focus. The magician is late. You’re up in thirty seconds. Remember the signal." "Which signal?" "I honk the horn, you drop the balloons, and we run like hell." "Fine. But I’m keeping the cyber-droid." "We sell it, Bill. We sell it all." "Maybe I keep the Legos." "Just go make a poodle."
created_at
"2026-02-08T15:43:33.671411+00:00"
Sunday, February 8th, 2026 at 3:43:33 PM GMT+00:00
model
"z-ai/glm-4.7-20251222"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6023
moderation_latency
(null)
generation_time
116838
tokens_prompt
61
tokens_completion
2233
native_tokens_prompt
66
native_tokens_completion
2239
native_tokens_completion_images
(null)
native_tokens_reasoning
1741
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00392417
router
(null)
provider_responses
0
id
"chatcmpl-R4uI1ZbQ3VzKCvTpobJjm2Dm"
status
200
is_byok
false
latency
54
endpoint_id
"ab79ffa8-8dcc-41c4-9a9f-6cc408135243"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-20251222"
api_type
"completions"
id
"gen-1770565296-xNgLr75t2oDzmGD94qYQ"
upstream_id
"chatcmpl-R4uI1ZbQ3VzKCvTpobJjm2Dm"
total_cost
0.00392417
cache_discount
0.00002048
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
81.2727%
Matches word count
n/a
neededClean
false
words
488
0.2002%
Dialogue to Total Word Ratio
Ratio: 81.15%, Deviation: 11.15%
neededClean
false
wordsTotal
488
wordsDialogue
396
40.7365%