NC Bench
Model
Z.AI GLM 4.7 Flash
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGZ03P7Y9JVRNFXB417H7WVK
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Molly, look at that." "What is it now, Bill?" "That inflatable flamingo. It’s looking right at us." "It’s a lawn ornament, Bill. It has no eyes." "I know, but it knows. It knows we’re up to something. It knows we’re going to steal presents from a six-year-old named Kevin." "We are not stealing from Kevin. We are liberating those presents. Think of it as gift recycling." "Recycling? It’s petty theft." "It’s capitalism in reverse, baby. Now, get your bag. The noise cancelling headphones are vibrating in your pocket, you’re just too nervous to feel them." "I’m nervous because I’m holding a bag of canned air. What am I supposed to do with that if they catch us? Blow them up?" "You wingman, Bill. You provide the distraction. You trip over the sprinkler system. You trip over the garden hose. You trip over absolutely everything." "I can’t trip over the sprinkler system, that makes a lot of noise." "Then hold onto it. It’s part of the tactical loadout. Come on, the bat signal isn’t going to beep itself." "The bat signal is a sky projector." "It’s a signal from the Bat-Man. Now, up the trellis. You first." "I feel like I’m participating in a crime against nursery rhymes. It feels wrong. I could be at home watching soap operas." "You’ll be watching soap operas in a much nicer house, provided we get that PS4. Did you research the target?" "I checked the gift registry online. It’s going to be a massive haul. Lego set, a puppy, maybe a wagon." "A wagon is a weapon of mass destruction in the wrong hands." "That’s what I keep telling myself. Is the rear exit unlocked?" "It’s locked, but the lock is covered in toothpaste. It’s the work of a genius, really." "Toothpaste?" "It’s slippery, but you can usually jimmy it with a credit card if you have the right technique. Which I do." "You have three credit cards and they’re all maxed out." "They’ll be less maxed out after tonight. Okay, listen up. We hit the back door. You handle the gnome. I’ll get the loot. 
We meet at the van in exactly four minutes." "What if Kevin tries to hug me?" "He’s a baby. He can’t tie a shoelace, he can’t choke you." "He has sharp teeth." "So does the thing I’m stuffing under my coat. Now, move." "God, I hate children." "That’s why they’re the perfect mark. They are adorable and defenseless. Like pigeons in a park, but with more noise." "Fine. I’m going in." "Go get 'em, tiger. Go get 'em." "Smile for the bunny."
created_at
"2026-02-08T16:07:57.293522+00:00"
Sunday, February 8th, 2026 at 4:07:57 PM GMT+00:00
model
"z-ai/glm-4.7-flash-20260119"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
186
moderation_latency
(null)
generation_time
41954
tokens_prompt
61
tokens_completion
1988
native_tokens_prompt
66
native_tokens_completion
1996
native_tokens_completion_images
(null)
native_tokens_reasoning
1462
native_tokens_cached
32
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0008007600000064
router
(null)
provider_responses
0
id
"chatcmpl-R4ymi5r35vBtOeJJVa9au8SX"
status
200
is_byok
false
latency
54
endpoint_id
"6efb5994-8fcb-4300-9c64-ba393193f5f6"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-flash-20260119"
api_type
"completions"
id
"gen-1770566834-MR7HcGA7SVyIszf74VPZ"
upstream_id
"chatcmpl-R4ymi5r35vBtOeJJVa9au8SX"
total_cost
0.0008007600000064
cache_discount
0.0000015999999936
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 437
0.0000% | Dialogue to Total Word Ratio | Ratio: 100.00%, Deviation: 30.00% | neededClean: false; wordsTotal: 437; wordsDialogue: 437
0.0000%
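The "Dialogue to Total Word Ratio" figures above can be illustrated with a short sketch. The page does not show the evaluator's implementation, so everything here is an assumption: the hypothetical function `dialogue_ratio`, the rule that text between double quotes counts as dialogue, whitespace-based word counting, and the 0.70 target taken from the scenario. It is not guaranteed to reproduce the evaluator's exact counts.

```python
import re


def dialogue_ratio(text: str, target: float = 0.70):
    """Return (total_words, dialogue_words, ratio, deviation).

    Hypothetical helper: the real evaluator's rules are not shown on the page.
    """
    # Normalize curly double quotes so both quote styles are treated alike.
    normalized = text.replace("\u201c", '"').replace("\u201d", '"')
    # Assumption: anything between a pair of double quotes is dialogue.
    spans = re.findall(r'"([^"]*)"', normalized)
    dialogue_words = sum(len(span.split()) for span in spans)
    # Total words: whitespace-separated tokens once quote marks are removed.
    total_words = len(normalized.replace('"', " ").split())
    ratio = dialogue_words / total_words if total_words else 0.0
    # Deviation is the absolute distance from the requested dialogue share.
    deviation = abs(ratio - target)
    return total_words, dialogue_words, ratio, deviation


sample = '"Molly, look at that." "What is it now, Bill?"'
total, dialogue, ratio, deviation = dialogue_ratio(sample)
print(f"words={total} dialogue={dialogue} ratio={ratio:.2%} deviation={deviation:.2%}")
```

Under these assumptions, a response made up entirely of quoted speech yields a 100% ratio and a deviation of 30 percentage points from the 70% target, which is the pattern reported for this run.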