NC Bench
Model: Z.AI GLM 4.7
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2026-02-08-01KGYXCCR23J8Q9S1WQD99BEDV
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly adjusted the wire in her ear, keeping her eyes locked on the colorful banner flapping in the wind. She signaled Bill to stay low behind the rhododendron. "Stop looking at the bouncy castle, Bill. Focus on the objective."
"I can't help it, Mol. It looks structurally unsound," Bill whispered, crouching lower. "Are we sure this is a six-year-old’s party? The security is tight. I counted two moms with walkie-talkies and a guy dressed as Spider-Man who looks like he actually works out."
"That’s not Spider-Man, that’s just a dad in a costume, and he’s not a threat if he’s trying to tie his own shoes," Molly snapped, checking her watch. "We have a strictly defined two-minute window. The magician is about to pull the rabbit out of the hat. That’s when the parents reach for their smartphones. Their visual field drops by ninety percent."
"And that’s when I move?"
"No, that’s when I move. You’re on crowd control. If any kid gets between me and the gift table, you deploy the diversion."
"I didn't practice the diversion, Molly. I practiced picking locks," Bill argued, fidgeting with a set of tension wrenches. "Besides, look at the loot. It’s massive. There’s a box shaped like a spaceship. It has to be at least three feet wide."
"We leave the spaceship. It’s a decoy. We want the small, dense boxes. Legos, video games, anything that looks like it contains expensive electronics," Molly explained, pointing a gloved finger toward the pile of wrapped presents. "We hit the table, we fill the duffel, we vanish like ghosts before the cake comes out. Cake time is chaos. We don't do chaos."
"What about the ice cream? If they’re eating ice cream, they’re distracted by the brain freeze."
"Ice cream is a variable, Bill. Variables get you caught. We stick to the timeline. Eye on the prize. We are professionals."
"Do we at least get to keep the piñata?"
"Bill, you’re a master thief. You don’t want a papier-mâché donkey filled with cheap candy."
"I like the Tootsie Rolls. They hold up well in the van," he muttered, adjusting his ski mask so it looked like a jaunty beret. "Okay, I’m in position. The magician is doing the card trick. The crowd is gasping. He’s reaching into the hat."
"Wait for the rabbit," Molly hissed, tightening her grip on the gym bag. "Remember, act natural. If anyone asks, you’re Uncle Gary and you’re looking for the bathroom."
"Roger that. Hey, is that a pony?"
"What? Where?"
"Over by the fence. They hired a pony. That changes the escape route."
"Forget the pony! We are not stealing the pony. We are stealing the presents!"
"It would fit in the van if we folded down the seats."
"We are not stealing a child's pony! Focus!"
"Okay, okay. Rabbit is out. The crowd is cheering. The dad-Spider-Man is taking a selfie. Go, go, go!"
"Cover me."
"I’ve got your back. Just watch out for the kid with the sticky face. He looks like he’s watching us. He has the eyes of a hawk."
"I see him. I’m moving in. Don't let the pony block the exit."
created_at: "2026-02-08T15:23:10.926499+00:00" (Sunday, February 8th, 2026 at 3:23:10 PM GMT+00:00)
model: "z-ai/glm-4.7-20251222"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2457
moderation_latency: (null)
generation_time: 215955
tokens_prompt: 61
tokens_completion: 3091
native_tokens_prompt: 66
native_tokens_completion: 3097
native_tokens_completion_images: (null)
native_tokens_reasoning: 2558
native_tokens_cached: 32
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00543591
router: (null)
provider_responses:
  0:
    id: "chatcmpl-RcvI9yyzuFfj3KFrdLinuzSh"
    status: 200
    is_byok: false
    latency: 75
    endpoint_id: "ab79ffa8-8dcc-41c4-9a9f-6cc408135243"
    provider_name: "DeepInfra"
    model_permaslug: "z-ai/glm-4.7-20251222"
    api_type: "completions"
id: "gen-1770563973-VFyMoEfyBgEro3nwv9ua"
upstream_id: "chatcmpl-RcvI9yyzuFfj3KFrdLinuzSh"
total_cost: 0.00543591
cache_discount: 0.00001024
upstream_inference_cost: 0
provider_name: "DeepInfra"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 536
0.0012% | Dialogue to Total Word Ratio | Ratio: 85.07%, Deviation: 15.07% | neededClean: false, wordsTotal: 536, wordsDialogue: 456
0.0006%
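The reported ratio and deviation can be reproduced from the word counts in the evaluation meta data. The sketch below is a minimal illustration, not the benchmark's actual evaluator: `dialogue_ratio` is a hypothetical helper that assumes dialogue is whatever sits between straight double quotes and that words are whitespace-separated, and the 0.70 target is taken from the scenario description. The real evaluator may tokenize differently, so its counts for the full response are not guaranteed to match this helper.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (total_words, dialogue_words, ratio) for a scene.

    Assumption: dialogue is any span enclosed in straight double quotes,
    and a word is any whitespace-separated token.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return total, dialogue, ratio


# Counts reported by the evaluator (wordsTotal, wordsDialogue).
words_total = 536
words_dialogue = 456
target = 0.70  # from the scenario: "70% dialogue"

ratio = words_dialogue / words_total
deviation = abs(ratio - target)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 85.07%, Deviation: 15.07%"
```

With the reported counts, 456 / 536 gives 85.07%, which is 15.07 percentage points above the 70% target. The response also runs to 536 words against the requested 500.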