NC Bench
Overview
About
Tests
Models
Model
Z.AI GLM 4.6
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYWZR08RQFGX39XRY72QFHV
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a hand-drawn map of a suburban backyard. “Okay, run the plan one more time. From the top.” “Right. I’m the Distraction,” Bill said, puffing out his chest. “I approach the target area—the bouncy castle—and initiate Operation: Funny Dance. This draws the primary security force, the children, away from the prize table.” “Good. Then you create a secondary diversion.” “Ah, yes. The ‘I’ve fallen and I can’t get up’ maneuver. Classic. While they’re all pointing and laughing, you, the Infiltrator, make your move.” “Correct. I grab the presents. All of them.” “Even the one for little Timmy? The one that looked like a rocket ship?” “Especially the one that looked like a rocket ship, Bill. We’re not sentimental.” “Right. Cold, hard, plastic. Got it. Then we rendezvous at the extraction point.” “The minivan two blocks down.” “The Swagger Wagon,” Bill corrected. “It’s a Chrysler Town & Country, Bill. It does not have ‘swagger’. We load the loot and we disappear.” “What about the cake? A heist isn’t a heist without a celebratory cake.” Molly sighed, pinching the bridge of her nose. “The objective is presents, not pastries. We are not stealing cake from a seven-year-old’s birthday party.” “But it was a dinosaur. A volcano cake with a lava frosting explosion.” “Focus! We’re in, we’re out. Thirty seconds, max. Any questions?” “Just one. What’s the code word? For when I’ve successfully initiated Operation: Funny Dance?” Molly stared at him, her expression flat. “The code word is… ‘Boogers.’” “Boogers?” Bill frowned. “Isn’t that a bit unprofessional?” “Fine,” Molly snapped. “The code word is ‘Pterodactyl.’ Happy?” “Very,” Bill beamed. “Pterodactyl. I like it. It’s got a real sense of occasion.”
created_at
"2026-02-08T15:14:30.466545+00:00"
Sunday, February 8th, 2026 at 3:14:30 PM GMT+00:00
model
"z-ai/glm-4.6"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
367
moderation_latency
(null)
generation_time
110079
tokens_prompt
61
tokens_completion
2612
native_tokens_prompt
66
native_tokens_completion
2632
native_tokens_completion_images
(null)
native_tokens_reasoning
2194
native_tokens_cached
32
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0045968599999776
router
(null)
provider_responses
0
id
"chatcmpl-RXn5V4wvfXdsTbeiXbB8gmio"
status
200
is_byok
false
latency
64
endpoint_id
"c8607edc-8577-4199-b03d-1213da4743dc"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.6"
api_type
"completions"
id
"gen-1770563559-aE3Vb1XSoQ4oPkHNenkR"
upstream_id
"chatcmpl-RXn5V4wvfXdsTbeiXbB8gmio"
total_cost
0.0045968599999776
cache_discount
0.0000112000000224
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
283
0.0002%
Dialogue to Total Word Ratio
Ratio: 86.22%, Deviation: 16.22%
neededClean
false
wordsTotal
283
wordsDialogue
244
0.0001%