NC Bench
Model
Z.AI GLM 4.6
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYX4PS0BR3QG5W19TPEM7V2
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly stared at the crayon-drawn map of the Johnson’s backyard, the words “PINATA STAGING AREA” circled in a wobbly red line. She pinched the bridge of her nose, a familiar headache forming behind her eyes. “Okay, let’s run it one more time. The diversion is key. Bill, are you listening?”
“Totally,” Bill said, not looking up from the bright red clown nose he was trying to affix to his balaclava. “But I feel like this sends the wrong message. We’re thieves, not children’s entertainers.”
“That’s the point, you genius! Misdirection. While you’re ‘entertaining’ the kids with your… whatever that is, I’ll be slipping around the side of the house to the present table.”
“What about the bouncy castle?” Bill asked, his voice muffled by the latex. “The pirate ship one. A quick bounce is crucial for morale. High morale leads to a cleaner getaway.”
“There will be no bouncing!” Molly hissed, snatching the map. “The goal is presents. Not cake, not bouncy castles, not the sad-looking magician. Presents.”
“But what if one of the presents is for us? Like, karmically? A new set of lockpicks, maybe?”
“Bill, we are stealing a seven-year-old’s LEGO spaceship and a Make-Your-Own-Slime kit. I doubt the universe has hidden a set of professional-grade tools in a box marked ‘To Timmy.’”
“I don’t know,” Bill mused, holding up a grappling hook. “Seems like overkill for a Barbie Dreamhouse. Can’t we just use the doggy door? I saw a golden retriever. He seemed friendly.”
“We are not befriending the canine security system. We go in hard, we go in fast. I create a smoke screen by the grill, you grab the sacks, and we rendezvous at the van in ninety seconds. Ninety. Seconds.”
“Right, right, the sacks.” He held up two brightly colored gift bags with cartoon unicorns on them. “I figured these would be less conspicuous than burlap.”
Molly just stared, her mouth agape. “Where did you even get those?”
“Party City. They were having a sale. But, Molly, I have one more question.”
“What?”
“Are we taking the party favor bags, too? Because I saw they have little whistles in them.”
created_at
"2026-02-08T15:16:44.25571+00:00"
Sunday, February 8th, 2026 at 3:16:44 PM GMT+00:00
model
"z-ai/glm-4.6"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
295
moderation_latency
(null)
generation_time
81832
tokens_prompt
61
tokens_completion
1836
native_tokens_prompt
66
native_tokens_completion
1846
native_tokens_completion_images
(null)
native_tokens_reasoning
1381
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0032180199999552
router
(null)
provider_responses
0
id
"chatcmpl-RWy6IuaFuHpP524vmE0qG3tl"
status
200
is_byok
false
latency
65
endpoint_id
"c8607edc-8577-4199-b03d-1213da4743dc"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.6"
api_type
"completions"
id
"gen-1770563722-IoJQlIfCETY0QWGRVKf2"
upstream_id
"chatcmpl-RWy6IuaFuHpP524vmE0qG3tl"
total_cost
0.0032180199999552
cache_discount
0.0000224000000448
upstream_inference_cost
0
provider_name
"DeepInfra"
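A few derived figures can make the raw metadata above easier to read. The sketch below computes them under two assumptions that the page does not confirm: that generation_time is in milliseconds, and that native_tokens_completion includes the tokens counted under native_tokens_reasoning. The function name derived_stats is hypothetical.

```python
def derived_stats(generation_time_ms: float,
                  completion_tokens: int,
                  reasoning_tokens: int) -> dict:
    """Summarise a run from its token counts and generation time.

    Assumptions (not confirmed by the source page):
    - generation_time_ms is expressed in milliseconds
    - completion_tokens already includes reasoning_tokens
    """
    seconds = generation_time_ms / 1000
    return {
        # Overall output throughput, reasoning tokens included
        "tokens_per_second": completion_tokens / seconds,
        # Tokens left over for the visible answer
        "visible_tokens": completion_tokens - reasoning_tokens,
        # Fraction of the completion spent on reasoning
        "reasoning_share": reasoning_tokens / completion_tokens,
    }


# Values copied from the metadata of this run
stats = derived_stats(generation_time_ms=81832,
                      completion_tokens=1846,
                      reasoning_tokens=1381)
for name, value in stats.items():
    print(f"{name}: {value:.4g}")
```

If either assumption is wrong, the throughput and the visible-token figure will be off, so treat the output as a rough reading aid only.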
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 364
28.6505% | Dialogue to Total Word Ratio | Ratio: 75.00%, Deviation: 5.00% | neededClean: false; wordsTotal: 364; wordsDialogue: 273
14.3252%
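The reported ratio follows from the two word counts: 273 dialogue words out of 364 total is 75.00%, which is 5.00 percentage points away from the requested 70%. The sketch below shows one way such a check could be computed. It is not the benchmark's evaluator: the function names, the whitespace-based word count, and the rule that dialogue is anything inside curly double quotes are all assumptions, so it may not reproduce the exact counts above. How the deviation is turned into the 28.6505% result is not shown on the page and is not modelled here.

```python
import re

# Assumption: dialogue is any text enclosed in curly double quotes.
DIALOGUE_RE = re.compile(r"“([^”]*)”")


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (a simplifying assumption)."""
    return len(text.split())


def stats_from_counts(words_dialogue: int, words_total: int,
                      target_ratio: float = 0.70) -> dict:
    """Compute the dialogue ratio and its absolute deviation from the target."""
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


def dialogue_stats(text: str, target_ratio: float = 0.70) -> dict:
    """Measure how much of a scene's word count sits inside quotation marks."""
    words_dialogue = sum(count_words(span) for span in DIALOGUE_RE.findall(text))
    return stats_from_counts(words_dialogue, count_words(text), target_ratio)


# Reproduce the arithmetic from the counts reported in the table
reported = stats_from_counts(words_dialogue=273, words_total=364)
print(f"Ratio: {reported['ratio']:.2%}, Deviation: {reported['deviation']:.2%}")

# Apply the same measurement to a short hypothetical sample
sample = "Molly sighed. “Are you listening?” “Totally,” Bill said."
print(dialogue_stats(sample))
```

Note that the word-count evaluator scored 0.0000% because the response came in at 364 words against the requested 500, independently of the dialogue ratio.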