NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ05H57G17ZWF7SJ358F5AAT
Run Details
Model:
Z.AI GLM 5.1
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
3m 38s
Cost:
$0.034490
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
88.28%
Dialogue to Total Word Ratio
(details)
45.9518%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the binoculars, the cheap plastic rim digging painfully into her eye sockets as she scanned the chaotic landscape beyond the chain-link fence. The backyard of 42 Maple Street looked like a warzone orchestrated by a sugar-crazed general. An inflatable bounce house shaped like a fire-breathing dragon dominated the lawn, shaking violently as a dozen seven-year-olds hurled themselves against its translucent walls. At the far end of the yard, beneath a brightly colored banner reading "Happy 7th Birthday, Timmy!", sat the motherload. It was a flimsy folding table groaning under the weight of meticulously wrapped boxes, bulging gift bags, and assorted plastic treasures. To the untrained eye, it was a picturesque scene of innocent joy. To Molly, it was a vault waiting to be cracked, and she had the perfect crew for the job, even if her partner was currently scratching his chin and looking entirely too nervous for a professional. "There it is. You see the perimeter?" Bill squinted through the fence slats, his face scrunched in intense concentration as he assessed the security situation. The afternoon sun glinted off the sprinklers, creating a hazardous blinding effect. "I see it. They got a clown, Mol." "Clowns are just men in makeup. Focus. We go in through the side gate when the cake is served. The target will be blinded by the candles." "What if Timmy makes a wish for us to disappear?" "He's seven. He'll wish for a dinosaur. Once we breach, you deploy the smoke bomb." Bill adjusted his disguise, his five o'clock shadow clashing terribly with the neon pink fairy wings strapped to his broad back. He looked less like a stealthy infiltrator and more like a bouncer who had lost a terrible bet on his way to a rave. The sheer absurdity of the outfit was meant to be ironic, but mostly it just looked sad. "Are we sure about the smoke bomb? The wind could shift." "It's tactical. You grab the presents, I'll engage the clown." "Hand-to-hand with a clown? That guy has size sixteen shoes." "Just shove the presents into the bag and head for the minivan. Don't stop for cake or juice boxes." Bill pointed a thick finger toward the dragon. "What about the bounce house? We could bounce over the fence." "That is the stupidest idea you've ever had." "It's an inflatable extraction system!" Molly snatched the binoculars back, shaking her head in disbelief. The sheer audacity of the heist was enough without her partner suggesting they literally bounce away from the scene. She took a deep breath, centering herself for the impending chaos. The screams of the children grew louder, a deafening chorus of sugar-induced hysteria that would perfectly mask their approach. This was their moment, their ticket to the big leagues of petty larceny. She checked her watch, watching the second hand tick down toward their inevitable destiny. "We are professionals. Check your earpiece." Bill tapped the plastic comm dot hidden under his ridiculously large adhesive mustache. He straightened his wings, taking the whole operation far too seriously. "Loud and clear. Ready to ruin a kid's day." "Good. Initiate the sad puppy distraction."
created_at
"2026-04-24T16:35:24.025Z"
4/24/2026, 4:35:24 PM
model
"z-ai/glm-5.1-20260406"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1685
moderation_latency
(null)
generation_time
217506
tokens_prompt
77
tokens_completion
9430
native_tokens_prompt
66
native_tokens_completion
9850
native_tokens_completion_images
(null)
native_tokens_reasoning
8606
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.03449022
router
(null)
provider_responses
0
endpoint_id
"e888c44c-6a07-4a9e-bc56-cb26511865ec"
id
"chatcmpl-RHxzU040ti9ZcWSVkuKlep6K"
is_byok
false
latency
83
model_permaslug
"z-ai/glm-5.1-20260406"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777048524-kNuwYwV7dIDeHlWpu9gf"
session_id
(null)
api_type
"completions"
id
"gen-1777048524-iqR7nYi52YwhLVPKKuaZ"
upstream_id
"chatcmpl-RHxzU040ti9ZcWSVkuKlep6K"
total_cost
0.03449022
cache_discount
0.000054079
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
524
88.28%
Dialogue to Total Word Ratio
Ratio: 31.58%, Deviation: 1.58%
neededClean
false
wordsTotal
532
wordsDialogue
168
45.9518%