NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0425XKVWD7YW1J55QK495F
Run Details
Model:
Z.AI GLM 4.5 Air
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
21.6s
Cost:
$0.000693
Provider:
Z.AI
Evaluation summary
Score
Evaluator
Details Link
27.17%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
13.5829%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly traced the swirling frosting pattern on a discarded cupcake, her eyes gleaming with manic energy. "Right, the big haul. Presents. All of them. We'll need grappling hooks, obviously. Maybe a few smoke bombs for dramatic effect. And definitely matching ski masks, very festive." Bill sighed, polishing a bobby pin with his sleeve. "Molly, it's a five-year-old's party. Balloons, juice boxes, and one very stressed-looking Spider-Man piƱata. Grappling hooks? We'll get tangled in the bunting." He peered through the van's dusty window at the suburban street. "Just grab the loot fast, avoid the toddlers, and for the love of Pete, don't trip over the inflatable castle." Molly bounced on the worn van seat. "But the entrance! Should we rappel from the oak tree? Or tunnel? Tunnels are always cool." She hummed a tuneless ditty, drumming her fingers on the dashboard. Bill pinched the bridge of his nose. "Tunnels? In Mrs. Henderson's prize-winning petunia patch? We'd be buried alive by irate gardeners. Simple is best, Molly. In quick, out quicker. Like ninja librarians." He adjusted his slightly-too-large ski mask. "Focus on the presents. And no singing." Molly grinned, already plotting angles. "Oh, Bill, the presents will be singing a sad song when we're done! Hee hee!" Bill just muttered about bad puns and the sheer, terrifying chaos of tiny humans.
created_at
"2026-04-24T16:09:44.637Z"
4/24/2026, 4:09:44 PM
model
"z-ai/glm-4.5-air"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2119
moderation_latency
(null)
generation_time
11380
tokens_prompt
77
tokens_completion
689
native_tokens_prompt
66
native_tokens_completion
619
native_tokens_completion_images
(null)
native_tokens_reasoning
340
native_tokens_cached
7
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00069291
router
(null)
provider_responses
0
endpoint_id
"f7dc79ca-bbbd-41a1-8eef-616d981ff1cb"
is_byok
false
latency
10077
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"SiliconFlow"
status
504
1
endpoint_id
"c8760514-8a81-4a73-ae2c-be4942ba93f3"
id
"202604250009559bdcad822de64380"
is_byok
false
latency
2119
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"Z.AI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777046984-n8KWyV6Ew8lQc5iJeSHQ"
session_id
(null)
api_type
"completions"
id
"gen-1777046984-aQp1Ar8V9p3ALD66OYuL"
upstream_id
"202604250009559bdcad822de64380"
total_cost
0.00069291
cache_discount
0.00000119
upstream_inference_cost
0
provider_name
"Z.AI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
27.17%
Matches word count
n/a
neededClean
false
words
219
0.00%
Dialogue to Total Word Ratio
Ratio: 61.06%, Deviation: 51.06%
neededClean
false
wordsTotal
226
wordsDialogue
138
13.5829%