NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ04422ZG9RSBXR8Y5A1VASY
Run Details
Model:
Z.AI GLM 4.5 Air
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
7.2s
Cost:
$0.000526
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprints for the ‘Whimsy Woods’ birthday bash were spread across the hood of Bill’s beat-up sedan, held down by a half-empty coffee mug shaped like a grinning dinosaur. Molly tapped a perfectly manicured finger on the diagram of the living room. “Right, so the presents are here, the kids are there, and the target, young Alistair, is currently attempting to blow out the candles on a cake the size of a small car,” she said, her voice a low hum of concentration. “The real question, Bill, is the Great Wall of Candy.” Bill peered over the rim of his glasses. “The what?” “The candy table,” Molly hissed, pointing. “It’s a formidable perimeter defense. Brightly colored, sticky, and strategically placed to slow down any interloper. We need a diversion.” Bill scoffed, leaning back against the car door with a groan of protest. “Molly, love, this is a kids’ party. Not Fort Knox. We could probably just walk in, grab a few nicely wrapped boxes, and be gone before the singalong to ‘Baby Shark’ finishes.” “We are not being sloppy, Bill,” she insisted, drawing a small, sleek tool from her pocket. “A professional leaves no trace. And no, we are not using your ‘I’m a magician’ routine again. Last time you tried to saw a child in half and nearly started a riot.” “It was a quartering! It’s a classic!” “It was a disaster,” Molly countered flatly. “Our diversion will be elegant. It will involve chaos, but the *right* kind of chaos. You will create a disturbance on the far side of the room.” “And how am I supposed to do that?” Bill asked, a sly grin spreading across his face. “I’ve got just the thing.” He reached into his other pocket and pulled out a rubber chicken. “This little fella has caused more chaos in boardrooms than a dozen stock market crashes. One good squeeze, a few well-timed squawks, and the parents will be too busy looking for the source of the noise to notice us quietly liberating the Lego Death Star.” Molly stared at the chicken, then at Bill, a flicker of something that might have been admiration in her eyes. “You’re an absolute menace, you know that?” He gave a theatrical bow. “I prefer to think of myself as an artist. Now, are we in agreement? The chicken creates the chaos, you and I slip in like shadows, and we’re out of there before anyone can even say ‘unauthorized entry’.” Molly rolled up the blueprints with a decisive snap. “Fine. But if you get caught, I am not paying your bail. Again.” “It’s a contract,” Bill said, already slipping the rubber chicken up his sleeve. “Now, let’s go make some children cry.”
created_at
"2026-04-24T16:10:46.249Z"
4/24/2026, 4:10:46 PM
model
"z-ai/glm-4.5-air"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1125
moderation_latency
(null)
generation_time
6460
tokens_prompt
77
tokens_completion
703
native_tokens_prompt
70
native_tokens_completion
600
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0005258
router
(null)
provider_responses
0
endpoint_id
"90fc9bb3-01bb-419f-bb58-dea60a80d201"
is_byok
false
latency
425
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"Novita"
status
429
1
endpoint_id
"f7dc79ca-bbbd-41a1-8eef-616d981ff1cb"
id
"019dc0420d827842b11f183d9b71c028"
is_byok
false
latency
1125
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777047046-sveXtfa9GLdDYyRwWtRn"
session_id
(null)
api_type
"completions"
id
"gen-1777047046-QfSXS0rXvDth3Na51rIR"
upstream_id
"019dc0420d827842b11f183d9b71c028"
total_cost
0.0005258
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
454
0.00%
Dialogue to Total Word Ratio
Ratio: 71.99%, Deviation: 41.99%
neededClean
false
wordsTotal
457
wordsDialogue
329
0.0000%