NC Bench
Run: 2026-04-24-01KQ053SGRYVVYC9TSMJ7XJW3E
Run Details
Model: Z.AI GLM 5.1
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 5m 8s
Cost: $0.066220
Provider: DeepInfra
Evaluation summary
Score | Evaluator
100.00% | Matches word count
99.72% | Dialogue to Total Word Ratio
99.8585% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crumpled birthday invitation across the kitchen table, weighing down the corners with a half-empty bottle of hot sauce and a ceramic frog. Bill leaned in, squinting at the crude map of the Pemberton backyard she had drawn in purple marker. The air was thick with tension and the faint smell of microwave burritos. Molly adjusted her black turtleneck, looking every bit the seasoned cat burglar, if cat burglars typically shopped at discount sporting goods stores. She tapped the paper with a mismatched chopstick, pointing at a cluster of poorly drawn trees representing the perimeter hedges. "This is the perimeter. Standard wooden fence, easily scalable, but watch out for the garden gnomes. They are surprisingly sharp." Bill scratched his chin, looking from the crayon-drawn fortress to the bright pink invitation featuring a cartoon dinosaur holding balloons. He adjusted his glasses, trying to decipher her tactical markings. The whole situation felt utterly ridiculous, but Molly never backed down from a score, no matter how bizarre. "Molly, it is a sixth birthday party. Are we sure we need tactical black ski masks for this?" "Kids are observant, Bill. Especially faces stealing their Lego Star Destroyers." He sighed heavily, pulling a balaclava from his back pocket and stretching it over his messy hair. The eyeholes did not align properly, making him look like a confused raccoon who had gotten into the trash again. He tugged at the fabric, but it stubbornly refused to cooperate, leaving him looking more comical than covert. "Fine. What about the bouncy castle?" "Primary obstacle. It blocks the direct route to the gift table. We will have to edge around the perimeter, staying low beneath the pony ride line." "You want me to crawl past a Shetland pony?" "Those things bite. The target package is heavily fortified. I will create a diversion at the pinata. Chaos erupts, and we grab the big ticket items." 
Bill raised an eyebrow, crossing his arms over his chest as he visualized the chaotic scene of screaming children and flying candy. It was a risky maneuver, relying entirely on the erratic swinging of a blindfolded six-year-old wielding a plastic baseball bat. "What about the clown?" Molly paused, her expression darkening considerably. The overhead kitchen light flickered ominously, casting long, jagged shadows across her grim, serious face. She gripped the wooden chopstick so tightly that it snapped cleanly in half. "We do not talk about the clown. If he makes a balloon animal, you run." Bill swallowed hard, adjusting his crooked mask one final time. He stared down at the brightly colored, obnoxiously cheerful invitation, contemplating the sheer madness of the entire operation. Stealing presents from a children's birthday party was a new low, even for a pair of seasoned thieves like them. But the potential payout was too tempting to ignore. They needed the money to pay off the dangerous loan sharks who were threatening their kneecaps. "Okay. Smoke the donkey, grab the Barbie, avoid the evil clown. I am totally in."
created_at: "2026-04-24T16:28:06.05Z" (4/24/2026, 4:28:06 PM)
model: "z-ai/glm-5.1-20260406"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 854
moderation_latency: (null)
generation_time: 307561
tokens_prompt: 77
tokens_completion: 15922
native_tokens_prompt: 66
native_tokens_completion: 18908
native_tokens_completion_images: (null)
native_tokens_reasoning: 15127
native_tokens_cached: 32
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.06622026
router: (null)
provider_responses:
  0:
    endpoint_id: "e888c44c-6a07-4a9e-bc56-cb26511865ec"
    id: "chatcmpl-RYhzwRxHOmtGXsuZKnDEEiOn"
    is_byok: false
    latency: 94
    model_permaslug: "z-ai/glm-5.1-20260406"
    provider_name: "DeepInfra"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777048086-Qi86gDPzDmM2Wi6yt1B2"
session_id: (null)
api_type: "completions"
id: "gen-1777048086-WSMObULv2y4yGYvrrrE2"
upstream_id: "chatcmpl-RYhzwRxHOmtGXsuZKnDEEiOn"
total_cost: 0.06622026
cache_discount: 0.000027039
upstream_inference_cost: 0
provider_name: "DeepInfra"
response_cache_source_id: (null)
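The timing and token fields above can be cross-checked against the run details with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the page does not state: that generation_time is in milliseconds, and that native_tokens_reasoning is a subset of native_tokens_completion. The helper name format_duration is hypothetical.

```python
# Values copied from the Meta fields above.
# Assumption: generation_time is expressed in milliseconds.
generation_time_ms = 307561
native_tokens_completion = 18908
native_tokens_reasoning = 15127


def format_duration(ms: int) -> str:
    """Render milliseconds as 'Xm Ys', rounded to the nearest second."""
    total_seconds = round(ms / 1000)
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


# Assumption: reasoning tokens are counted inside the completion total,
# so the remainder approximates the tokens of the visible response.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

print(format_duration(generation_time_ms))  # 5m 8s
print(visible_tokens)                        # 3781
print(f"{reasoning_share:.1%}")              # 80.0%
```

Under these assumptions the formatted duration agrees with the "5m 8s" shown in the run details, and roughly four fifths of the native completion tokens were spent on reasoning rather than on the scene itself.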
Evaluation details
Result | Evaluator | Details | Meta Data
100.00% | Matches word count | n/a | neededClean: false, words: 500
99.72% | Dialogue to Total Word Ratio | Ratio: 29.76%, Deviation: 0.24% | neededClean: false, wordsTotal: 504, wordsDialogue: 150
Overall: 99.8585%
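The reported ratio and deviation follow directly from the wordsDialogue and wordsTotal counts. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is not NC Bench's evaluator: treating text inside straight double quotes as dialogue, splitting words on whitespace, and the function names are all assumptions made for illustration. The mapping from the 0.24% deviation to the 99.72% score is not given on the page and is not modeled here.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: 30% dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed definition of dialogue)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in quoted_spans)


def ratio_and_deviation(words_dialogue: int, words_total: int) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - TARGET_RATIO)


# Counts as reported in the evaluation details.
ratio, deviation = ratio_and_deviation(150, 504)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 29.76%, Deviation: 0.24%

# Tiny usage example for the counting helpers.
sample = 'Bill sighed. "What about the clown?" Molly paused.'
print(count_dialogue_words(sample), count_words(sample))  # 4 8
```

The first evaluator reports 500 words while this one reports a wordsTotal of 504, so the two evaluators apparently count words differently. A whitespace split like the one above should therefore not be expected to reproduce either figure exactly.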