NC Bench
Run: 2026-04-24-01KQ088KNN8NC9YHQMT0DSTEE9
Run Details
Model: Z.AI GLM 5.1
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 2m 43s
Cost: $0.026607
Provider: Z.AI
Evaluation summary
Matches word count: 97.63%
Dialogue to Total Word Ratio: 85.72%
Overall: 91.6719%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her earpiece, glaring at Bill across the cluttered van. He was currently strapping a heavy-duty tactical vest over a garishly colored Hawaiian shirt, a bizarre combination that made him look like a mercenary on a tropical vacation. "Are you seriously wearing a ski mask to a six-year-old's birthday party?" "It's a tactical requirement, Molly. You can never be too careful in the field." "It's eighty degrees out, Bill. You'll melt before we even hit the bouncy castle." "The bouncy castle is the perimeter breach. I've studied the schematics extensively." She snatched the crayon drawing from his hands, squinting at the crude, waxy markings on the flimsy paper. "This is a napkin. And you drew a sad face on the clown." "He's a threat. Did you see the size of his shoes? He's definitely packing." "He's making balloon animals, Bill. A poodle, specifically." "That's what he wants you to think. While you're distracted by the poodle, boom, he hits you with the flower squirt." "I'm more worried about the mom. She's got eyes like a hawk and a spreadsheet for a seating chart." Bill pulled a large, brightly painted papier-mâché donkey from under the dented van seat, holding it up like a holy relic. "Which is exactly why I brought the decoy." "A piñata? Where did you even get a donkey-shaped piñata on such short notice?" "Henderson kid's party last week. We toss this into the crowd, they swarm it like rabid piranhas, and we grab the presents." "We are not stealing a Barbie Dreamhouse. We're grabbing the gift cards and the cash. Small, untraceable, high value." "I already told you, I'm taking the Dreamhouse. It's got a working elevator, Molly. A tiny, working elevator!" "You're a forty-year-old man!" "A forty-year-old man who appreciates functional miniaturized architecture! I'll grab the Dreamhouse, you hit the card box. We rendezvous back at the bouncy castle." "The bouncy castle that's currently occupied by fifteen hyped-up first-graders?" 
"We'll have to neutralize them." Molly grabbed her stun gun from the dashboard, her finger hovering over the trigger, but Bill quickly waved a greasy breadstick in her face, blocking her aim. "Not violently! We drop the breadsticks, they carb-load, we slip away in the chaos." "You're going to give a dozen sugar-crazed six-year-olds unlimited garlic breadsticks?" "It's a tactical distraction!" "It's a bowel obstruction! You'll start a riot!" "A localized, carbohydrate-fueled riot that covers our escape route perfectly!" She rubbed her temples, staring at the sprawling suburban backyard through a pair of high-powered binoculars. Kids were already shrieking on the lawn. "I can't believe I agreed to this. The blueprints said nothing about hostile clowns and stale breadsticks." "Trust me. By the time the clown makes his next poodle, the Dreamhouse is mine, and you'll be rich in Amazon credit." "If I get hit by a water squirt flower, I'm leaving you in the bouncy castle." Bill pulled the dark wool ski mask over his face, already sweating profusely in the afternoon heat. "Put on your party hat, Molly. We're going in."
created_at: "2026-04-24T17:23:09.631Z" (4/24/2026, 5:23:09 PM)
model: "z-ai/glm-5.1-20260406"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3510
moderation_latency: (null)
generation_time: 163389
tokens_prompt: 77
tokens_completion: 6300
native_tokens_prompt: 66
native_tokens_completion: 6026
native_tokens_completion_images: (null)
native_tokens_reasoning: 5311
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0266068
router: (null)
provider_responses: 0
endpoint_id: "4ce9a513-17dc-4130-8960-2c4ceed3ea56"
id: "20260425012309b18661a2d4d44f92"
is_byok: false
latency: 3510
model_permaslug: "z-ai/glm-5.1-20260406"
provider_name: "Z.AI"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777051389-LoYu5eQaT92sRhfLKdhq"
session_id: (null)
api_type: "completions"
id: "gen-1777051389-KzWchIuHlVz8MnPrszew"
upstream_id: "20260425012309b18661a2d4d44f92"
total_cost: 0.0266068
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Z.AI"
response_cache_source_id: (null)
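The metadata values can be cross-checked against the figures in Run Details. The following is a minimal sketch, not part of the benchmark itself. It assumes that generation_time is in milliseconds (which matches the reported duration of 2m 43s) and that native_tokens_reasoning is counted inside native_tokens_completion. Neither assumption is confirmed on this page, and the helper name format_duration is hypothetical.

```python
# Values copied from the run metadata.
generation_time_ms = 163389          # assumed to be milliseconds
native_tokens_completion = 6026
native_tokens_reasoning = 5311       # assumed to be a subset of completion tokens
total_cost = 0.0266068


def format_duration(ms: int) -> str:
    """Format a millisecond count as 'Xm Ys', dropping the fractional second."""
    total_seconds = ms // 1000
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


duration = format_duration(generation_time_ms)

# Throughput over the whole generation, reasoning tokens included.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Share of completion tokens spent on reasoning, and what is left for the answer.
reasoning_share = native_tokens_reasoning / native_tokens_completion
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(duration)                       # 2m 43s
print(f"{tokens_per_second:.1f} tokens/s")
print(f"{reasoning_share:.1%} reasoning, {visible_tokens} visible tokens")
print(f"${total_cost:.6f}")           # $0.026607
```

Under these assumptions, roughly 88% of the completion tokens went to reasoning, leaving 715 tokens for the scene itself.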
Evaluation details
Matches word count
Result: 97.63%
Details: n/a
Meta Data: neededClean = false, words = 507
Dialogue to Total Word Ratio
Result: 85.72%
Details: Ratio: 71.76%, Deviation: 1.76%
Meta Data: neededClean = false, wordsTotal = 524, wordsDialogue = 376
Overall: 91.6719%
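The ratio and deviation reported by the dialogue evaluator can be reproduced from its metadata counts. The sketch below assumes that the ratio is wordsDialogue divided by wordsTotal and that the deviation is the absolute distance from the 70% target named in the scenario. The function names are hypothetical. The formula that turns a deviation into the 85.72% score is not shown on this page, so it is not reproduced here. The last step assumes that the overall score is the mean of the two evaluator scores. The displayed percentages are rounded, so the estimate only approximates the reported 91.6719%.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return the dialogue share as a percentage of all counted words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Return the absolute distance, in percentage points, from the target."""
    return abs(ratio_pct - target_pct)


# Counts taken from the evaluator metadata.
ratio = dialogue_ratio(words_dialogue=376, words_total=524)
deviation = deviation_from_target(ratio)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# Ratio: 71.76%, Deviation: 1.76%

# Assumed aggregation: unweighted mean of the two displayed (rounded) scores.
overall_estimate = (97.63 + 85.72) / 2
print(f"Overall (estimate): {overall_estimate:.3f}%")
```

The two evaluators report different totals (507 words versus 524), so they appear to count words differently. The sketch uses each evaluator's own figures and does not try to reconcile them.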