NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ02N660WK3RBH50VRZENWZX
Run Details
Model:
Z.AI GLM 4.5 Air
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
37.1s
Cost:
$0.000857
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.08%
Dialogue to Total Word Ratio
(details)
0.0385%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly wiped condensation off the truck's windshield, peering at the brightly painted community hall. "Alright, Bill. Recon complete. The party’s inside, loud enough to drown out a chainsaw. Parents mostly glued to the snacks table. Target window: thirty minutes when the piñata happens. Distraction central." Bill fiddled with a stolen party favor shaped like a chicken. "Thirty minutes? That’s tight. What if Junior pinatas the piñata early? Or spills punch everywhere? Or just starts screaming for no reason? Kids are unpredictable!" "Exactly why we’re not waiting for cake. Now, the presents. Stacked near the giant inflatable slide. I’ll cut through the back window, grab the loot, you drive around front for the handoff. Got it?" Bill nodded vigorously, chicken bobbing. "Got it. Slide presents. Got it. Cut window, grab loot, handoff. Easy-peasy. Except…" "Except what, Bill?" Bill leaned closer, voice dropping dramatically. "The presents. Are they wrapped? If they’re wrapped, how do we know which ones are valuable? Aunt Mildred’s terrible sweater versus Great Uncle Bob’s rare baseball card? We can’t just grab everything. That’s inefficient." Molly pinched the bridge of her nose. "Bill, we’re stealing *all* of them. Efficiency isn’t the point. The point is getting out before Little Timmy notices his bike-shaped box is gone." Bill looked horrified. "All of them? Even the handmade potholders? That seems… rude. And bulky. The truck’s cramped enough with the air mattress." "Focus, Bill! We’ll deal with sorting later. Dump them in the truck, we’ll cull the chaff later. Now, the handoff. You be ready. Engine running. Don’t wander off to pet a stray cat again!" Bill puffed out his chest. "I learned my lesson about strays. This time, I’ll pet the inflatable slide. It’s friendly! And big!" "Bill, no petting potential evidence. Drive. Quietly. No revving the engine like you’re at the Indy 500." "Right. Quiet. Got it. Slide presents. Cut window. Grab loot. Handoff. No strays, no slides. Got it." Molly sighed, checking her watch. "Good. And Bill?" "Yeah?" "If you see a kid drop a lollipop, you look the other way. We are not collecting candy." Bill sighed dramatically too. "Fine. No candy. Just slide presents. Cut window. Grab loot. Handoff. Got it." He paused, then brightened. "What if we wear hats? Disguises?" Molly stared at him. "Bill, we’re in a stolen panel van wearing dark clothes. We’re not James Bond." "Point taken. Just chicken hats?" Molly slammed the van door. "Drive."
created_at
"2026-04-24T15:45:10.345Z"
4/24/2026, 3:45:10 PM
model
"z-ai/glm-4.5-air"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1219
moderation_latency
(null)
generation_time
36902
tokens_prompt
77
tokens_completion
1142
native_tokens_prompt
66
native_tokens_completion
986
native_tokens_completion_images
(null)
native_tokens_reasoning
380
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0008572
router
(null)
provider_responses
0
endpoint_id
"90fc9bb3-01bb-419f-bb58-dea60a80d201"
is_byok
false
latency
93
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"Novita"
status
429
1
endpoint_id
"f7dc79ca-bbbd-41a1-8eef-616d981ff1cb"
id
"019dc02aa034c608083ec70aaf9820f2"
is_byok
false
latency
1219
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777045510-d721SrYMZKWQutCzS1x1"
session_id
(null)
api_type
"completions"
id
"gen-1777045510-59pG337OkwZb7a1u1paC"
upstream_id
"019dc02aa034c608083ec70aaf9820f2"
total_cost
0.0008572
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
403
0.08%
Dialogue to Total Word Ratio
Ratio: 81.98%, Deviation: 11.98%
neededClean
false
wordsTotal
405
wordsDialogue
332
0.0385%