NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ02TEQZEG2NDDA4ZT4W9RH4
Run Details
Model:
Z.AI GLM 4.5 Air
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
15.0s
Cost:
$0.000912
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sun beat down on the brightly colored plastic bouncy castle, a cheerful cacophony of shrieks and pop music emanating from the suburban backyard. Molly squinted through a pair of oversized sunglasses, her finger tracing the blueprint of the house tacked onto a clipboard balanced on her knee. "Alright, Bill," she murmured, "intel confirms the loot is concentrated in the back conservatory. High-value targets, minimal foot traffic once the piñata is cracked." Bill, currently attempting to balance a half-eaten chocolate éclair on his nose, nearly toppled over. "Minimal? Molly, those ankle-biters sound like a swarm of angry bees! What if one spots us? They’ve got eyes like hawks. And sticky fingers. Worse than sticky fingers." "They'll be distracted," Molly said, shooing the éclair towards her own mouth. "By the magician. Who, incidentally, is terrible. Saw him try to make a dove vanish yesterday. Only managed to make a child cry." She licked chocolate off her thumb. "So, approach: we shimmy down the trellis near the garage – you remember, the wobbly one?" Bill winced, patting his stomach protectively. "The one that groans like an old man with indigestion? Not my favorite." "Compromise," Molly declared, pulling a pair of suction-cup grappling hooks from her oversized tote bag. "We use these. Quiet, efficient. Over the garage roof, drop down onto the conservatory flat roof. Low risk, high reward." Bill eyed the hooks dubiously. "What if the suction fails? I’m not exactly built for graceful landings. I resemble a sack of potatoes dropping off a truck." "Then practice landing like a sack of potatoes, Bill," Molly sighed, already peering through the gap in the hedge. "Focus. The presents are wrapped in shiny paper. Easy to spot, easy to grab. We fill the duffels, ghost back out, and vanish before the birthday kid blows out the candles. Simple." "Simple," Bill echoed, looking distinctly unconvinced. "Except for the part where we’re stealing from a kids’ party. Stealing joy, Molly. Is that on our conscience?" Molly paused, a rare flicker of something almost resembling guilt crossing her face. "Only the expensive, imported joy. The plastic tat can stay. Consider it... redistribution. We’re giving those presents a better home. A quieter home. Where they won’t be covered in cake crumbs and indiscriminate slobber." She adjusted her sunglasses, peering determinedly at the oblivious children chasing bubbles. "Alright, time to earn our keep. Bill, spotter duty. Keep an eye out for Mrs. Henderson – she has a stare that could curdle milk." Bill puffed out his chest slightly. "Got it. No milk curdling today, Molly. Operation: Pinata Purge is a go!" He fumbled with the suction cups. "Though maybe we should reconsider the trellis... it *does* groan so dreadfully..."
created_at
"2026-04-24T15:48:02.963Z"
4/24/2026, 3:48:02 PM
model
"z-ai/glm-4.5-air"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
468
moderation_latency
(null)
generation_time
14718
tokens_prompt
77
tokens_completion
1236
native_tokens_prompt
66
native_tokens_completion
1064
native_tokens_completion_images
(null)
native_tokens_reasoning
513
native_tokens_cached
7
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000912245
router
(null)
provider_responses
0
endpoint_id
"90fc9bb3-01bb-419f-bb58-dea60a80d201"
id
"58f40375c0e72a52bf1c32d090b362bb"
is_byok
false
latency
468
model_permaslug
"z-ai/glm-4.5-air"
provider_name
"Novita"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777045682-3NayH6s3xPWU0n1OXo6i"
session_id
(null)
api_type
"completions"
id
"gen-1777045682-FeOKvOusgGKWht7GuVdN"
upstream_id
"58f40375c0e72a52bf1c32d090b362bb"
total_cost
0.000912245
cache_discount
7.35e-7
upstream_inference_cost
0
provider_name
"Novita"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
448
0.00%
Dialogue to Total Word Ratio
Ratio: 64.82%, Deviation: 34.82%
neededClean
false
wordsTotal
452
wordsDialogue
293
0.0000%