NC Bench
Model
Z.AI GLM 4.7
Test
Language Comprehension
Scenario
Friend got new kittens (German)
Run ID
2026-02-08-01KGYYPA35QYGHRVGNF56RCECJ
Model input
User/Human
I need your help. I want to congratulate my friend on her new babies, but how many are there? Here's what she said: Omg!! hast du mittbekommen was Lisa auf Insta geteilt hat? Ihre Tatze hat 3 Junge bekommen - die kleinen sehen sooo süß aus. Ich will auch eine Katze :(((
Model response
Messages
Meta
AI
There are **3** babies, but be careful: they are **kittens**, not human children! Here is the translation of the relevant sentence: *"Ihre Tatze hat 3 Junge bekommen"* literally means "Her [female] cat has gotten 3 young ones." **Context:** Your friend is talking about a cat that belongs to someone named **Lisa** ("has du mittbekommen was Lisa auf Insta geteilt hat?"). In fact, your friend ends the message by saying, "I want a cat too :(((," which implies she doesn't have one (or the kittens) herself. You should congratulate her on the cute news about the kittens, but definitely don't congratulate her on becoming a mother
created_at
"2026-02-08T15:43:26.718614+00:00"
Sunday, February 8th, 2026 at 3:43:26 PM GMT+00:00
model
"z-ai/glm-4.7-20251222"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
458
moderation_latency
(null)
generation_time
58751
tokens_prompt
68
tokens_completion
1270
native_tokens_prompt
78
native_tokens_completion
1303
native_tokens_completion_images
(null)
native_tokens_reasoning
1124
native_tokens_cached
32
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00230121
router
(null)
provider_responses
0
id
"chatcmpl-RWzYhiDxC1OaNXblCZxIe1HN"
status
200
is_byok
false
latency
54
endpoint_id
"ab79ffa8-8dcc-41c4-9a9f-6cc408135243"
provider_name
"DeepInfra"
model_permaslug
"z-ai/glm-4.7-20251222"
api_type
"completions"
id
"gen-1770565347-jlQQplg1YvKHuxTDnJKD"
upstream_id
"chatcmpl-RWzYhiDxC1OaNXblCZxIe1HN"
total_cost
0.00230121
cache_discount
0.00001024
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Fail
Evaluator
Contains a count of nouns
Details
3 of: kitten, kittens
Meta Data
n/a
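The evaluator's implementation is not shown on this page, so the following is only a sketch of one plausible reading of "Contains a count of nouns: 3 of: kitten, kittens": the response must state the count directly attached to one of the listed nouns (for example "3 kittens"), rather than merely mention the number and the noun somewhere in the text. The function name `contains_count_of_nouns`, the `max_gap_words` parameter, and the tolerance for Markdown emphasis characters are all assumptions made for illustration, not NC Bench's actual API.

```python
import re

# Hypothetical helper: spelled-out forms for small counts (illustrative only).
NUMBER_WORDS = {1: "one", 2: "two", 3: "three", 4: "four", 5: "five"}


def contains_count_of_nouns(text, count, nouns, max_gap_words=2):
    """Return True if `count` appears directly attached to one of `nouns`.

    Up to `max_gap_words` plain words (e.g. adjectives) may sit between the
    count and the noun. Whitespace and Markdown emphasis characters (* and _)
    are treated as separators; any other punctuation breaks the phrase.
    """
    forms = [str(count)]
    if count in NUMBER_WORDS:
        forms.append(NUMBER_WORDS[count])
    count_alt = "|".join(re.escape(f) for f in forms)
    # Longest noun first so "kittens" is tried before "kitten".
    noun_alt = "|".join(
        re.escape(n) for n in sorted(nouns, key=len, reverse=True)
    )
    sep = r"[\s*_]"
    word = r"[^\W\d_]+(?:-[^\W\d_]+)*"  # letters, optionally hyphenated
    pattern = re.compile(
        rf"\b(?:{count_alt})\b{sep}*(?:{word}{sep}+){{0,{max_gap_words}}}"
        rf"(?:{noun_alt})\b",
        re.IGNORECASE,
    )
    return pattern.search(text) is not None


if __name__ == "__main__":
    opening = "There are **3** babies, but be careful: they are **kittens**"
    print(contains_count_of_nouns(opening, 3, ["kitten", "kittens"]))
    print(contains_count_of_nouns("Her cat had 3 kittens.", 3, ["kitten", "kittens"]))
```

Under this reading, the opening of the recorded response ("There are **3** babies, but be careful: they are **kittens**") does not match, because the count is attached to "babies" and the noun appears in a separate clause, whereas a phrase such as "3 kittens" would match. That is consistent with the recorded Fail, but the real evaluator may apply a different rule.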