NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8FPW2XZ93N2X3V999S88R
Run Details
Model:
Qwen3 235B A22B Instruct 2507
Test:
Language Comprehension
Scenario:
Asking for directions (German)
Duration:
7.1s
Cost:
$0.000314
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
Fail
Matches Regex
(details)
Model input
User/Human
I need your help. I'm in a new city and I need to find the nearest grocery store. I asked a local and they said: Die Straße runter, da ist eine große Kreuzung. Dann an der Littfasssäule links abbiegen. Rechts sollte diese alte Kirche sein, quer über den Markt. Der Supermarkt ist direkt dahinter. What is next to the grocery store? Return the result as your last paragraph, like this: LOCATION: [answer]
Model response
Messages
Meta
AI
Let's analyze the directions step by step: 1. "Die Straße runter, da ist eine große Kreuzung." → Go down the street to a large intersection. 2. "Dann an der Littfasssäule links abbiegen." → Then turn left at the Littfass column (a traditional German advertising column often found in cities). 3. "Rechts sollte diese alte Kirche sein, quer über den Markt." → On the right should be an old church, across the market. 4. "Der Supermarkt ist direkt dahinter." → The supermarket is directly behind (it). The key phrase is "direkt dahinter" — directly behind. What is the supermarket behind? The old church, which is across the market. So the supermarket is located directly behind the old church, relative to the direction of travel. Therefore, next to the grocery store would likely be the church and the market area. But since the supermarket is directly behind the church, and "dahinter" implies proximity and positioning, the most immediate notable feature next to or near the supermarket is the market. Thus, the grocery store is directly behind the old church, across from the market — but the market is "quer über" (directly across from) the church. The supermarket being behind the church likely places it adjacent to or near the market. Hence, the most precise answer is that the supermarket is near or behind the church, and the market is on the same square. "Direkt dahinter" suggests the supermarket is just behind the church, so the church and the market are the closest landmarks. But the question is: What is next to the grocery store? Given the supermarket is behind the church and near the market, the market is "quer über" the church — meaning if the church faces the market, then behind the church would be on the opposite side from the market. Wait — let’s reconsider spatially: - You’re approaching the church, which is across the market (so the market is in front of the church). - Behind the church is the supermarket. - So the supermarket is behind the church, away from the market. Thus, the supermarket is directly behind the church. Therefore, the church is right in front of (or adjacent to) the supermarket. So, what is next to the grocery store? The old church. LOCATION: die alte Kirche
created_at
"2026-03-25T10:26:16.583Z"
3/25/2026, 10:26:16 AM
model
"qwen/qwen3-235b-a22b-07-25"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
445
moderation_latency
(null)
generation_time
7063
tokens_prompt
112
tokens_completion
603
native_tokens_prompt
113
native_tokens_completion
497
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000314099
router
(null)
provider_responses
0
endpoint_id
"813ce28a-1334-47e3-881e-a7f8833b34b4"
id
"chatcmpl-cb190028-76c2-9b8f-ab80-a28a66975ced"
is_byok
false
latency
445
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434376-UB9BZyN8fr9XUx2lYs0V"
upstream_id
"chatcmpl-cb190028-76c2-9b8f-ab80-a28a66975ced"
total_cost
0.000314099
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
Fail
Matches Regex
/^Location:(.+)church(.*)$/im
n/a