NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-31-01KSYXM8W80KVH3G8MZFZCAC25
Run Details
Model:
Claude Opus 4.8 (Reasoning, High)
Test:
Language Comprehension
Scenario:
Asking for directions (German)
Duration:
6.6s
Cost:
$0.013105
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
Pass
Matches Regex
(details)
Model input
User/Human
I need your help. I'm in a new city and I need to find the nearest grocery store. I asked a local and they said: Die Straße runter, da ist eine große Kreuzung. Dann an der Littfasssäule links abbiegen. Rechts sollte diese alte Kirche sein, quer über den Markt. Der Supermarkt ist direkt dahinter. What is next to the grocery store? Return the result as your last paragraph, like this: LOCATION: [answer]
Model response
Messages
Meta
AI
Let me translate the directions for you: 1. **"Die Straße runter, da ist eine große Kreuzung."** — Go down the street, where there's a big intersection. 2. **"Dann an der Littfasssäule links abbiegen."** — Then turn left at the Litfaßsäule (a cylindrical advertising/poster column common in German cities). 3. **"Rechts sollte diese alte Kirche sein, quer über den Markt."** — On your right should be the old church, across the marketplace. 4. **"Der Supermarkt ist direkt dahinter."** — The supermarket is directly behind it (behind the church). So, to summarize the path: head down the street to the big intersection, turn left at the advertising column, then look for the old church on your right across the market square. The grocery store is right behind that church. LOCATION: The old church (the supermarket is directly behind it)
created_at
"2026-05-31T11:45:08.499Z"
5/31/2026, 11:45:08 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3333
moderation_latency
(null)
generation_time
6527
tokens_prompt
112
tokens_completion
279
native_tokens_prompt
166
native_tokens_completion
491
native_tokens_completion_images
(null)
native_tokens_reasoning
54
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.013105
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_0111aUFjeASm2oCAchkUYkTh"
is_byok
false
latency
1201
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780227908-ZwCJ0HixU1yWLErEhmTy"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780227908-LI2PAHkNeAFOnuJkIf1r"
upstream_id
"msg_0111aUFjeASm2oCAchkUYkTh"
total_cost
0.013105
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Matches Regex
/^Location:(.+)church(.*)$/im
n/a