NC Bench
Overview
About
Tests
Models
Model
Llama 3.1 405B
Test
Language Comprehension
Scenario
Asking for directions (German)
Run ID
2024-08-09-01J4VYWQACQS7NCMA7FC7KHBNW
Model input
User/Human
I need your help. I'm in a new city and I need to find the nearest grocery store. I asked a local and they said: Die Straße runter, da ist eine große Kreuzung. Dann an der Littfasssäule links abbiegen. Rechts sollte diese alte Kirche sein, quer über den Markt. Der Supermarkt ist direkt dahinter. What is next to the grocery store? Return the result as your last paragraph, like this: LOCATION: [answer]
Model response
Messages
Meta
AI
It seems like the local gave you directions to the grocery store in German. Let's break it down: * "Die Straße runter" means "down the street" * "da ist eine große Kreuzung" means "there is a big intersection" * "Dann an der Littfasssäule links abbiegen" means "then turn left at the advertising column" (a Littfasssäule is a type of advertising column) * "Rechts sollte diese alte Kirche sein" means "on the right should be this old church" * "quer über den Markt" means "across the market" * "Der Supermarkt ist direkt dahinter" means "the supermarket is directly behind it" So, if we follow the directions, the grocery store should be behind the old church and the market. LOCATION: the old church and the market.
id
"gen-SEO6zfR0xhUFidZ394AfZ12wVGnZ"
total_cost
0.00081675
upstream_id
"cmpl-2104205103514b5faa493c46edd53ce3"
created_at
"2024-08-09T16:17:49.608643+00:00"
Friday, August 9th, 2024 at 4:17:49 PM GMT+00:00
model
"meta-llama/llama-3.1-405b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Novita"
latency
15183
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
146
tokens_completion
182
native_tokens_prompt
115
native_tokens_completion
182
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00081675
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Matches Regex
/^Location:(.+)church(.*)$/im
n/a