NC Bench
Overview
About
Tests
Models
Model
Claude 2.1
Test
Language Comprehension
Scenario
Friend got new kittens (Tagalog)
Run ID
2024-08-09-01J4VYWG4573Z41J0H409VQYVA
Model input
User/Human
I need your help. I want to congratulate my friend on her new babies, but how many are there? Here's what she said: Ang pusa ko ay kakapanganak lang nang limang kuting.
Model response
Messages
Meta
AI
Unfortunately, I do not speak that language so I cannot determine exactly how many kittens her cat had based on that sentence. It would be helpful if you could provide a translation or explain what language it is so I can assist further. Please let me know if you can provide any additional context.
id
"gen-lgBOD0OWyi7BbQM1D75DSlIJClx4"
total_cost
0.002024
upstream_id
"msg_017hA8WJvwcrQWuwoJfF4uNz"
created_at
"2024-08-09T16:17:29.989411+00:00"
Friday, August 9th, 2024 at 4:17:29 PM GMT+00:00
model
"anthropic/claude-2.1:beta"
app_id
182717
streamed
true
cancelled
false
provider_name
"Anthropic"
latency
367
moderation_latency
(null)
generation_time
2450
finish_reason
"end_turn"
tokens_prompt
52
tokens_completion
58
native_tokens_prompt
58
native_tokens_completion
65
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.002024
Evaluation details
Result
Evaluator
Details
Meta Data
Fail
Contains a count of nouns
5 of: kitten, kittens
n/a