Run: 2024-09-20-01J881Q7H1TD1AW6SK86FC502K

Run Details

Model:: Qwen 2.5 72B
Test:: Data extraction
Scenario:: Guess the pet
Duration:: 418ms
Cost:: $0.000027
Provider:: Hyperbolic

Evaluation summary

Score	Evaluator	Details Link
Pass	Matches text	(details)
Pass	Matches Regex	(details)
no eval	Overall

Model input

User/Human

What pet does Emma own based on the following text? Only return the pet type, nothing else.
John has a dog. Sarah owns a cat. Emma's pet purrs and has whiskers. Peter has a hamster.

Model response

cat

Evaluation details

Result	Evaluator	Details	Meta Data
Pass	Matches text	Case insensitive: cat	n/a
Pass	Matches Regex	/\bcat\b/i	n/a
no eval

id	"gen-dS7XNsTwD9lH8tK1AW0oOsY4A4Ez"
total_cost	0.0000272
upstream_id	"chat-447e0cab50224ee991bc80c72f2562ad"
created_at	"2024-09-20T15:44:59.241396+00:00" 9/20/2024, 3:44:59 PM
model	"qwen/qwen-2.5-72b-instruct"
app_id	182717
streamed	true
cancelled	false
provider_name	"Hyperbolic"
latency	271
moderation_latency	(null)
generation_time	20
finish_reason	"stop"
tokens_prompt	63
tokens_completion	1
native_tokens_prompt	65
native_tokens_completion	3
native_tokens_reasoning	(null)
num_media_prompt	(null)
num_media_completion	(null)
origin	"https://ncbench.com/"
usage	0.0000272