Model

WizardLM 2 8x22b

Test

Data extraction

Scenario

Who's the sister?

Run ID

2024-07-12-01J2KK2BE7T86V8WRY4F4ANJ4G

Model input

User/Human
Who is the sister of Sandie based on the following text? Only return the name, nothing else. Sandie has a doll named Lulu, she carries it around all day. She says she's her little sister, but her actual sister's name is Cassie.

Model response

AI
Cassie is the name of Sandie's actual sister. Therefore, the answer is Cassie.
id"gen-pyMkU8EM37y3ZKJWUUnfGFLrd3N7" 
total_cost0.00012138
upstream_id"cmpl-f7cdc1c9d2d24d309951fceaf8e06314" 
created_at"2024-07-12T13:45:39.923834+00:00" Friday, July 12th, 2024 at 1:45:39 PM GMT+00:00
model"microsoft/wizardlm-2-8x22b" 
app_id182717
streamedfalse
cancelledfalse
provider_name"OctoAI" 
latency642
moderation_latency(null)
generation_time0
finish_reason"stop" 
tokens_prompt89
tokens_completion21
native_tokens_prompt98
native_tokens_completion21
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/" 
usage0.00012138

Evaluation details

Result Evaluator Details Meta Data
Fail Matches text
Case sensitive: Cassie
n/a
Pass Matches Regex
/\bCassie\b/
n/a
50.0000%