"2024-08-09T17:17:16.346234+00:00" (Friday, August 9th, 2024 at 5:17:16 PM GMT+00:00)
model
"meta-llama/llama-3.1-70b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
22277
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
324
tokens_completion
730
native_tokens_prompt
276
native_tokens_completion
732
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0009072
Evaluation details
Result
100.0000%
Evaluator
Parse dialogue
Details
n/a
Meta Data
dialogue
0
name
"Hunter"
dialogue
"I'm investigating a series of strange occurrences in town. I understand you're a student here on a study abroad program. Can you tell me if you've noticed anything unusual?"
languages
eng
1
ita
0.7561760375197688
passes
true
1
name
"Isabella"
dialogue
"Era un uomo, era qui fuori dal caffè... e poi è sparito."
languages
ita
1
eng
0.6483596157331883
passes
true
2
name
"Hunter"
dialogue
"I'm sorry, miss? Could you repeat that?"
languages
eng
1
ita
0.6701363962670495
passes
true
3
name
"Isabella"
dialogue
"L'uomo era qui... è stato rapito, credo. Era troppo veloce, non sono riuscita a scattare una foto."
languages
ita
1
eng
0.7669165316045381
passes
true
4
name
"Hunter"
dialogue
"Look, I have another call I need to take. Can I get your contact information, miss? I'd like to talk to you more about this later."
languages
eng
1
ita
0.7399683391837015
passes
true
5
name
"Isabella"
dialogue
"Sì, ho visto qualcosa. Era un uomo, era qui fuori dal caffè... e poi è sparito. L'uomo era qui... è stato rapito, credo. Era troppo veloce, non sono riuscita a scattare una foto."
languages
ita
1
eng
0.8124229838893706
passes
true
6
name
"Hunter"
dialogue
"I'm sorry, miss? Could you repeat that? Miss, I think there may be a misunderstanding. I don't speak Italian. Can you try to tell me what you saw in English? Look, I have another call I need to take. Can I get your contact information, miss? I'd like to talk to you more about this later. I'll have someone translate what you told me. We'll talk soon, okay?"