Run: 2024-10-01-01J94RJCNKSB7S92BS2B95FKP6

Run Details

Duration:
19.1s
Cost:
$0.000138
Provider:
Infermatic

Evaluation summary

Score Evaluator
Details Link
50.00% Parse dialogue (details)

Model input

System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."

Model response

AI
This is the transcription of the voice sheet: Selected Text: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "atorio uto utriaiutyuiuyui f^%$*I`%$* to &^%*$&%123" Hunter: "I'm sorry, what? I don't understand you. Do you speak English?"
id"gen-1727810581-brAhYyAjJQ4zhUmoCjfh"
total_cost0.00013775
upstream_id"chatcmpl-753f95d4-965e-4487-bda8-c44df1513bc5"
created_at"2024-10-01T19:23:20.376979+00:00" 10/1/2024, 7:23:20 PM
model"thedrummer/rocinante-12b"
app_id182717
streamedtrue
cancelledfalse
provider_name"Infermatic"
latency6683
moderation_latency(null)
generation_time12230
finish_reason"stop"
tokens_prompt272
tokens_completion134
native_tokens_prompt279
native_tokens_completion136
native_tokens_reasoning(null)
num_media_prompt(null)
num_media_completion(null)
origin"https://ncbench.com/"
usage0.00013775

Evaluation details

Result Evaluator Details Meta Data
50.00% Parse dialogue n/a
dialogue
0
name"Selvi"
dialogue"atorio uto utriaiutyuiuyui f^%$I%$ to &^%$&%123"
detectedLang"hr"
heavyLang"it"
scores
hr0.5585874799357945
no0.5424292845257903
en0.5416666666666666
da0.5338983050847458
it0.5027124773960217
sk0.494485294117647
pl0.4869402985074627
sv0.46911196911196906
tl0.46601941747572817
cs0.461839530332681
yo0.44889779559118237
sq0.42827442827442824
lv0.40217391304347827
lt0.3778280542986425
es0.3648960739030023
tr0.3405275779376499
ca0.3341404358353511
az0.3176178660049628
is0.3176178660049628
sl0.3055555555555556
ro0.24657534246575344
be0.20289855072463767
hu0.19590643274853803
de0.19117647058823528
fi0.19117647058823528
fr0.19117647058823528
et0.1863905325443787
uk0.1863905325443787
vi0.1863905325443787
reliablefalse
passesfalse
1
name"Hunter"
dialogue"I'm sorry, what? I don't understand you. Do you speak English?"
detectedLang"en"
heavyLang"en"
scores
en0.8045920859794822
no0.5670995670995671
da0.5614035087719298
tl0.5139732685297691
sv0.51278928136419
cs0.5
pl0.464524765729585
hr0.4528043775649795
sk0.4227994227994228
de0.39849624060150374
pt0.3902439024390244
sq0.38556067588325654
ro0.3710691823899371
it0.34959349593495936
nl0.3079584775086505
vi0.30313588850174217
yo0.2920353982300885
ca0.25650557620817843
is0.25512104283054005
sl0.23076923076923078
ms0.15254237288135594
be0.14893617021276595
tr0.14893617021276595
hu0.14346895074946467
az0.13606911447084233
et0.13606911447084233
lt0.13606911447084233
lv0.13606911447084233
uk0.13606911447084233
reliabletrue
passestrue